-
Hassing posted an update 10 months, 3 weeks ago
Monocular image-based 3-D model retrieval aims to search for relevant 3-D models from a dataset given one RGB image captured in the real world, which can significantly benefit several applications, such as self-service checkout, online shopping, etc. To help advance this promising yet challenging research topic, we built a novel dataset and organized the first international contest for monocular image-based 3-D model retrieval. Moreover, we conduct a thorough analysis of the state-of-the-art methods. Existing methods can be classified into supervised and unsupervised methods. The supervised methods can be analyzed based on several important aspects, such as the strategies of domain adaptation, view fusion, loss function, and similarity measure. The unsupervised methods focus on solving this problem with unlabeled data and domain adaptation. 680C91 Seven popular metrics are employed to evaluate the performance, and accordingly, we provide a thorough analysis and guidance for future work. To the best of our knowledge, this is the first benchmark for monocular image-based 3-D model retrieval, which aims to help related research in multiview feature learning, domain adaptation, and information retrieval.Zero-shot learning (ZSL) is a pretty intriguing topic in the computer vision community since it handles novel instances and unseen categories. In a typical ZSL setting, there is a main visual space and an auxiliary semantic space. Most existing ZSL methods handle the problem by learning either a visual-to-semantic mapping or a semantic-to-visual mapping. In other words, they investigate a unilateral connection from one end to the other. However, the connection between the visual space and the semantic space are bilateral in reality, that is, the visual space depicts the semantic space; the semantic space, on the other hand, describes the visual space. In this article, therefore, we investigate the bilateral connections in ZSL and present a novel model, called Boomerang-GAN, by taking advantage of conditional generative adversarial networks (GANs). Specifically, we generate unseen visual samples from their category semantic embeddings by a conditional GAN. Different from the existing generative ZSL methods that only consider generating visual features from class descriptions, our method also considers that the generated visual features can be translated back to their corresponding semantic embeddings by introducing a multimodal cycle-consistent loss. Extensive experiments of both ZSL and generalized ZSL on five widely used datasets verify that our method is able to outperform previous state-of-the-art approaches in both recognition and segmentation tasks.In the complex practical engineering systems, many interferences and attacking signals are inevitable in industrial applications. This article investigates the reinforcement learning (RL)-based resilient control algorithm for a class of Markovion jump systems with completely unknown transition probability information. Based on the Takagi-Sugeno logical structure, the resilient control problem of the nonlinear Markovion systems is converted into solving a set of local dynamic games, where the control policy and attacking signal are considered as two rival players. Combining the potential learning and forecasting abilities, the new integral RL (IRL) algorithm is designed via system data to compute the zero-sum games without using the information of stationary transition probability. Besides, the matrices of system dynamics can also be partially unknown, and the new architecture requires less transmission and computation during the learning process. The stochastic stability of the system dynamics under the developed overall resilient control is guaranteed based on the Lyapunov theory. Finally, the designed IRL-based resilient control is applied to a typical multimode robot arm system, and implementing results demonstrate the practicality and effectiveness.The location information of road and lane lines is the supremely important thing for the automatic drive and auxiliary drive. The detection accuracy of these two elements dramatically affects the reliability and practicality of the whole system. In real applications, the traffic scene can be very complicated, which makes it particularly challenging to obtain the precise location of road and lane lines. Commonly used deep learning-based object detection models perform pretty well on the lane line and road detection tasks, but they still encounter false detection and missing detection frequently. Besides, existing convolution neural network (CNN) structures only pay attention to the information flow between layers, while it cannot fully utilize the spatial information inside the layers. To address those problems, we propose an attention-based spatial segmentation network for traffic scene understanding. We use the convolutional attention module to improve the network’s understanding capacity of spatial location distribution. Spatial CNN (SCNN) obtains through the information flow within one single convolutional layer and improves the spatial relationship modeling ability of the network. The experimental results demonstrate that this method effectively improves the neural network’s application ability of the spatial information, thereby improving the effect of traffic scene understanding. Furthermore, a pixel-level road segmentation dataset called NWPU Road Dataset is built to help improve the process of traffic scene understanding.In the field of modern industrial engineering, many mechanical systems are underactuated, exhibiting strong nonlinear characteristics and high flexibility. However, the lack of control inputs brings about many difficulties for controller design and stability/convergence analysis. Additionally, some unavoidable practical issues, e.g., plant uncertainties and actuator deadzones, make the control of underactuated systems even more challenging. Hence, with the aid of elaborately constructed finite-time convergent surfaces, this article provides the first solution to address the control problem for a class of multi-input-multi-output (MIMO) underactuated systems subject to plant uncertainties and actuator deadzones. Specifically, this article overcomes the main obstacle in sliding-mode surface analysis for MIMO underactuated systems, that is, by the presented analysis method, the asymptotic stability of the system equilibrium point is strictly proven based on the composite surfaces. In addition, the unknown parts of the actuated/unactuated dynamic equations and actuator deadzones can be simultaneously handled, which is important for real applications.