—Deep reinforcement learning (DRL) has made great achievements since proposed. Generally, DRL agents receive high-dimensional inputs at each step, and make actions according to deep-neural-network-based policies. Thi...
详细信息
the performance of deep learning largely depends on the size of data. One data source is real-time streaming data, produced from mobile devices, sensors or social media, etc. Streaming data is high-speed and large-sca...
详细信息
the performance of deep learning largely depends on the size of data. One data source is real-time streaming data, produced from mobile devices, sensors or social media, etc. Streaming data is high-speed and large-scale, which needs real-time processing. However, current mainstream frameworks are mainly designed for off-line data. To suit this, we first propose a deep learning framework based on Apache Storm, which is a distributed stream processing frame, fast and fault-tolerant. Our framework implements the distributed training of CNNs. which is different from MMLSpark or TensorFlowOnSpark that is a pure Java implementation. The design of message passing and synchronization is also suitable to other MapReduce-family distributed computing platforms. To validate our work, MNIST and Cifar -10 datasets are used for evaluation and comparison with similar architectures. The results show our framework, in resource-limited environment, realizes about 10 times speedup.
Soft robotics has several promising properties for aquatic applications, such as safe interaction with environments, lightweight, low cost, etc. In this paper, we proposed the kinematic modeling and hydrodynamics expe...
详细信息
Soft robotics has several promising properties for aquatic applications, such as safe interaction with environments, lightweight, low cost, etc. In this paper, we proposed the kinematic modeling and hydrodynamics experiments of a soft robotic arm with 3D locomotion capacity. We developed a mathematical model that incorporates the angle correction, as well as the open-loop model-based motion control. The model could precisely predict the three-dimensional (3D) movement, and the location error is less than 5.7 mm in different attitudes. Furthermore, we performed the hydrodynamic investigations and simultaneously measured the hydrodynamic forces and the wake flows at different amplitudes (50 mm, 100 mm, 150 mm, 200 mm) and frequencies (0.3 Hz, 0.4 Hz, 0.5 Hz) of the soft arm. Surprisingly, we found that the magnitudes of the hydrodynamic force (〈1 N) and the torques (〈0.08 N-m) of dynamically moving soft arm were tiny, which leads to negligible inertial effect for the underwater vehicle than those of the traditional rigid underwater manipulator. Finally, we demonstrated underwater picking and placing tasks of the soft manipulator by using a computer program that controls the tip attitude and velocity. This study may inspire future underwater manipulators that have properties of low-inertial, low power cost and can safely interact with the aauatic environments.
This paper presents a real-time, pixelwise method to generate grasp synthesis based on fully convolutional netural networks (FCN). Our proposed Attention Grasping Network (AGN) applies a novel attention mechanism to r...
This paper presents a real-time, pixelwise method to generate grasp synthesis based on fully convolutional netural networks (FCN). Our proposed Attention Grasping Network (AGN) applies a novel attention mechanism to robotic grasp detection, which automatically learns to focus on salient features of the input image. The model with attention mechnisms can compensate for the loss of detail information in standard FCN, which increases the sensitivity of the model and accuracy of prediction. In addition,in order to ensure a real-time grasp and save computing resources, the light-weight AGN model predicts the position and angle of grasping point. Our method only takes 22ms to execute the grasp detection pipeline on a GPU-equipped computer, and achieves 97.8% accuracy on Cornell Grasping Dataset.
In this paper, we present an overview of robotic peg-in-hole assembly and analyze two main strategies: Contact model-based and contact model-free strategies. More specifically, we first introduce the contact model con...
详细信息
Dear editor, Fundamentally, there are only two main approaches so far in artificial intelligence (AI): reasoning-oriented formal logic approach and function-oriented computational intelligence approach, so called N...
Dear editor, Fundamentally, there are only two main approaches so far in artificial intelligence (AI): reasoning-oriented formal logic approach and function-oriented computational intelligence approach, so called Neats vs. Scuruffies, which is a reflection of the historical fight between two schools of thought for formalism and empiricism respectively in the field of AI that is continuing even today.
This paper presents a grasping convolutional neural network with image segmentation for mobile manipulating robot. The proposed method is cascaded by a feature pyramid network FPN and a grasping network DrGNet. The FP...
This paper presents a grasping convolutional neural network with image segmentation for mobile manipulating robot. The proposed method is cascaded by a feature pyramid network FPN and a grasping network DrGNet. The FPN network combined with point cloud clustering is used to obtain the mask of the target object. Then, the grayscale map and the depth map corresponding to the target object are combined and sent to the DrGNet network for providing multi-scale images. On this basis, depthwise separable convolution is used for encoding. The results of encoders are refined according to the light-weight RefineNet as well as sSE, which can achieve a better grasp detection. The proposed method is verified by the experiments on mobile manipulating robot.
Object reconstruction is one of the most crucial branches of computer vision. With the development of deep learning, many tasks have achieved remarkable improvements in computer vision. 3D reconstruction with deep lea...
详细信息
ISBN:
(数字)9781728107707
ISBN:
(纸本)9781728107714
Object reconstruction is one of the most crucial branches of computer vision. With the development of deep learning, many tasks have achieved remarkable improvements in computer vision. 3D reconstruction with deep learning also has attracted much attention in recent years. Deep learning methods based on CNN-based and GAN-based architectures have been adopted for 3D object prediction. In addition, researchers utilize different inputs such as RGB and depth images to achieve prediction based on different problem. In this paper, we provide a detailed overview of recent advances in 3D object reconstruction. The reviewed approaches are categorized into three groups depending on the input modality: RGB-based, depth-based and other-input-based. Particularly, we introduce the various methods and indirectly classify the shape representation. As a survey, we discuss the strong and weak points of exciting approaches.
Recently, with the development of both 3D sensors and 3D virtual network that bring the needs of interaction with the real world, many 3D applications burst out. However, it is difficult to understanding these three-d...
详细信息
Recently, with the development of both 3D sensors and 3D virtual network that bring the needs of interaction with the real world, many 3D applications burst out. However, it is difficult to understanding these three-dimensional scenes with a fixed program. Then, a data-driven method is required to process these 3D data, which brings a strong demand of 3D Deep Learning in 3D data. Towards this goal, with an end-to-end deep learning, the experiment is based on PointNet++, a well proposed method for feature extraction. The experiment optimizes the network structure and parameters to improve the classification results. Finally, the network is applied to tooth model for classification and identification so that the dental model can be found from different perspectives.
3D human face modeling is one of the most popular research directions in the field of computer stereo vision. On the one hand, the complex physiological structure of human face as well as various expressions and attit...
详细信息
3D human face modeling is one of the most popular research directions in the field of computer stereo vision. On the one hand, the complex physiological structure of human face as well as various expressions and attitude changes bring great challenges to three-dimensional modeling, which makes the modeling method of human face have high research value and reference significance. On the other hand, 3D face reconstruction has a broad application prospect, and the application demand attracts more researchers to invest in it. Based on the 3DMM and 3DDFA method, this paper utilizes regression neural network to realize end-to-end face reconstruction. After that, we build a visualization program with the training model to realize face modeling based on a single face image of any pose or expression. It’s a 3D face reconstruction system which has functions such as face alignment, face rotation and so on.
暂无评论