In recent years, with the rapid development of artificial intelligence technology, intelligent robots have become a popular research object in academia and industry. An indoor autonomous mobile robot must face three m...
详细信息
Computing the distance from a point to a triangle mesh is a key computational step in robotics pipelines such as registration and collision detection, with applications to path planning, SLAM, and RGB-D vision. Numero...
详细信息
ISBN:
(纸本)9781665491907
Computing the distance from a point to a triangle mesh is a key computational step in robotics pipelines such as registration and collision detection, with applications to path planning, SLAM, and RGB-D vision. Numerous techniques to accelerate this computation have been developed, many of which use a cheap pre-processing step to construct a hierarchical decomposition of the mesh. If the mesh is fixed and known ahead of time, there is an opportunity to conduct more expensive pre-computations to accelerate the subsequent distance queries. This work presents a voxelization approach, implemented on both CPU and GPU, to compute point to mesh distance that constructs for each voxel a near-minimal set of triangles that is guaranteed to include every triangle that is closest to at least one point in the voxel. Theoretical and numerical comparisons with six alternative distance algorithms demonstrate the speed advantages of the proposed method.
A fuzzy controller was designed for a table tennis robot to strike a ball with topspin or backspin. First, the topspin or backspin was classified by comparing the difference between the binocular-vision-based measurem...
详细信息
This research aimed to develop an automated blister package sorting system based on defects detected using image data. The proposed approach aims to reduce the lead time incurred by manual sorting in the production en...
详细信息
A typical way of local tone mapping (TM) is based on multi-layer decomposition of the source image. For this, the source image is decomposed into a base layer and a detail layer to compress from high dynamic range (HD...
详细信息
ISBN:
(数字)9781665496209
ISBN:
(纸本)9781665496209
A typical way of local tone mapping (TM) is based on multi-layer decomposition of the source image. For this, the source image is decomposed into a base layer and a detail layer to compress from high dynamic range (HDR) to low dynamic range (LDR). Perceptual quantization (PQ) is a standardized non-linear transfer function for HDR content. It mimics the non-linearity of human vision by compressing more strongly in bright regions and less in dark areas, when converting luminance values to electrical signals. We propose a CNN-based pipeline for local TM, which operates on the low-frequency base layer of the luminance signal, while keeping the detail layer unchanged. The proposed method works entirely in the PQ domain and is adaptable to display peak luminance. The tone-mapped LDR images obtained with our learning-based approach show significant improvements in PSNR, while the network size is reduced compared to previous work. Our experiments on the HDR datasets from Fairchild and Funt show PSNR improvements of 8 dB compared to the state-of-the-art approaches.
Reinforcement learning (RL) algorithms can achieve state-of-the-art performance in decision-making and continuous control tasks. However, applying RL algorithms on safety-critical systems still needs to be well justif...
详细信息
ISBN:
(纸本)9781665482370
Reinforcement learning (RL) algorithms can achieve state-of-the-art performance in decision-making and continuous control tasks. However, applying RL algorithms on safety-critical systems still needs to be well justified due to the exploration nature of many RL algorithms, especially when the model of the robot and the environment are unknown. To address this challenge, we propose a data-driven safety layer that acts as a filter for unsafe actions. The safety layer uses a data-driven predictive controller to enforce safety guarantees for RL policies during training and after deployment. The RL agent proposes an action that is verified by computing the data-driven reachability analysis. If there is an intersection between the reachable set of the robot using the proposed action, we call the data-driven predictive controller to find the closest safe action to the proposed unsafe action. The safety layer penalizes the RL agent if the proposed action is unsafe and replaces it with the closest safe one. In the simulation, we show that our method outperforms state-of-the-art safe RL methods on the robotics navigation problem for a Turtlebot 3 in Gazebo and a quadrotor in Unreal Engine 4 (UE4).
Glass is ubiquitous in the real world, and its perception has many applications, including robot navigation and drone tracking. However, due to the transparent property of glass, the interior of a glass area can be an...
详细信息
The proceedings contain 77 papers. The topics discussed include: thermodynamic calculation and design for desulfurization wastewater flue gas evaporation zero discharge system of the 1,000 MW unit;simulation and analy...
ISBN:
(纸本)9781643684826
The proceedings contain 77 papers. The topics discussed include: thermodynamic calculation and design for desulfurization wastewater flue gas evaporation zero discharge system of the 1,000 MW unit;simulation and analysis of multi-signal middle-orbit spacecraft positioning performance;research on vibration reduction design of railway wagons bearing based on dynamics;the solution of the control problem in installations implementing complex rotational-oscillatory motion of working elements;key issues and countermeasures of machine vision for fruit and vegetable picking robot;estimation of resonance echo signal of Kalman filter algorithm based on PCA under pipe string sound field model;development of a drilling cycle program with decreasing depth of pecking drilling;design and implementation of smart phone assistant system for the elderly based on WeChat mini program;and experimental study of active vibration control for flexible truss by using a Stewart platform manipulator.
We present a vision-based grasping system for humanoid and prosthetic hands using hardware-accelerated CNNs for real-time object classification and class-aware pixel-wise segmentation. The system is implemented on the...
详细信息
ISBN:
(纸本)9798350303278
We present a vision-based grasping system for humanoid and prosthetic hands using hardware-accelerated CNNs for real-time object classification and class-aware pixel-wise segmentation. The system is implemented on the hand internal processing hardware of a humanoid hand using a System-on-Chip (SoC) comprising a Processor System (PS) and an FPGA. As a sensor system, the hand provides an integrated RGB camera, a multi-region Time-of-Flight (ToF) depth sensor, and an Inertial Measurement Unit (IMU). We propose an algorithm for 3D object shape estimation based on sensory information provided by the hand internal sensor system. The 3D object mesh in combination with the object relative pose of the hand is used as input for a reactive grasp controller. For the design of the CNN-based object recognition and segmentation networks, we use a resource-aware algorithm for Network-Architecture-Synthesis (NAS). We evaluate the visual perception accuracy and 3D model estimation accuracy in grasping experiments with six objects. We obtain a mean object segmentation accuracy of 84.4 % and a mean error for object diameter estimation of 44 mm.
Deep neural networks (DNNs) offer the highest performance in a wide range of applications in computer vision. These results rely on over-parameterized backbones, which are expensive to run. This computational burden c...
详细信息
ISBN:
(纸本)9781728198354
Deep neural networks (DNNs) offer the highest performance in a wide range of applications in computer vision. These results rely on over-parameterized backbones, which are expensive to run. This computational burden can be dramatically reduced by quantizing (in either data-free (DFQ), post-training (PTQ) or quantization-aware training (QAT) scenarios) floating point values to ternary values (2 bits, with each weight taking value in {-1, 0, 1}). In this context, we observe that rounding to nearest minimizes the expected error given a uniform distribution and thus does not account for the skewness and kurtosis of the weight distribution, which strongly affects ternary quantization performance. This raises the following question: shall one minimize the highest or average quantization error? To answer this, we design two operators: TQuant and MQuant that correspond to these respective minimization tasks. We show experimentally that our approach allows to significantly improve the performance of ternary quantization through a variety of scenarios in DFQ, PTQ and QAT and give strong insights to pave the way for future research in deep neural network quantization.
暂无评论