Reward engineering has long been a challenge in Reinforcement Learning (RL) research, as it often requires extensive human effort and iterative processes of trial-and-error to design effective reward functions. In thi...
详细信息
This study unveils the In-Context Evolutionary Search (ICE-SEARCH) method, which is among the first works that melds large language models (LLMs) with evolutionary algorithms for feature selection (FS) tasks and demon...
详细信息
In this paper, we propose a system that enables visualization under the situation of scattering media such as fog and smoke by Peplography which is scattering media removal system, on a small GPU machine. Compared to ...
详细信息
Motion capture has become increasingly important, not only in computer animation but also in emerging fields like the virtual reality, bioinformatics, and humanoid training. Capturing outdoor environments offers exten...
详细信息
This paper introduces DiffTORI, which utilizes Differentiable Trajectory Optimization as the policy representation to generate actions for deep Reinforcement and Imitation learning. Trajectory optimization is a powerf...
ISBN:
(纸本)9798331314385
This paper introduces DiffTORI, which utilizes Differentiable Trajectory Optimization as the policy representation to generate actions for deep Reinforcement and Imitation learning. Trajectory optimization is a powerful and widely used algorithm in control, parameterized by a cost and a dynamics function. The key to our approach is to leverage the recent progress in differentiable trajectory optimization, which enables computing the gradients of the loss with respect to the parameters of trajectory optimization. As a result, the cost and dynamics functions of trajectory optimization can be learned end-to-end. DiffTORI addresses the "objective mismatch" issue of prior model-based RL algorithms, as the dynamics model in DiffTORI is learned to directly maximize task performance by differentiating the policy gradient loss through the trajectory optimization process. We further benchmark DiffTORI for imitation learning on standard robotic manipulation task suites with high-dimensional sensory observations and compare our method to feed-forward policy classes as well as Energy-Based Models (EBM) and Diffusion. Across 15 model-based RL tasks and 35 imitation learning tasks with high-dimensional image and point cloud inputs, DiffTORI outperforms prior state-of-the-art methods in both domains.
Haptic feedback to the surgeon during robotic surgery would enable safer and more immersive surgeries but estimating tissue interaction forces at the tips of robotically controlled surgical instruments has proven chal...
详细信息
ISBN:
(数字)9798350377705
ISBN:
(纸本)9798350377712
Haptic feedback to the surgeon during robotic surgery would enable safer and more immersive surgeries but estimating tissue interaction forces at the tips of robotically controlled surgical instruments has proven challenging. Few existing surgical robots can measure interaction forces directly and the additional sensor may limit the life of instruments. We present a hybrid model and learning-based framework for force estimation for the Patient Side Manipulators (PSM) of a da Vinci Research Kit (dVRK). The model-based component identifies the dynamic parameters of the robot and estimates free-space joint torque, while the learning-based component compensates for environmental factors, such as the additional torque caused by trocar interaction between the PSM instrument and the patient’s body wall. We evaluate our method in an abdominal phantom and achieve an error in force estimation of under 10% normalized root-mean-squared error. We show that by using a model-based method to perform dynamics identification, we reduce reliance on the training data covering the entire workspace. Although originally developed for the dVRK, the proposed method is a generalizable framework for other compliant surgical robots. The code is available at https://***/vu-maple-lab/dvrk_force_estimation.
Sensors are widely used to acquire biological and environmental information for medical diagnosis,and health and environmental *** is a promising new sensor material that has been widely used in sensor fabrication in ...
详细信息
Sensors are widely used to acquire biological and environmental information for medical diagnosis,and health and environmental *** is a promising new sensor material that has been widely used in sensor fabrication in recent *** with many other existing graphene preparation methods,laser-scribed graphene(LSG)is simple,low-cost,environmentally friendly,and has good conductivity and high thermal stability,making it widely used in the sensor *** paper summarizes existing LSG methods for sensor *** LSG preparation methods and their variants are introduced first,followed by a summary of LSG modification methods designed explicitly for sensor ***,the applications of LSG in stress,bio,gas,temperature,and humidity sensors are summarized with a particular focus on multifunctional integrated ***,the current challenges and prospects of LSG-based sensors are discussed.
Currently, Digital Holographic Microscopy(DHM) is used for researching disease diagnosis or microbes. It cannot obtain the correct three-dimensional (3D) profile for a little noise because biological cells are microsc...
详细信息
Motivated by the tremendous progress, we witnessed in recent years, this paper presents a survey of the scientific literature on the topic of Collaborative Simultaneous Localization and Mapping (C-SLAM), also known as...
详细信息
Motivated by the tremendous progress, we witnessed in recent years, this paper presents a survey of the scientific literature on the topic of Collaborative Simultaneous Localization and Mapping (C-SLAM), also known as multi-robot SLAM. With fleets of self-driving cars on the horizon and the rise of multi-robot systems in industrial applications, we believe that Collaborative SLAM will soon become a cornerstone of future robotic applications. In this survey, we introduce the basic concepts of C-SLAM and present a thorough literature review. We also outline the major challenges and limitations of C-SLAM in terms of robustness, communication, and resource management. We conclude by exploring the area's current trends and promising research avenues.
In this paper, we propose the depth mapping and occlusion removal method using integral imaging (InIm). InIm is a method to obtain 3D information by capturing and reconstructing an object from multiple viewpoints. InI...
详细信息
暂无评论