To address issues such as long learning time, poor exploration ability, over-reliance on prior environmental information, and lack of real-time response in obstacle avoidance for inspection robots under conventional p...
详细信息
The demand for massages is rising daily in the modern day due to population aging. The creation of an intelligent physicaltherapy massage robot is currently a pressing task due to the severe scarcity of qualified mass...
详细信息
robotic behavior synthesis, the problem of understanding multimodal inputs and generating precise physical control for robots, is an important part of Embodied AI. Despite successes in applying multimodal large langua...
详细信息
robotic behavior synthesis, the problem of understanding multimodal inputs and generating precise physical control for robots, is an important part of Embodied AI. Despite successes in applying multimodal large language models for high-level understanding, it remains challenging to translate these conceptual understandings into detailed robotic actions while achieving generalization across various scenarios. In this paper, we propose a tree-structured multimodal code generation framework for generalized robotic behavior synthesis, termed RoboCodeX. RoboCodeX decomposes high-level human instructions into multiple object-centric manipulation units consisting of physical preferences such as affordance and safety constraints, and applies code generation to introduce generalization ability across various robotics platforms. To further enhance the capability to map conceptual and perceptual understanding into control commands, a specialized multimodal reasoning dataset is collected for pre-training and an iterative refining methodology is introduced for supervised fine-tuning. Extensive experiments demonstrate that RoboCodeX achieves state-of-the-art performance in both simulators and real robots on four different kinds of manipulation tasks and one embodied navigation task. More demos and information can be found in our homepage. Copyright 2024 by the author(s)
An integrated optimal trajectory planning approach is suggested to increase the shock resilience and operational efficiency of industrial robots. First, the refractive reverse learning mechanism, Lévy flights, an...
详细信息
Obstacle avoidance in dense obstacle environments is difficult to be both efficient and effective owing to the characterics of the nonlinear optimization problems. This article proposed an Efficient Collision-Free Dua...
详细信息
robotics surgery is fast becoming ubiquitous in medical practices from cancer treatment to prostrate treatment to neurological problems. One of the major problem associated with this invasive surgery is how to minimiz...
详细信息
In 1957, Morton Heilig invented the Sensorama, the predecessor of augmented reality (AR), which offered 3D images, sound, and sensory effects. Ivan Sutherland created the first AR system in 1966, while Myron Krueger i...
详细信息
Learning from natural language is a programming-free and user friendly teaching method that allows users without programming knowledge or demonstration capabilities to instruct robots, which has great value in industr...
详细信息
Autonomous point-to-point robots employ path-finding algorithms to traverse from the start to the goal point by avoiding obstacles. One such prominent and efficient algorithm is A*. It uses the best first technique to...
详细信息
The planning algorithms that autonomous systems use sometimes fail to generate effective solutions, which prevents mission success. Metareasoning, which is "thinking about thinking," is one way to mitigate p...
详细信息
暂无评论