Graphs are ubiquitous in nature and can therefore serve as models for many practical but also theoretical problems. For this purpose, they can be defined as many different types which suitably reflect the individual c...
详细信息
Deep learning (DL) methods have shown promising results for solving ill-posed inverse problems such as MR image reconstruction from undersampled k-space data. However, these approaches currently have no guarantees for...
详细信息
Graph neural networks based on message-passing rely on the principle of neighborhood aggregation which has shown to work well for many graph tasks. In other cases these approaches appear insufficient, for example, whe...
Graph neural networks based on message-passing rely on the principle of neighborhood aggregation which has shown to work well for many graph tasks. In other cases these approaches appear insufficient, for example, when graphs are heterophilic. In such cases, it can help to modulate the aggregation method depending on the characteristic of the current neighborhood. Furthermore, when considering higher-order relations, heterophilic settings become even more important. In this work, we investigate a sparse version of message-passing that allows selective neighbor integration and aims for learning to identify most salient nodes that are then integrated over. In our approach, information on individual nodes is encoded by generating distinct walks. Because these walks follow distinct trajectories, the higher-order neighborhood grows only linearly which mitigates information bottlenecks. Overall, we aim to find the most salient substructures by deploying a learnable sampling strategy. We validate our method on commonly used graph benchmarks and show the effectiveness especially in heterophilic graphs. We finally discuss possible extensions to the framework.
In this paper, we propose a deep neural network that predicts the feasibility of a mixed-integer program from visual input for robot manipulation planning. Integrating learning into task and motion planning is challen...
详细信息
ISBN:
(数字)9781728173955
ISBN:
(纸本)9781728173962
In this paper, we propose a deep neural network that predicts the feasibility of a mixed-integer program from visual input for robot manipulation planning. Integrating learning into task and motion planning is challenging, since it is unclear how the scene and goals can be encoded as input to the learning algorithm in a way that enables to generalize over a variety of tasks in environments with changing numbers of objects and goals. To achieve this, we propose to encode the scene and the target object directly in the image *** experiments show that our proposed network generalizes to scenes with multiple objects, although during training only two objects are present at the same time. By using the learned network as a heuristic to guide the search over the discrete variables of the mixed-integer program, the number of optimization problems that have to be solved to find a feasible solution or to detect infeasibility can greatly be reduced.
Complex systems can be modelled at various levels of detail. Ideally, causal models of the same system should be consistent with one another in the sense that they agree in their predictions of the effects of interven...
Recent advances in Large Language Models (LLMs) have been instrumental in autonomous robot control and human-robot interaction by leveraging their vast general knowledge and capabilities to understand and reason acros...
详细信息
作者:
Cao, HelinBehnke, SvenAutonomous Intelligent Systems group
Computer Science Institute VI-Intelligent Systems and Robotics Center for Robotics and the Lamarr Institute for Machine Learning and Artificial Intelligence University of Bonn Germany
We introduce SLCF-Net, a novel approach for the Semantic Scene Completion (SSC) task that sequentially fuses LiDAR and camera data. It jointly estimates missing geometry and semantics in a scene from sequences of RGB ...
详细信息
Recent advances in Large Language Models (LLMs) have been instrumental in autonomous robot control and human-robot interaction by leveraging their vast general knowledge and capabilities to understand and reason acros...
详细信息
ISBN:
(数字)9798350373578
ISBN:
(纸本)9798350373585
Recent advances in Large Language Models (LLMs) have been instrumental in autonomous robot control and human-robot interaction by leveraging their vast general knowledge and capabilities to understand and reason across a wide range of tasks and scenarios. Previous works have investigated various prompt engineering techniques for improving the performance of LLMs to accomplish tasks, while others have proposed methods that utilize LLMs to plan and execute tasks based on the available functionalities of a given robot platform. In this work, we consider both lines of research by comparing prompt engineering techniques and combinations thereof within the application of high-level task planning and execution in service robotics. We define a diverse set of tasks and a simple set of functionalities in simulation, and measure task completion accuracy and execution time for several state-of-the-art models. We make our code, including all prompts, available at https://***/AIS-Bonn/Prompt_Engineering.
作者:
Alfred SchöttlUniversity of Applied Sciences Munich
Dept. of Electrical Engineering and Information Technology Institute for Applications of Machine Learning and Intelligent Systems (IAMLIS) Munich 80335 Germany
Deep classification networks play an important role as backbone networks in industrial AI applications. These applications are often cost or safety critical; explainability of the AI results is a highly demanded featu...
详细信息
Deep classification networks play an important role as backbone networks in industrial AI applications. These applications are often cost or safety critical; explainability of the AI results is a highly demanded feature. We introduce CAM fostering, a method to improve the explainability of classification nets based on local layers such as convolutional or pooling layers. Several CAM interpretability measures are defined and used as additional loss terms. Even though the method requires second-order derivatives, it is demonstrated that deep nets can be trained on large datasets without frozen parameters. The training parameters can be chosen such that the accuracy degradation remains decent in favor of the CAM interpretability improvement. We conclude by comparing the results of different training parameter configurations.
作者:
Cao, HelinBehnke, SvenThe Autonomous Intelligent Systems group
Computer Science Institute VI – Intelligent Systems and Robotics The Center for Robotics The Lamarr Institute for Machine Learning and Artificial Intelligence University of Bonn Germany
Perception systems play a crucial role in autonomous driving, incorporating multiple sensors and corresponding computer vision algorithms. 3D LiDAR sensors are widely used to capture sparse point clouds of the vehicle...
详细信息
暂无评论