While promising results have been achieved in weakly-supervised semantic segmentation (WSSS), limited supervision from image-level tags inevitably induces discriminative reliance and spurious relations between target ...
详细信息
The intricate and multi-stage task in dynamic public spaces like luggage trolley collection in airports presents both a promising opportunity and an ongoing challenge for automated service robots. Previous research ha...
详细信息
The application of robots in social life, equipped with sensors and actuators and embedded with AI, assists people in all aspects. However, the first perspective of the robot horizon is heavily constrained, which weak...
The application of robots in social life, equipped with sensors and actuators and embedded with AI, assists people in all aspects. However, the first perspective of the robot horizon is heavily constrained, which weakens its performance. A joint tracking system is designed and built to deal with this, by integrating a surveillance system with the robot visual, providing a third perspective. This system takes one horizontal view and two top views from various directions as inputs and matches a person among the frames and in time sequence. In order to deal with the identity match with a huge visual feature gap, a special dataset is collected, simultaneously labeling identities from a mobile robot perspective and multiple indoor static surveillance monitors. The experiment shows that such match is a task worth exploring that can be better handled by training on our dataset than existing open source Re-identification (Re-id) datasets. Moreover, in the real scenario, this system improves the performance on issues like in and out of the robot’s field of vision and heavy occlusion by people or objects.
Video instance segmentation is one of the core problems in computervision. Formulating a purely learning-based method, which models the generic track management required to solve the video instance segmentation task,...
详细信息
In this work, the seasonal predictive capabilities of Neural Radiance Fields (NeRF) applied to satellite images are investigated. Focusing on the utilization of satellite data, the study explores how Sat-NeRF, a novel...
详细信息
This work reviews the results of the NTIRE 2023 Challenge on Image Shadow Removal. The described set of solutions were proposed for a novel dataset, which captures a wide range of object-light interactions. It consist...
详细信息
Dense map that contains the surrounding geometry and vision information of a robot is widely used for path planning, navigation, obstacle avoidance and other applications. Considering the performance of the processing...
详细信息
Recent studies have integrated convolution into transformers to introduce inductive bias and improve generalization performance. However, the static nature of conventional convolution prevents it from dynamically adap...
详细信息
Industrial radiography is a pivotal non-destructive testing (NDT) method that ensures quality and safety in a wide range of industrial sectors. Nevertheless, the conventional human-based approaches to carrying out ind...
详细信息
Existing state-of-the-art methods for surgical phase recognition either rely on the extraction of spatial-temporal features at a short-range temporal resolution or adopt the sequential extraction of the spatial and te...
详细信息
暂无评论