Memory and other mental processes are both severely disrupted by Alzheimer's disease (AD), a neurodegenerative condition. It interferes with almost every cognitive process that the brain is capable of. This ultima...
详细信息
The Boolean Satisfiability (SAT) problem stands out as an attractive NP-complete problem in theoretic computer science and plays a central role in a broad spectrum of computing-related applications. Exploiting and tun...
详细信息
Unsupervised person re-identification (Re-ID) aims to retrieve person images across cameras without any identity labels. Most clustering-based methods roughly divide image features into clusters and neglect the featur...
Unsupervised person re-identification (Re-ID) aims to retrieve person images across cameras without any identity labels. Most clustering-based methods roughly divide image features into clusters and neglect the feature distribution noise caused by domain shifts among different cameras, leading to inevitable performance degradation. To address this challenge, we propose a novel label refinement framework with clustering intra-camera similarity. Intra-camera feature distribution pays more attention to the appearance of pedestrians and labels are more reliable. We conduct intra-camera training to get local clusters in each camera, respectively, and refine inter-camera clusters with local results. We hence train the Re-ID model with refined reliable pseudo labels in a self-paced way. Extensive experiments demonstrate that the proposed method surpasses state-of-the-art performance. Code is available at https://***/leeBooMla/ICSR.
Recent advancements in unified multimodal understanding and visual generation (or multimodal generation) models have been hindered by their quadratic computational complexity and dependence on large-scale training dat...
详细信息
In this paper, we focus on the continuous control of Unmanned Aerial Vehicle (UAV) group in large-scale 3D complex environment based on deep reinforcement learning (DRL) method. The purpose is to make the UAV group sa...
In this paper, we focus on the continuous control of Unmanned Aerial Vehicle (UAV) group in large-scale 3D complex environment based on deep reinforcement learning (DRL) method. The purpose is to make the UAV group safely reach the random target area from a certain starting point, and the flight height and speed are variable during the navigation process. In this paper, a DRL framework combining the human-in-the-loop method is designed. The UAV group is preformed into a mobile whole, and the sensor data of the UAV group is directly mapped to the control signal. The role of human-in-the-loop is to switch the human-machine control right if necessary, so that humans can intervene and correct the dangerous actions of the agent. Based on this framework, an improved Actor-Critic structure is designed, and the policy and value network of the original structure are modified accordingly. We verify the success rate and time efficiency of different numbers of UAV group navigation in the urban environment. The experimental results show that this method can reduce the training convergence time and improve the efficiency and success rate of navigation.
The frequency dynamic of converter-based renewable energy generators is much different from the traditional inertia-based generators in the restoration process of the high renewable energy penetrated power system (HRE...
详细信息
We introduce OmniRL 1, a highly generalizable in-context reinforcement learning (ICRL) model that is meta-trained on hundreds of thousands of diverse tasks. These tasks are procedurally generated by randomizing state ...
详细信息
A complex system with cluttered observations may be a coupled mixture of multiple simple subsystems corresponding to latent entities. Such sub-systems may hold distinct dynamics in the continuous-time domain;therein, ...
详细信息
Recently, Segment Anything Model (SAM) shows exceptional performance in generating high-quality object masks and achieving zero-shot image segmentation. However, as a versatile vision model, SAM is primarily trained w...
详细信息
Recent Compositional Zero-Shot Learning (CZSL) methods increasingly adopt the pre-trained vision-language models to capture the contextual relations between image and text spaces. However, the single-class-token desig...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
Recent Compositional Zero-Shot Learning (CZSL) methods increasingly adopt the pre-trained vision-language models to capture the contextual relations between image and text spaces. However, the single-class-token design from Transformer-based encoder inevitably captures contextual information from unrelated objects and background, thus hindering the modeling of fine-grained class-specific visual features. Suffering from cross-modal gap, prior methods also struggle to improve compositional recognition performance. To address these issues, we propose a fine-grained cross-modal concepts refinement framework, termed as Refiner, which comprises two pivotal components: (i) the fine-grained concepts refinement of image embeddings to capture state-object context within visual scenes, and (ii) the cross-modal information fusion to mitigate the modality gap. By leveraging learnable query vectors to capture region-specific semantic information pertinent to composition labels, our approach refines visual representations with fine-grained state-object context information. As for cross-modal information fusion, we construct a robust image-to-text mapping by aligning visual embeddings with states, objects, and compositions, respectively. Extensive experiments demonstrate that our Refiner achieves new state-of-the-art performance across all popular benchmarks in both closed- and open-world settings.
暂无评论