In reality, the laborious nature of label annotation leads to the widespread existence of limited labeled data. Moreover, multi-scale data have received widespread attention due to its rich knowledge representation. H...
详细信息
In recent times, remote learning has experienced significant growth, yet the challenge of conducting academic examinations in a secure manner persists. Various approaches have been adopted by universities, such as col...
详细信息
Wireless ultraviolet (UV) has strong scattering characteristics and can communicate through non-direct *** UV signals are transmitted in the atmosphere,they are affected by the absorption and scattering effects of atm...
详细信息
Wireless ultraviolet (UV) has strong scattering characteristics and can communicate through non-direct *** UV signals are transmitted in the atmosphere,they are affected by the absorption and scattering effects of atmospheric particles and atmospheric turbulence,resulting in attenuation of UV signal energy and reduced reliability of the communication *** paper focuses on the channel model of UV non-direct-view single scattering communication,and simulates and analyzes the communication characteristics of UV light in atmospheric turbulence and mixed aerosol environment under horizontal,vertical and oblique range communication *** results show that at equal relative humidity,the wireless UV non-directive scattering communication performance for vertical communication scenarios is more affected by the mixed aerosol environment and the communication performance is worse.
Deep learning-based image semantic segmentation approaches heavily rely on large-scale training datasets with dense annotations and often suffer from scarce semantic labels for unseen categories. This limitation has s...
详细信息
This research provides a deep learning approach and handwriting recognition to understand medical notes. The system processes and interprets textual medical notes using connectionist temporal classification (CTC) and ...
详细信息
The video grounding(VG) task aims to locate the queried action or event in an untrimmed video based on rich linguistic descriptions. Existing proposal-free methods are trapped in the complex interaction between video ...
详细信息
The video grounding(VG) task aims to locate the queried action or event in an untrimmed video based on rich linguistic descriptions. Existing proposal-free methods are trapped in the complex interaction between video and query, overemphasizing cross-modal feature fusion and feature correlation for VG. In this paper, we propose a novel boundary regression paradigm that performs regression token learning in a transformer. Particularly, we present a simple but effective proposal-free framework, namely video grounding transformer(ViGT), which predicts the temporal boundary using a learnable regression token rather than multi-modal or cross-modal features. In ViGT, the benefits of a learnable token are manifested as follows.(1) The token is unrelated to the video or the query and avoids data bias toward the original video and query.(2) The token simultaneously performs global context aggregation from video and query ***, we employed a sharing feature encoder to project both video and query into a joint feature space before performing cross-modal co-attention(i.e., video-to-query attention and query-to-video attention) to highlight discriminative features in each modality. Furthermore, we concatenated a learnable regression token [REG] with the video and query features as the input of a vision-language transformer. Finally, we utilized the token [REG] to predict the target moment and visual features to constrain the foreground and background probabilities at each timestamp. The proposed ViGT performed well on three public datasets:ANet-Captions, TACoS, and YouCookⅡ. Extensive ablation studies and qualitative analysis further validated the interpretability of ViGT.
This research introduces an innovative method for forecasting cardiomegaly, a common heart condition marked by an enlarged heart, by combining deep learning and machine learning methods. Using the ResNet50 architectur...
详细信息
Diabetes is a prevalent and chronic disease affecting millions worldwide, posing significant challenges in its management and treatment. This review article aims to explore the current and potential future roles of ma...
详细信息
Fast technical advancements have been implemented in multiple domains of life, including agriculture. technology can help the agriculture industry cut down on the energy and time lost on using traditional methods. The...
详细信息
Indoor farming has emerged as a promising alternative to traditional farming methods, offering a controlled and optimized environment for crop growth that can improve yield and reduce the environmental impact of agric...
详细信息
暂无评论