Low-light image enhancement (LLIE) aims to improve low-illumination images. However, existing methods face two challenges: (1) uncertainty in restoration from diverse brightness degradations;(2) loss of texture and co...
详细信息
The activations of Facial Action Units (AUs) mutually influence one another. While the relationship between a pair of AUs can be complex and unique, existing approaches fail to specifically and explicitly represent su...
详细信息
In this paper, we hypothesize that the effects of the degree of typicality in natural semantic categories can be generated based on the structure of artificial categories learned with deep learning models. Motivated b...
详细信息
The rapid advancement of photorealistic generators has reached a critical juncture where the discrepancy between authentic and manipulated images is increasingly indistinguishable. Thus, benchmarking and advancing tec...
详细信息
In gastroenteroscopy image analysis, numerous CADs demonstrate that deep learning aids doctors' diagnosis. The shapes and sizes of the lesions are varied. And in the clinic, the dataset appears to be data imbalanc...
In gastroenteroscopy image analysis, numerous CADs demonstrate that deep learning aids doctors' diagnosis. The shapes and sizes of the lesions are varied. And in the clinic, the dataset appears to be data imbalanced. However, existing methods directly classify by texture and ignore lesions with various shapes and sizes. To address the issue above, we propose a deep neural network, which consists of multi-scale feature extraction, contrastive feature learning and a multi-scale feature fusion module. We train the contrastive feature learning module and multi-scale feature fusion module simultaneously to alleviate the issue of data distribution differences. Thus, the proposed network can better identify various categories. Extensive experiments on the Hyper Kvasir dataset show that the proposed Hybrid-M2CL outperforms the benchmark proposed by the dataset with 5.0% Macro Precision, 3.3% Macro Recall, 3.4% Macro F1-score, 3.3% Micro Precision, 3.6% MCC. In addition, it outperforms the SOTA by 1.1% Macro F1-score, 2.6% MCC, and 2.0% B-ACC.
Robot visual navigation is a relevant research topic. Current deep navigation models conveniently learn the navigation policies in simulation, given the large amount of experience they need to collect. Unfortunately, ...
详细信息
Robot visual navigation is a relevant research topic. Current deep navigation models mostly learn the navigation policies in simulation. This is convenient, given the efficiency offered by simulators to collect the re...
详细信息
作者:
Chen, JunliangZhao, XiaodongShen, LinlinComputer Vision Institute
School of Computer Science and Software Engineering Shenzhen University China Shenzhen Institute of Artificial Intelligence of Robotics of Society Shenzhen China Guangdong Key Laboratory of Intelligent Information Processing Shenzhen University Shenzhen 518060 China
Object detection has made substantial progress in the last decade, due to the capability of convolution in extracting local context of objects. However, the scales of objects are diverse and current convolution can on...
详细信息
Pyramidal networks are standard methods for multi-scale object detection. Current researches on feature pyramid networks usually adopt layer connections to collect features from certain levels of the feature hierarchy...
详细信息
Volumetric deep learning approach towards stereo matching aggregates a cost volume computed from input left and right images using 3D convolutions. Recent works showed that utilization of extracted image features and ...
详细信息
暂无评论