Small target detection is a critical problem in the Infrared Search And Track (IRST) system. Although it has been studied for years, there are some challenges remained, e.g. cloud edges and horizontal lines are likely...
详细信息
It is well known that the backgrounds or the targets always change in real scenes, which weakens the effectiveness of classical tracking algorithms because of frequent model mismatches. In this paper, an object tracki...
详细信息
In the entire diabetic population, the total number of patients with diabetic retinopathy is more than 50%, and the longer diabetes, the higher the incidence of retinopathy and the rate of blindness. Besides, the bloo...
详细信息
In the entire diabetic population, the total number of patients with diabetic retinopathy is more than 50%, and the longer diabetes, the higher the incidence of retinopathy and the rate of blindness. Besides, the blood vessel of ocular fundus is the only blood vessel that can be directly observed, which has excellent application value in medical diagnostics. Photoacoustic Tomography (PAT) is an emerging technique that can obtain high-resolution 3D in-vivo images of optical absorption by sensing laser-generated ultrasound. Therefore, in this paper, we applied U-net neural network for the segmentation of blood vessel of ocular fundus images that opens up new methods for fundus medical image processing. Then we use 2D Time Reversal photoacoustic simulation based on k-WAVE MATLAB toolbox to convert the fundus segmentation of blood vessel images into photoacoustic images. Finally, we use the ResNet Network for the diagnosis of diabetes, in which the input data are the healthy and patient photoacoustic images of the fundus segmentation of blood vessel. We achieved 85% accuracy with 158 training samples. These results demonstrate the power of using deep learning for the analysis of diabetes through the fundus segmentation photoacoustic images of the blood vessel.
A deep Neural Network model was trained to classify the facial expression in unconstrained images, which comprises nine layers, including input layer, convolutional layer, pooling layer, fully connected layers and out...
详细信息
Classifying the sub-categories of an object from the same super-category (e.g., bird) in a fine-grained visual classification (FGVC) task highly relies on mining multiple discriminative features. Existing approaches m...
详细信息
Detecting Out-of-Domain (OOD) or unknown intents from user queries is essential in a task-oriented dialog system. A key challenge of OOD detection is to learn discriminative semantic features. Traditional cross-entrop...
详细信息
As fine-grained visual classification (FGVC) being developed for decades, great works related have exposed a key direction - finding discriminative local regions and revealing subtle differences. However, unlike ident...
详细信息
In this paper, we denote a color image by a quaternion function, then find edge points by solving the maximum of quaternion fractional directional differentiation(QFDD)'s norm. This method is called edge detection...
详细信息
Recently there has been a growing interest in Transformer not only in NLP but also in computer vision. We wonder if transformer can be used in face recognition and whether it is better than CNNs. Therefore, we investi...
详细信息
Domain adaptation aims to leverage a labeled source domain to learn a classifier for the unlabeled target domain with a different distribution. Previous methods mostly match the distribution between two domains by glo...
详细信息
暂无评论