ISBN (digital): 9781510628298
ISBN (print): 9781510628298
The performance of computer-vision-based face image retrieval systems declines significantly under large variations in illumination, pose, and facial expression. To tackle this problem, we propose a closed-loop face image retrieval system with implicit eye-tracking-based feedback. It combines the state-of-the-art computer vision method Face++ with the powerful cognitive ability of humans. In this system, Face++ provides initial retrieval results for a target sample face image, and the 36 top-ranked images are displayed on screen to collect the users' eye-tracking data. After the user's cognition results are mined from the eye-tracking data with a deep neural network and fed back to the system, the system begins a new round of retrieval. Experimental results from 10 volunteers on a face database containing 1,500 images of 50 celebrities show that the system's performance improves over iterations, and that upon convergence it achieves an average precision above 0.918 and an average recall above 0.897.
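For readers unfamiliar with the two reported metrics, the following minimal sketch shows how precision and recall are computed for a single retrieval round. It is not the authors' evaluation code. The function name `precision_recall`, the image identifiers, and the figure of 30 relevant images per identity (1,500 / 50 if the database is evenly split, which the abstract does not state) are assumptions made for illustration.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query.

    retrieved: identifiers of the images shown to the user
    relevant:  identifiers of all database images of the target person
    """
    retrieved_set = set(retrieved)
    relevant_set = set(relevant)
    hits = len(retrieved_set & relevant_set)
    # Precision: fraction of displayed images that show the target person.
    precision = hits / len(retrieved_set) if retrieved_set else 0.0
    # Recall: fraction of the target person's images that were displayed.
    recall = hits / len(relevant_set) if relevant_set else 0.0
    return precision, recall


# Hypothetical round: 36 displayed images, 30 relevant images in the
# database, 27 of which appear among the displayed ones.
retrieved = [f"img_{i}" for i in range(36)]
relevant = [f"img_{i}" for i in range(9, 39)]
precision, recall = precision_recall(retrieved, relevant)
print(precision, recall)
```

In a closed-loop setting, these two numbers would be recomputed after every feedback round to track whether the ranking is improving.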
Online garment shopping has gained many customers in recent years. Describing a dress using keywords does not always yield the proper results, which in turn leads to dissatisfaction of customers. A visual search based...
ISBN (print): 9789811513862
The proceedings contain 10 papers. The special focus in this conference is on Computer Vision Applications. The topics include: AECNN: Autoencoder with Convolutional Neural Network for Hyperspectral Image Classification; Optic Disc Segmentation in Fundus Images Using Anatomical Atlases with Nonrigid Registration; Bird Species Classification Using Transfer Learning with Multistage Training; A Deep Learning Paradigm for Automated Face Attendance; Robust Detection of Iris Region Using an Adapted SSD Framework; Dynamic Image Networks for Human Fall Detection in 360-degree Videos; Image Segmentation and Geometric Feature Based Approach for Fast Video Summarization of Surveillance Videos; Supervised Hashing for Retrieval of Multimodal Biometric Data.
ISBN (digital): 9781510628298
ISBN (print): 9781510628298
Breast cancer has become a worldwide disease in recent years. However, despite its growing prominence, the number of pathologists equipped to handle these cases is insufficient. Computer-aided diagnosis (CAD) systems help reduce costs and improve the efficiency of this process. We propose a framework based on convolutional neural networks (CNNs) that automatically detects multi-class cancer areas on gigapixel pathology slide images. Moreover, taking the characteristics of the slide images into account, rescaling and careful data augmentation were used to train the patch-based model with a small dataset. To validate the framework, we conducted experiments on the Breast Cancer Histology Challenge (BACH) dataset and obtained an International Conference on Image Analysis and Recognition (ICIAR) score of 0.582 on the 4-class tissue segmentation task, outperforming the second-place finisher in BACH2018.
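A patch-based model cannot take a gigapixel slide as a single input, so the slide is first cut into fixed-size tiles. The sketch below illustrates that tiling step only. The abstract does not give the patch size, stride, or border handling, so the function name `patch_grid`, the default values, and the edge rule are all assumptions.

```python
def patch_grid(width, height, patch=512, stride=512):
    """Return (x, y) top-left corners of patches covering a slide.

    patch and stride are illustrative defaults, not values from the paper.
    A slide smaller than one patch yields a single corner at (0, 0);
    padding such a slide is left to the caller.
    """
    xs = list(range(0, max(width - patch, 0) + 1, stride))
    ys = list(range(0, max(height - patch, 0) + 1, stride))
    # Add a last column/row flush with the border so no strip is skipped.
    if width > patch and xs[-1] != width - patch:
        xs.append(width - patch)
    if height > patch and ys[-1] != height - patch:
        ys.append(height - patch)
    return [(x, y) for y in ys for x in xs]


# Hypothetical slide region of 1100 x 512 pixels.
corners = patch_grid(1100, 512)
print(corners)
```

Using a stride smaller than the patch size produces overlapping patches, which is one common way to enlarge a small training set; whether the authors did so is not stated.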
Melanoma is a life-threatening form of skin cancer when left undiagnosed at the early stages. Although there are more cases of non-melanoma cancer than melanoma cancer, melanoma cancer is more deadly. Early detection ...
ISBN (digital): 9781510628298
ISBN (print): 9781510628298
This paper proposes an image smoothing method that is invariant under general coordinate transformations. The method is based on a position-dependent metric tensor that transforms appropriately under general coordinate transformations. Using this metric tensor, an image smoothing method invariant under general coordinate transformations is constructed. The effectiveness of the proposed method is confirmed by computer experiments.
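As background, "transforms appropriately" for a metric tensor normally refers to the standard covariant transformation law, written below. The abstract specifies neither the metric nor the smoothing operator, so this is only the textbook relation, with symbols ($g_{ij}$, $x$, $x'$) chosen here for illustration.

```latex
% Covariant transformation law for a metric tensor under x -> x'
g'_{ij}(x') = \frac{\partial x^{k}}{\partial x'^{i}}
              \frac{\partial x^{l}}{\partial x'^{j}} \, g_{kl}(x)

% Consequence: the line element is the same in both coordinate systems
ds^{2} = g_{ij}(x)\, dx^{i}\, dx^{j} = g'_{ij}(x')\, dx'^{i}\, dx'^{j}
```

Because distances measured with such a metric do not depend on the coordinate system, a smoothing operation defined purely in terms of those distances gives the same result in any coordinates.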
During gameplay, a player experiences emotional turmoil. In most of the cases, these emotions directly reflect the outcome of the game. Adapting game features based on players’ emotions necessitates a way to detect t...
ISBN (digital): 9781510628298
ISBN (print): 9781510628298
Automatic coin classification plays an important role in many applications, e.g., vending systems. Glossy reflection is one of the key factors that affect the performance of vision-based coin classification, especially in complex environments. In this paper, we propose a novel method for robust coin classification. Unlike the previous method, we first detect the glossy area, using edge and texture features. Deep learning features are then extracted from the non-glossy area instead of the whole coin image. Finally, the classification results are obtained from the VGG network scheme. Comprehensive experiments show that our method is robust under various complex environments, and comparison experiments demonstrate that it can outperform the state-of-the-art method. Our method achieves 95.80% accuracy.
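The idea of extracting features only from the non-glossy area can be sketched as a masking step applied before the image reaches the network. The authors detect glossy regions with edge and texture features. The sketch below substitutes a plain brightness threshold as a stand-in, and the function names and the threshold of 240 are hypothetical.

```python
def non_glossy_mask(gray, threshold=240):
    """Return a 2D boolean mask that is True where a pixel is kept.

    gray: 2D list of grayscale values in 0..255.
    A brightness threshold stands in for the paper's edge- and
    texture-based glossy detector, which the abstract does not detail.
    """
    return [[pixel < threshold for pixel in row] for row in gray]


def apply_mask(gray, mask, fill=0):
    """Replace masked-out (glossy) pixels with a constant fill value."""
    return [
        [pixel if keep else fill for pixel, keep in zip(row, mask_row)]
        for row, mask_row in zip(gray, mask)
    ]


# Hypothetical 2 x 2 coin crop with two saturated (glossy) pixels.
gray = [[10, 250], [245, 100]]
mask = non_glossy_mask(gray)
masked = apply_mask(gray, mask)
print(masked)
```

The masked image would then be passed to the feature extractor in place of the whole coin image.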
Mine water bodies create enormous water pollution due to heavy use of water in different stages of mining. Detection and monitoring of such water bodies are necessary for environmental benefits. In the past, mine wate...
This paper presents an Optical Character Recognition (OCR) system for documents with English text and mathematical expressions. Neural network architectures using CNN layers and/or dense layers achieve high level accu...