检索结果-内蒙古大学图书馆

Active Vision for deep visual learning: A Unified Pooling Framework

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS 2022年第10期18卷 6610-6618页

作者： Guo, Nan Gu, Ke Qiao, Junfei Liu, Hantao Beijing Univ Technol Fac Informat TechnolBeijing Artificial Intellige Engn Res Ctr Intelligent Percept & Autonomous Con Minist EducBeijing Lab Smart Environm ProtectBe Beijing 100124 Peoples R China Cardiff Univ Sch Comp Sci & Informat Cardiff CF10 3AT Wales

Convolutional neural networks (CNNs) can be generally regarded as learning-based visual systems for computer vision tasks. By imitating the operating mechanism of the human visual system (HVS), CNNs can even achieve better results than human beings in some visual tasks. However, they are primary when compared to the HVS for the reason that the HVS has the ability of active vision to promptly analyze and adapt to specific tasks. In this article, a new unified pooling framework is proposed and a series of pooling methods are designed based on the framework to implement active vision to CNNs. In addition, an active selection pooling (ASP) is put forward to reorganize the existing and newly proposed pooling methods. The CNN models with an ASP tend to have a behavior of focus selection according to tasks during the training process, which acts extremely similar to the HVS.

关键词： visual systems Task analysis visualization Training Convolutional neural networks Informatics Image color analysis Active vision deep convolutional neural networks (CNNs) deep visual learning human visual system (HVS) pooling framework

来源：评论

学校读者我要写书评

暂无评论

Kinship recognition from faces using deep learning with imbalanced data

引用

MULTIMEDIA TOOLS AND APPLICATIONS 2023年第10期82卷 15859-15874页

作者： Othmani, Alice Han, Duqing Gao, Xin Ye, Runpeng Hadid, Abdenour Univ Paris Est Creteil UPEC LISSI F-94400 Vitry Sur Seine France Ecole Ingn Generaliste Numer EFREI Paris F-94800 Villejuif France Sorbonne Univ Abu Dhabi Sorbonne Ctr Artificial Intelligence Abu Dhabi U Arab Emirates

Kinship verification from faces aims to determine whether two person share some family relationship based only on the visual facial patterns. This has attracted a significant interests among the scientific community due to its potential applications in social media mining and finding missing children. In this work, We propose a novel pattern analysis technique for kinship verification based on a new deep learning-based approach. More specifically, given a pair of face images, we first use Resnet50 to extract deep features from each image. Then, feature distances between each pair of images are computed. Importantly, to overcome the problem of unbalanced data, One Hot Encoding for labels is utilised. The distances finally are fed to a deep neural networks to determine the kinship relation. Extensive experiments are conducted on FIW dataset containing 11 classes of kinship relationships. The experiments showed very promising results and pointed out the importance of balancing the training dataset. Moreover, our approach showed interesting ability of generalization. Results show that our approach performs better than all existing approaches on grandparents-grandchildren type of kinship. To support the principle of open and reproducible research, we are soon making our code publicly available to the research community: ***/Steven-HDQ/Kinship-Recognition.

关键词： Human-computer interaction Kinship recognition deep visual learning deep learning Biometrics

来源：评论

学校读者我要写书评

暂无评论

Multi-facial patches aggregation network for facial expression recognition and facial regions contributions to emotion display

引用

MULTIMEDIA TOOLS AND APPLICATIONS 2021年第9期80卷 13639-13662页

作者： Hazourli, Ahmed Rachid Djeghri, Amine Salam, Hanan Othmani, Alice Univ Paris Saclay F-91400 Orsay France Sorbonne Univ F-75006 Paris France Emlyon F-69130 Ecully France Univ Paris Est Creteil LISSI F-94400 Vitry Sur Seine France

In this paper, an approach for Facial Expressions Recognition (FER) based on a multi-facial patches (MFP) aggregation network is proposed. deep features are learned from facial patches using convolutional neural sub-networks and aggregated within one architecture for expression classification. Besides, a framework based on two data augmentation techniques is proposed to expand FER labels training datasets. Consequently, the proposed shallow convolutional neural networks (CNN) based approach does not need large datasets for training. The proposed framework is evaluated on three FER datasets. Results show that the proposed approach achieves state-of-art FER deep learning approaches performance when the model is trained and tested on images from the same dataset. Moreover, the proposed data augmentation techniques improve the expression recognition rate, and thus can be a solution for training deep learning FER models using small datasets. The accuracy degrades significantly when testing for dataset bias. A fine-tuning can overcome the problem of transition from laboratory-controlled conditions to in-the-wild conditions. Finally, the emotional face is mapped using the MFP-CNN and the contribution of the different facial areas in displaying emotion as well as their importance in the recognition of each facial expression are studied.

关键词： Human-computer interaction Facial expression recognition deep visual learning multi-facial patches Conditional generative adversarial network

来源：评论

学校读者我要写书评

暂无评论

OPENING deep NEURAL NETWORKS WITH GENERATIVE MODELS

OPENING DEEP NEURAL NETWORKS WITH GENERATIVE MODELS

引用

IEEE International Conference on Image Processing (ICIP)

作者： Vendramini, Marcos Oliveira, Hugo Machado, Alexei dos Santos, Jefersson A. Univ Fed Minas Gerais Dept Comp Sci Belo Horizonte MG Brazil Univ Sao Paulo Inst Math & Stat Sao Paulo Brazil Univ Fed Minas Gerais Dept Anat & Imaging Belo Horizonte MG Brazil Pontificia Univ Catolica Minas Gerais Dept Comp Sci Belo Horizonte MG Brazil

ISBN: (纸本)9781665441155

Image classification methods are usually trained to perform predictions taking into account a predefined group of known classes. Real-world problems, however, may not allow for a full knowledge of the input and label spaces, making failures in recognition a hazard to deep visual learning. Open set recognition methods are characterized by the ability to correctly identifying inputs of known and unknown classes. In this context, we propose GeMOS: simple and plug-and-play open set recognition modules that can be attached to pre-trained deep Neural Networks for visual recognition. The GeMOS framework pairs pre-trained Convolutional Neural Networks with generative models for open set recognition to extract open set scores for each sample, allowing for failure recognition in object recognition tasks. We conduct a thorough evaluation of the proposed method in comparison with state-of-the-art open set algorithms, finding that GeMOS either outperforms or is statistically indistinguishable from more complex and costly models.

关键词： Open Set Recognition Image Classification deep visual learning Out-of-Distribution Detection

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：