Point cloud processing plays an increasingly essential role in three-dimensional (3D) computer vision tasks such as target detection, scene parsing, and environmental perception. Compared with methods that require aligned point cloud data for classification and segmentation, strictly rotation-invariant representations offer much stronger robustness. Inspired by the success of deep learning, we propose a novel neural network for multi-head attentional point cloud classification and segmentation using strictly rotation-invariant representations. Our research focuses on processing point clouds rotated in any direction both effectively and precisely. First, strictly rotation-invariant point cloud representations are obtained through point projection. Then a multi-head attentional convolution layer (MACL) with attention coding is applied to improve point cloud feature extraction. Finally, the network assigns different responses to points through a key-point descriptor, which is added to the global feature so that the overall geometry is recognized well. Attention pooling and a multi-layer perceptron (MLP) built on an advanced DenseNet allow our method to exploit deeper information for higher accuracy. Our network achieves 90.63% and 87.50% classification accuracy on ModelNet10 and ModelNet40, respectively, and a 75.15% mean intersection over union (mIoU) on the ShapeNet Part dataset, with these results remaining stable under arbitrary rotations. Experiments with rotated inputs indicate that our framework achieves better point cloud classification and segmentation performance than most state-of-the-art methods.
In this paper, we propose a deep-learning-based approach for facial action unit (AU) detection by enhancing and cropping regions of interest in face images. The approach adds two novel nets (i.e., layers) to a pretrained convolutional neural network (CNN) model: enhancing layers and cropping layers. For the enhancing layers (denoted E-Net), we design an attention map based on facial landmark features and apply it to the pretrained network to conduct enhanced learning. For the cropping layers (denoted C-Net), we crop facial regions around the detected landmarks and design individual convolutional layers to learn deeper features for each facial region. We then combine the E-Net and the C-Net to construct the Enhancing and Cropping Net (EAC-Net), which learns both feature-enhancing and region-cropping functions effectively. The EAC-Net integrates three important elements, i.e., transfer learning, attention coding, and region-of-interest processing, making our AU detection approach more efficient and more robust to changes in facial position and orientation. Our approach shows a significant performance improvement over state-of-the-art methods when tested on the BP4D and DISFA AU datasets. With a slight modification, the EAC-Net also shows its potential for estimating AU intensities accurately. We have also studied the performance of the proposed EAC-Net under two very challenging conditions: (1) faces with partial occlusion and (2) faces with large head pose variations. Experimental results show that (1) the EAC-Net learns AU correlations effectively and predicts AUs reliably even when only half of a face is visible, especially the lower half; and (2) the EAC-Net also works well under very large head poses, significantly outperforming a baseline approach. It further works much better without face frontalization than with frontalization through image warping.