Towards Accurate and Compact Architectures via Neural Architecture Transformer

Authors: Guo, Yong; Zheng, Yin; Tan, Mingkui; Chen, Qi; Li, Zhipeng; Chen, Jian; Zhao, Peilin; Huang, Junzhou

Affiliations: South China Univ Technol, Sch Software Engn, Guangzhou 510006, Peoples R China; Peng Cheng Lab, Shenzhen 518066, Peoples R China; Tencent, Weixin Grp, Shenzhen 518054, Peoples R China; South China Univ Technol, Key Lab Big Data & Intelligent Robot, Minist Educ, Guangzhou 510006, Peoples R China; Tencent, Tencent AI Lab, Shenzhen 518054, Peoples R China

Published in: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (IEEE Trans Pattern Anal Mach Intell)

Year/Volume/Issue: 2022, Vol. 44, No. 10

Pages: 6501-6516

Subject classification: 0808 [Engineering - Electrical Engineering]; 08 [Engineering]; 0812 [Engineering - Computer Science and Technology (degrees conferrable in Engineering or Science)]

Funding: Key Realm R&D Program of Guangzhou; Key-Area Research and Development Program of Guangdong Province; National Natural Science Foundation of China; Program for Guangdong Introducing Innovative and Entrepreneurial Teams; Tencent AI Lab Rhino-Bird Focused Research Program; Fundamental Research Funds for the Central Universities; Ministry of Science and Technology Foundation; Guangdong Basic and Applied Basic Research Foundation; Guangzhou Science and Technology Planning Project; Opening Project of Guangdong Key Laboratory of Big Data Analysis and Processing

Keywords: Architecture optimization; neural architecture search; compact architecture design; operation transition

Abstract: Designing effective architectures is one of the key factors behind the success of deep neural networks. Existing deep architectures are either manually designed or automatically searched by some Neural Architecture Search (NAS) methods. However, even a well-designed/searched architecture may still contain many non-significant or redundant modules/operations (e.g., some intermediate convolution or pooling layers). Such redundancy may not only incur substantial memory consumption and computational cost but also deteriorate the performance. Thus, it is necessary to optimize the operations inside an architecture to improve the performance without introducing extra computational cost. To this end, we have proposed a Neural Architecture Transformer (NAT) method which casts the optimization problem into a Markov Decision Process (MDP) and seeks to replace the redundant operations with more efficient operations, such as skip or null connection. Note that NAT only considers a small number of possible replacements/transitions and thus comes with a limited search space. As a result, such a small search space may hamper the performance of architecture optimization. To address this issue, we propose a Neural Architecture Transformer++ (NAT++) method which further enlarges the set of candidate transitions to improve the performance of architecture optimization. Specifically, we present a two-level transition rule to obtain valid transitions, i.e., allowing operations to have more efficient types (e.g., convolution → separable convolution) or smaller kernel sizes (e.g., 5x5 → 3x3). Note that different operations may have different valid transitions. We further propose a Binary-Masked Softmax (BMSoftmax) layer to omit the possible invalid transitions. Last, based on the MDP formulation, we apply policy gradient to learn an optimal policy, which will be used to infer the optimized architectures. Extensive experiments show that the transformed architectures significantly outperform…
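The Binary-Masked Softmax mentioned in the abstract can be illustrated with a short sketch: the idea is to force the probability of transitions that are invalid for the current operation to zero before the policy samples a replacement. The Python/PyTorch sketch below is an illustration under our own assumptions; the tensor shapes, the five-candidate example, and the helper name binary_masked_softmax are hypothetical and not taken from the authors' code.

import torch
import torch.nn.functional as F

def binary_masked_softmax(logits: torch.Tensor, valid_mask: torch.Tensor) -> torch.Tensor:
    """Softmax over candidate operation transitions, with invalid transitions masked out.

    logits:     [num_edges, num_candidates] transition scores.
    valid_mask: [num_edges, num_candidates] binary mask; 1 = transition allowed
                for the current operation, 0 = invalid transition.
    """
    # Send invalid entries to -inf so softmax assigns them exactly zero probability.
    masked_logits = logits.masked_fill(valid_mask == 0, float("-inf"))
    return F.softmax(masked_logits, dim=-1)

# Hypothetical example: for one edge, only the first three of five candidate
# transitions (e.g., a more efficient operation type, a smaller kernel, or a
# skip/null connection) are valid under a two-level rule like the one above.
logits = torch.randn(1, 5)                # scores for 5 illustrative candidates
valid = torch.tensor([[1, 1, 1, 0, 0]])   # last two candidates are invalid here
probs = binary_masked_softmax(logits, valid)
print(probs)                              # invalid candidates receive probability 0

In the paper's MDP formulation, probabilities of this form would parameterize the transition policy, which the abstract states is then trained with policy gradient to infer the optimized architectures.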
