In numerous tasks, deep networks are state of the art. However, they are still not well understood from a statistical point of view. In this article, we try to contribute to filling this gap, and we consider regression models involving deep multilayer perceptrons (MLPs) with rectified linear unit (ReLU) activation functions. Studying the statistical properties of such models is difficult, mainly because in practice they may be heavily overparameterized. For the sake of simplicity, we focus here on the sum of squared errors (SSE) cost function, which is the standard cost function for regression purposes. In this framework, we study the asymptotic behavior of the difference between the SSE of estimated models and the SSE of the theoretical best model. This behavior gives us information on the overfitting properties of such models. We use new methodology introduced to deal with models with a loss of identifiability, i.e., the case where the true parameter cannot be identified uniquely. Hence, we do not have to assume that a unique parameter vector realizes the best regression function, an assumption that seems too strong for heavily overparameterized models. Our results shed new light on the overfitting behavior of MLP models. (C) 2019 Elsevier B.V. All rights reserved.
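To make the setting concrete, here is a minimal sketch, not the paper's estimator or proof framework, of the kind of model described: a one-hidden-layer ReLU multilayer perceptron fit by gradient descent on the SSE cost. The layer width, learning rate and synthetic data are illustrative assumptions.

import numpy as np

rng = np.random.default_rng(0)

# Synthetic regression data: y = f(x) + noise (illustrative choice of f).
X = rng.uniform(-1.0, 1.0, size=(200, 1))
y = np.sin(3.0 * X) + 0.1 * rng.normal(size=(200, 1))

hidden = 50                                   # deliberately overparameterized
W1 = rng.normal(scale=0.5, size=(1, hidden))
b1 = np.zeros(hidden)
W2 = rng.normal(scale=0.5, size=(hidden, 1))
b2 = np.zeros(1)

lr = 1e-2
for step in range(5000):
    # Forward pass with ReLU activations.
    pre = X @ W1 + b1
    h = np.maximum(pre, 0.0)
    pred = h @ W2 + b2
    residual = pred - y                       # SSE = sum(residual ** 2)

    # Backward pass: gradients of the SSE cost, averaged over the sample.
    grad_pred = 2.0 * residual / len(X)
    grad_W2 = h.T @ grad_pred
    grad_b2 = grad_pred.sum(axis=0)
    grad_h = grad_pred @ W2.T
    grad_pre = grad_h * (pre > 0.0)           # ReLU derivative
    grad_W1 = X.T @ grad_pre
    grad_b1 = grad_pre.sum(axis=0)

    W1 -= lr * grad_W1
    b1 -= lr * grad_b1
    W2 -= lr * grad_W2
    b2 -= lr * grad_b2

print("final SSE:", float((residual ** 2).sum()))

In the paper's framework, the quantity of interest would be the gap between the SSE of such a fitted model and the SSE of the theoretical best model, studied asymptotically.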
The authors present a novel gradient descent algorithm for deep learning, called RAPIDO. It adapts over time and performs optimisation using current, past and future information, similarly to a PID controller. The proposed method is suited to optimising deep neural networks built from activation functions such as the sigmoid, hyperbolic tangent and ReLU, because it can adapt appropriately to sudden changes in gradients. The authors study their method experimentally and report comparisons with other methods on a quadratic objective function and the MNIST classification task, where the proposed method outperforms the other methods.
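The abstract does not give RAPIDO's actual update rule, so the following is only an illustrative sketch of the PID analogy it draws: the proportional term uses the current gradient, the integral term accumulates past gradients, and the derivative term uses the change in the gradient as a rough proxy for anticipating future behaviour. The gains kp, ki, kd, the step size and the quadratic test problem are all hypothetical choices.

import numpy as np

def pid_step(theta, grad, state, lr=0.05, kp=1.0, ki=0.01, kd=0.1):
    """One PID-style update; `state` carries (integral, previous gradient)."""
    integral, prev_grad = state
    integral = integral + grad            # past information
    derivative = grad - prev_grad         # gradient trend, a crude look-ahead
    update = kp * grad + ki * integral + kd * derivative
    return theta - lr * update, (integral, grad)

# Minimise an ill-conditioned quadratic f(theta) = 0.5 * theta^T A theta,
# in the spirit of the quadratic test objective mentioned above.
A = np.diag([1.0, 10.0])
theta = np.array([1.0, 1.0])
state = (np.zeros_like(theta), np.zeros_like(theta))
for _ in range(2000):
    theta, state = pid_step(theta, A @ theta, state)
print("theta:", theta, "f(theta):", 0.5 * theta @ A @ theta)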
ISBN (Print): 9783031209734; 9783031209741
Comparisons, or inequality tests, are an essential building block of rectified linear unit (ReLU) functions, which are ever more present in machine learning, specifically in neural networks. Motivated by the increasing interest in privacy-preserving artificial intelligence, we explore the current state of the art of privacy-preserving comparisons over multiparty computation (MPC). We then introduce constant-round variations and combinations that are compatible with customary fixed-point arithmetic over MPC. Our main focus is implementation and benchmarking; hence, we showcase our contributions via an open-source library compatible with current MPC software tools. Furthermore, we include a comprehensive comparative analysis of various adversarial settings. Our results improve running times in practical scenarios. Finally, we offer conclusions about the viability of these protocols when adopted for privacy-preserving machine learning.
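As a cleartext illustration only, not one of the paper's MPC protocols, the sketch below shows why ReLU reduces to a single inequality test when values use the fixed-point encoding customary in MPC frameworks; in an actual protocol the value would be secret-shared and the comparison bit produced by a secure comparison sub-protocol. The ring size and fractional precision are illustrative assumptions.

FRACTIONAL_BITS = 16          # fixed-point precision (illustrative)
MODULUS = 2 ** 64             # ring in which shares would live (illustrative)

def encode(value: float) -> int:
    """Encode a real number as a fixed-point ring element."""
    return int(round(value * (1 << FRACTIONAL_BITS))) % MODULUS

def decode(element: int) -> float:
    """Decode, interpreting the upper half of the ring as negative numbers."""
    if element >= MODULUS // 2:
        element -= MODULUS
    return element / (1 << FRACTIONAL_BITS)

def relu_via_comparison(x_enc: int) -> int:
    # The only non-linear step: an inequality test against zero. This is the
    # operation that a secure comparison protocol would compute over shares.
    ge_zero = 1 if x_enc < MODULUS // 2 else 0
    # Multiplying by the comparison bit yields max(x, 0).
    return (ge_zero * x_enc) % MODULUS

for v in (-2.5, 0.0, 3.25):
    print(v, "->", decode(relu_via_comparison(encode(v))))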