We study the joint learning of network topology and mixed opinion dynamics, in which agents may follow different update rules. Such a model captures the diversity of real individual interactions. We propose a learning alg...
ISBN (print): 1577358872
Fairness has become an important concern in Federated learning (FL). An unfair model that performs well for some clients while performing poorly for others can reduce clients' willingness to participate. In this work, we identify a direct cause of unfairness in FL: the use of an unfair direction to update the global model, which favors some clients while conflicting with other clients' gradients at the model and layer levels. To address these issues, we propose a layer-wise fair Federated learning algorithm (FedLF). First, we formulate a multi-objective optimization problem with an effective fairness-driven objective for FL. A layer-wise fair direction is then calculated to mitigate model- and layer-level gradient conflicts and reduce the improvement bias. We further provide a theoretical analysis of how FedLF improves fairness and guarantees convergence. Extensive experiments on different learning tasks and models demonstrate that FedLF outperforms SOTA FL algorithms in terms of accuracy and fairness. The source code is available at https://***/zibinpan/FedLF.
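A minimal sketch of the underlying idea, under assumptions of our own (per-layer client gradients held as NumPy arrays and a PCGrad-style projection), not the authors' exact FedLF procedure: conflicting components between clients' layer gradients are projected out before averaging, so the aggregated direction does not directly oppose any client at any layer.

```python
import numpy as np

def layerwise_fair_direction(client_layer_grads):
    """client_layer_grads: list over clients, each a list of per-layer
    gradient arrays. Returns one aggregated update direction per layer."""
    num_clients = len(client_layer_grads)
    num_layers = len(client_layer_grads[0])
    directions = []
    for layer in range(num_layers):
        grads = [np.asarray(c[layer], dtype=float) for c in client_layer_grads]
        adjusted = []
        for i in range(num_clients):
            g = grads[i].copy()
            for j in range(num_clients):
                if i == j:
                    continue
                dot = float(np.dot(g.ravel(), grads[j].ravel()))
                if dot < 0.0:  # layer-level conflict: drop the opposing component
                    g -= dot / (float(np.dot(grads[j].ravel(), grads[j].ravel())) + 1e-12) * grads[j]
            adjusted.append(g)
        directions.append(np.mean(adjusted, axis=0))
    return directions

# Toy usage: two clients whose first-layer gradients conflict.
clients = [[np.array([1.0, 0.0]), np.array([0.5, 0.5])],
           [np.array([-1.0, 0.2]), np.array([0.4, 0.6])]]
print(layerwise_fair_direction(clients))
```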
Humans and animals develop learning-to-learn strategies throughout their lives to accelerate learning. One theory suggests that this is achieved by a metacognitive process of controlling and monitoring learning. Although such learning-to-learn is also observed in motor learning, the metacognitive aspect of learning regulation has not been considered in classical theories of motor learning. Here, we formulated a minimal mechanism of this process as reinforcement learning of motor learning properties, which regulates a policy for memory update in response to sensory prediction error while monitoring its performance. This theory was confirmed in human motor learning experiments, in which the subjective sense of learning-outcome association determined the direction of up- and down-regulation of both learning speed and memory retention. Thus, it provides a simple, unifying account for variations in learning speeds, where the reinforcement learning mechanism monitors and controls the motor learning process. Metacognition is fundamental for regulating learning speeds and memory retention. Here, the authors demonstrate that reinforcement learning mediates this process in implicit motor learning, maximizing rewards and minimizing punishments.
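As a rough, assumption-laden sketch of the idea (a single-state adaptation rule of our own choosing, x_{t+1} = a*x_t + b*e_t, not the paper's fitted model), the retention factor a and learning speed b can themselves be nudged by a reward-like signal that tracks whether updating paid off:

```python
def simulate(trials=200, perturbation=1.0, a=0.9, b=0.2, eta=0.05):
    x = 0.0  # motor memory compensating for the perturbation
    for _ in range(trials):
        error = perturbation - x                  # sensory prediction error
        reward_before = -abs(error)
        x_new = a * x + b * error                 # candidate memory update
        reward_after = -abs(perturbation - x_new)
        advantage = reward_after - reward_before  # did this update policy pay off?
        # reinforcement-style up-/down-regulation of learning speed and retention
        b = min(1.0, max(0.0, b + eta * advantage))
        a = min(1.0, max(0.0, a + eta * advantage))
        x = x_new                                 # commit the memory update
    return x, a, b

print(simulate())  # the memory converges and both a and b are up-regulated
```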
With the vigorous development of intelligent campus construction, information technology in colleges and universities has undergone great changes, shifting from the earlier digital stage toward intelligent devel...
Banknote counterfeiting is a common practice worldwide. Due to recent developments in technology, banknote imitation has become easier than before. Various kinds of algorithms have been developed for the detecti...
Regression is a supervised learning algorithm used to predict an output from given features. In applying regression algorithms such as linear regression, regression using ANN, regression using...
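A generic illustration (not tied to this paper) of regression as supervised learning: fit a linear predictor of an output from given features by least squares.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((100, 3))                                    # input features
y = X @ np.array([2.0, -1.0, 0.5]) + 0.1 * rng.standard_normal(100)  # target output
X1 = np.hstack([X, np.ones((100, 1))])                               # add an intercept column
coef, *_ = np.linalg.lstsq(X1, y, rcond=None)                        # ordinary least-squares fit
print(coef)  # recovered weights (close to [2, -1, 0.5]) and intercept (close to 0)
```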
ISBN (print): 9798331540920; 9783907144107
We derive the rate of convergence to the globally strongly variationally stable Nash equilibrium in a convex game for a zeroth-order learning algorithm. Though we do not assume strong monotonicity of the game, our rates for the one-point feedback, O(Nd/t^(1/2)), and for the two-point feedback, O(N^2 d^2/t), match the best known rates for strongly monotone games under zeroth-order information.
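A hedged sketch of the two feedback models behind such rates (generic sphere-sampling gradient estimators applied to a toy cost, not the paper's algorithm): a one-point scheme queries the cost once per step, a two-point scheme queries it twice.

```python
import numpy as np

rng = np.random.default_rng(0)

def one_point_grad(f, x, delta):
    u = rng.standard_normal(x.shape)
    u /= np.linalg.norm(u)                       # random direction on the unit sphere
    return (x.size / delta) * f(x + delta * u) * u

def two_point_grad(f, x, delta):
    u = rng.standard_normal(x.shape)
    u /= np.linalg.norm(u)
    return (x.size / (2 * delta)) * (f(x + delta * u) - f(x - delta * u)) * u

# Toy usage with the two-point estimator on the cost f(x) = ||x - 1||^2.
f = lambda x: float(np.sum((x - 1.0) ** 2))
x = np.zeros(3)
for t in range(1, 2001):
    step, delta = 0.5 / t, 1.0 / t ** 0.25       # decaying step size and query radius
    x -= step * two_point_grad(f, x, delta)
print(x)  # moves toward the minimizer [1, 1, 1]
```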
ISBN (print): 1577358872
Decentralized learning has emerged as an alternative to the popular parameter-server framework, which suffers from a high communication burden, single-point failure, and scalability issues due to the need for a central server. However, most existing works focus on a single shared model for all workers regardless of the data heterogeneity problem, leaving the resulting model performing poorly on individual workers. In this work, we propose a novel personalized decentralized learning algorithm named DePRL via shared representations. Our algorithm relies on ideas from representation learning theory to learn, in a fully decentralized manner, a low-dimensional global representation collaboratively among all workers, together with a user-specific low-dimensional local head that yields a personalized solution for each worker. We show that DePRL achieves, for the first time, a provable linear speedup for convergence with general non-linear representations (i.e., the convergence rate improves linearly with respect to the number of workers). Experimental results support our theoretical findings, showing the superiority of our method in data-heterogeneous environments.
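A toy sketch under assumptions of our own (a linear model and a fixed ring topology, not the authors' implementation) of the split that drives this kind of personalization: workers gossip-average a shared low-dimensional representation with their neighbours while each trains a private head only on its local data.

```python
import numpy as np

rng = np.random.default_rng(1)
num_workers, d, k, lr, rounds = 4, 10, 3, 0.02, 500
W_true = rng.standard_normal((d, k))                 # common low-dimensional structure
data = []
for _ in range(num_workers):
    X = rng.standard_normal((60, d))
    h_true = rng.standard_normal(k)                  # worker-specific task -> data heterogeneity
    data.append((X, X @ W_true @ h_true + 0.1 * rng.standard_normal(60)))

W = [rng.standard_normal((d, k)) * 0.5 for _ in range(num_workers)]  # local copies of the representation
h = [rng.standard_normal(k) * 0.5 for _ in range(num_workers)]       # personalized heads, never shared
ring = [((i - 1) % num_workers, i, (i + 1) % num_workers) for i in range(num_workers)]

for _ in range(rounds):
    for i, (X, y) in enumerate(data):
        err = X @ W[i] @ h[i] - y
        h[i] -= lr * (X @ W[i]).T @ err / len(y)            # local head step
        W[i] -= lr * X.T @ np.outer(err, h[i]) / len(y)     # local representation step
    W = [(W[a] + W[b] + W[c]) / 3.0 for (a, b, c) in ring]  # gossip-average representations only

print([round(float(np.mean((X @ W[i] @ h[i] - y) ** 2)), 3) for i, (X, y) in enumerate(data)])
```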
Recent success in training artificial agents and robots derives from a combination of direct learning of behavioural policies and indirect learning through value functions(1-3). Policy learning and value learning use distinct algorithms that optimize behavioural performance and reward prediction, respectively. In animals, behavioural learning and the role of mesolimbic dopamine signalling have been extensively evaluated with respect to reward prediction(4); however, so far there has been little consideration of how direct policy learning might inform our understanding(5). Here we used a comprehensive dataset of orofacial and body movements to understand how behavioural policies evolved as naive, head-restrained mice learned a trace conditioning paradigm. Individual differences in initial dopaminergic reward responses correlated with the emergence of learned behavioural policy, but not the emergence of putative value encoding for a predictive cue. Likewise, physiologically calibrated manipulations of mesolimbic dopamine produced several effects inconsistent with value learning but predicted by a neural-network-based model that used dopamine signals to set an adaptive rate, not an error signal, for behavioural policy learning. This work provides strong evidence that phasic dopamine activity can regulate direct learning of behavioural policies, expanding the explanatory power of reinforcement learning models for animal learning(6). Analysis of data collected from mice learning a trace conditioning paradigm shows that phasic dopamine activity in the brain can regulate direct learning of behavioural policies, and dopamine sets an adaptive learning rate rather than an error-like teaching signal.
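A toy, assumption-heavy sketch of the distinction being drawn (our own minimal setup, not the paper's network model): a dopamine-like prediction-error signal gates how fast an anticipatory-licking policy is updated, rather than serving as the policy's teaching error.

```python
import numpy as np

rng = np.random.default_rng(0)
w, value = 0.0, 0.0   # policy weight (log-odds of licking) and learned cue value
for trial in range(1000):
    p_lick = 1.0 / (1.0 + np.exp(-w))
    lick = rng.random() < p_lick
    reward = 1.0 if lick else 0.0           # reward is collected only by licking
    dopamine = reward - value               # phasic, prediction-error-shaped signal
    value += 0.02 * dopamine                # cue value is learned separately
    lr = 0.5 * max(0.0, dopamine)           # dopamine sets the policy learning RATE...
    grad = (1.0 if lick else 0.0) - p_lick  # ...it is not itself the policy's error term
    w += lr * reward * grad
print(round(1.0 / (1.0 + np.exp(-w)), 3))   # final probability of anticipatory licking
```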
Capsule networks (see Hinton et al., 2018) aim to encode knowledge of and reason about the relationship between an object and its parts. In this letter, we specify a generative model for such data and derive a variational algorithm for inferring the transformation of each model object in a scene and the assignments of observed parts to the objects. We derive a learning algorithm for the object models, based on variational expectation maximization (Jordan et al., 1999). We also study an alternative inference algorithm based on the RANSAC method of Fischler and Bolles (1981). We apply these inference methods to data generated from multiple geometric objects like squares and triangles ("constellations") and data from a parts-based model of faces. Recent work by Kosiorek et al. (2019) has used amortized inference via stacked capsule autoencoders to tackle this problem; our results show that we significantly outperform them where we can make comparisons (on the constellations data).
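A simplified, hedged stand-in for the RANSAC-style route (our own toy setup on 2D "constellation"-like data, not the letter's derivation): sample part correspondences, fit a similarity transform for a template object, and keep the hypothesis that explains the most observed parts.

```python
import numpy as np

rng = np.random.default_rng(0)

def fit_similarity(src, dst):
    """Least-squares 2D similarity (scale-rotation + translation) from point pairs."""
    src_c, dst_c = src - src.mean(0), dst - dst.mean(0)
    s = src_c[:, 0] + 1j * src_c[:, 1]              # points as complex numbers: dst ~ a*src + b
    d = dst_c[:, 0] + 1j * dst_c[:, 1]
    a = np.vdot(s, d) / np.vdot(s, s)
    b = (dst.mean(0)[0] + 1j * dst.mean(0)[1]) - a * (src.mean(0)[0] + 1j * src.mean(0)[1])
    return a, b

def apply(a, b, pts):
    z = a * (pts[:, 0] + 1j * pts[:, 1]) + b
    return np.stack([z.real, z.imag], axis=1)

template = np.array([[0.0, 0.0], [1.0, 0.0], [0.5, 1.0]])        # a "triangle" object's parts
true_a, true_b = 1.5 * np.exp(1j * 0.7), 2.0 - 1.0j              # hidden pose of the object
scene = np.vstack([apply(true_a, true_b, template),
                   rng.uniform(-3, 3, (4, 2))])                  # observed parts plus clutter

best = (None, -1)
for _ in range(200):                                             # RANSAC over sampled correspondences
    idx = rng.choice(len(scene), 2, replace=False)
    a, b = fit_similarity(template[:2], scene[idx])
    pred = apply(a, b, template)
    dist = np.linalg.norm(pred[:, None, :] - scene[None, :, :], axis=2).min(axis=1)
    inliers = int((dist < 0.1).sum())                            # parts explained by this pose
    if inliers > best[1]:
        best = ((a, b), inliers)
print("parts explained by the best hypothesis:", best[1])
```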