Multitask learning (MTL) is a challenging problem, particularly in the realm of computer vision (CV). Setting up vanilla deep MTL requires either hard or soft parameter-sharing schemes that employ greedy search to find the optimal network designs. Despite its widespread application, the performance of MTL models is vulnerable to under-constrained parameters. In this article, we draw on the recent success of the vision transformer (ViT) to propose a multitask representation learning method called multitask ViT (MTViT), which uses a multiple-branch transformer to sequentially process the image patches (i.e., tokens in the transformer) associated with the various tasks. Through the proposed cross-task attention (CA) module, a task token from each task branch serves as a query for exchanging information with the other task branches. In contrast to prior models, our proposed method extracts intrinsic features using the built-in self-attention mechanism of the ViT and has linear rather than quadratic memory and computation complexity. Comprehensive experiments are carried out on two benchmark datasets, NYU-Depth V2 (NYUDv2) and CityScapes, on which our proposed MTViT outperforms or is on par with existing convolutional neural network (CNN)-based MTL methods. In addition, we apply our method to a synthetic dataset in which task relatedness is controlled. Surprisingly, the experimental results reveal that MTViT performs excellently when tasks are less related.
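As a rough illustration of the cross-task attention idea, the hedged PyTorch sketch below lets one branch's task token query another branch's patch tokens; the class name, dimensions, and residual update are our assumptions for illustration, not the paper's code, and the real MTViT wires this exchange into every branch of the transformer.

```python
# Hedged sketch of the cross-task attention (CA) idea: the task token of one
# branch queries the patch tokens of another branch. Names (CrossTaskAttention,
# dim, num_heads) are illustrative assumptions, not from the paper.
import torch
import torch.nn as nn

class CrossTaskAttention(nn.Module):
    def __init__(self, dim: int, num_heads: int = 8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)

    def forward(self, task_token: torch.Tensor, other_tokens: torch.Tensor) -> torch.Tensor:
        # task_token: (B, 1, dim) query from this task's branch
        # other_tokens: (B, N, dim) keys/values from another task's branch
        exchanged, _ = self.attn(task_token, other_tokens, other_tokens)
        return task_token + exchanged  # residual update of the task token

# Usage: each branch's task token gathers information from the other branch.
B, N, D = 2, 196, 384
ca = CrossTaskAttention(D)
t_seg = torch.randn(B, 1, D)     # task token of, e.g., a segmentation branch
x_depth = torch.randn(B, N, D)   # patch tokens of, e.g., a depth branch
t_seg = ca(t_seg, x_depth)
```

Note that because a single task token attends over the N patch tokens of the other branch, the exchange scales linearly in N, which is consistent with the linear-complexity claim in the abstract.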
ISBN: (Print) 9783031789762; 9783031789779
We introduce SwitchPath, a novel stochastic activation function that enhances neural network exploration, performance, and generalization by probabilistically toggling between the activation of a neuron and its negation. SwitchPath draws inspiration from the analogies between neural networks and decision trees, as well as from the exploratory and regularizing properties of Dropout. Unlike Dropout, which intermittently reduces network capacity by deactivating neurons, SwitchPath maintains continuous activation, allowing networks to dynamically explore alternative information pathways while fully utilizing their capacity. Building on the concept of epsilon-greedy algorithms for balancing exploration and exploitation, SwitchPath improves generalization over traditional activation functions. The exploration of alternative paths happens during training without sacrificing computational efficiency. This paper presents the theoretical motivations, practical implementation, and empirical results, showcasing the described advantages of SwitchPath over established stochastic activation mechanisms.
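A minimal sketch of what such a stochastic toggle could look like, assuming a ReLU base activation and an element-wise Bernoulli switch; the class name, probability p, and the exact form of the negated path are illustrative assumptions, not the paper's definition.

```python
# Hedged sketch of a SwitchPath-style stochastic activation, assuming the
# toggle is between ReLU(x) and a negated counterpart -ReLU(-x) (the part of
# the signal ReLU would otherwise discard). Hyperparameters are illustrative.
import torch
import torch.nn as nn

class SwitchPathReLU(nn.Module):
    def __init__(self, p: float = 0.1):
        super().__init__()
        self.p = p  # probability of taking the negated path

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        if not self.training:
            return torch.relu(x)  # deterministic at inference, as with Dropout
        # Per-element Bernoulli switch between the two paths.
        switch = torch.rand_like(x) < self.p
        return torch.where(switch, -torch.relu(-x), torch.relu(x))
```

In contrast to Dropout, every unit in this sketch keeps transmitting a signal on one of the two paths, so the network's capacity stays fully in use while alternative pathways are explored.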
ISBN: (Print) 9783031301049; 9783031301056
Dropout is an effective strategy for the regularization of deep neural networks. Applying tabu to the units that were dropped in the most recent epoch, and retaining them for training, ensures diversification in dropout. In this paper, we improve the Tabu Dropout mechanism for training deep neural networks in two ways. First, we propose the use of a tabu tenure, i.e., the number of epochs during which a particular unit will not be dropped. Different tabu tenures provide diversification that boosts the training of deep neural networks across the search landscape. Second, we propose an adaptive tabu algorithm that automatically selects the tabu tenure based on training performance over the epochs. Experimental results on several standard benchmark datasets show that adaptive tabu dropout and tabu tenure dropout diversify training and perform significantly better than the standard dropout and basic Tabu Dropout mechanisms.
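The tenure mechanism can be sketched as follows, assuming a fixed tenure for simplicity (the paper's adaptive variant selects the tenure from training performance); function and variable names are illustrative assumptions.

```python
# Hedged sketch of the tabu-tenure idea: a unit dropped in the current epoch
# becomes "tabu" (protected from being dropped again) for the next `tenure`
# epochs. A fixed tenure is used here purely for illustration.
import numpy as np

rng = np.random.default_rng(0)

def tabu_dropout_mask(n_units: int, p: float, tabu_until: np.ndarray,
                      epoch: int, tenure: int) -> np.ndarray:
    droppable = tabu_until <= epoch               # units not currently tabu
    drop = droppable & (rng.random(n_units) < p)  # standard dropout among them
    tabu_until[drop] = epoch + tenure             # dropped units become tabu
    return ~drop                                  # keep-mask for this pass

# Usage across epochs: recently dropped units are guaranteed to stay active.
n = 8
tabu_until = np.zeros(n, dtype=int)
for epoch in range(3):
    mask = tabu_dropout_mask(n, p=0.5, tabu_until=tabu_until, epoch=epoch, tenure=2)
```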
Fuel cell degradation is one of the main challenges of hydrogen fuel cell vehicles, and it can be addressed with robust prediction techniques such as machine learning. In this research, a specific proton-exchange membrane fuel cell stack is considered, and the experimental data are imported to predict the future behavior of the stack. Four different neural network prediction algorithms are compared, and a deep neural network is selected. Furthermore, the Simcenter Amesim software is used, with its dynamic simulation capability, to calculate real-time fuel consumption, fuel cell degradation, and engine performance. Finally, to better understand how fuel cell degradation affects fuel consumption and life cycle emissions, a life cycle assessment is carried out as a complementary tool using the GREET software. The results show that a degraded proton-exchange membrane fuel cell stack can increase fuel consumption by 14.32% in the New European Driving Cycle and 13.9% in the FTP-75 driving cycle. The life cycle assessment results show that fuel cell degradation has a significant effect on fuel consumption and total emissions: a fuel cell with the predicted degradation will emit 26.4% more CO2 than a proton-exchange membrane fuel cell without degradation.
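As a hedged sketch of the prediction step only, the snippet below fits a small deep neural network regressor on synthetic stand-in data for stack voltage versus operating hours; the features, target, and architecture are assumptions for illustration, since the abstract does not specify the paper's exact inputs or network design.

```python
# Hedged sketch: extrapolating stack degradation (here, voltage drop over
# operating hours) with a small DNN regressor. The synthetic data, feature
# choice, and architecture are illustrative assumptions, not the paper's.
import numpy as np
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(42)
hours = np.linspace(0, 1000, 200).reshape(-1, 1)                   # operating time [h]
voltage = 3.3 - 1e-4 * hours.ravel() + rng.normal(0, 0.005, 200)   # stack voltage [V]

model = MLPRegressor(hidden_layer_sizes=(64, 64), max_iter=5000, random_state=0)
model.fit(hours, voltage)

# Extrapolate the future degradation behavior of the stack.
future = np.array([[1200.0], [1500.0]])
print(model.predict(future))
```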