Optimization-based and sampling-based algorithms are two branches of methods in machine learning. While existing safe reinforcement learning (RL) algorithms are mainly based on optimization, it remains unclear whether sampling-based methods can achieve desirable performance with a safe policy. This paper formulates the Langevin policy for safe RL and proposes Langevin Actor-Critic (LAC) to accelerate policy inference. Concretely, instead of a parametric policy, the proposed Langevin policy provides a stochastic process that directly infers actions; it is the numerical solver of the Langevin dynamics of actions in continuous time. Furthermore, to make the Langevin policy practical on RL tasks, LAC accumulates the transitions induced by the Langevin policy and reproduces them with a generator. Finally, extensive empirical results show the effectiveness and superiority of LAC on MuJoCo-based and Safety Gym tasks. Our implementation is available at https://***/Lfh404/LAC. Copyright 2024 by the author(s)
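The core idea of the abstract above — inferring an action by running Langevin dynamics on a value function rather than evaluating a parametric policy — can be illustrated with a minimal one-dimensional sketch. The toy critic `q_value`, its gradient, and all step-size/iteration parameters below are assumptions for illustration, not the paper's actual setup; the discretized update `a ← a + (η/2)∇Q(a) + √η · ξ` is the standard Euler–Maruyama solver for Langevin dynamics, whose stationary distribution is proportional to exp(Q(a)).

```python
import math
import random

# Hypothetical 1-D critic peaking at a = 0.5 (an illustrative assumption,
# standing in for a learned Q-function).
def q_value(a):
    return -(a - 0.5) ** 2

def grad_q(a):
    # Analytic gradient of the toy critic above.
    return -2.0 * (a - 0.5)

def langevin_action(step_size=0.01, n_steps=500, seed=0):
    """Infer an action by simulating Langevin dynamics on Q(a).

    Each step drifts along the critic's gradient and injects Gaussian
    noise, so the iterates approximately sample from exp(Q(a)) instead of
    being produced by a parametric policy network.
    """
    rng = random.Random(seed)
    a = rng.uniform(-1.0, 1.0)  # arbitrary initial action
    for _ in range(n_steps):
        noise = rng.gauss(0.0, 1.0)
        a = a + 0.5 * step_size * grad_q(a) + math.sqrt(step_size) * noise
    return a
```

Because each call simulates a full stochastic process, action inference is slow relative to a single network forward pass; this is the cost that the paper's generator (which "reproduces" Langevin-induced transitions) is meant to amortize.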
With the rapid development of the mobile Internet, users generate massive data in different forms on social networks every day, and different characteristics of users are reflected by these social media data. How to integrate multiple heterogeneous information sources and establish user profiles from multiple perspectives plays an important role in providing personalized services, marketing, and recommendation. In this paper, we propose Multi-source & Multi-task Learning for User Profiles in Social Network, which integrates multiple social data sources and contains a multi-task learning framework to simultaneously predict various attributes of a user. First, we design dedicated feature extraction models for the multiple heterogeneous data sources. Second, we design a shared layer to fuse the multiple heterogeneous data sources into a general shared representation for multi-task learning. Third, we design each task's own unique presentation layer for discriminant output of attributes. Finally, we design a weighted loss function to improve the learning efficiency and prediction accuracy of each task. Experimental results on more than 5000 Sina Weibo users demonstrate that our approach outperforms state-of-the-art baselines for inferring the gender, age, and region of social media users.
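The shared-layer-plus-task-heads architecture and the weighted loss described above can be sketched in a few lines. Everything here — the linear layers, the function names, the example weights — is a simplifying assumption for illustration (the paper's actual model and loss weighting are not specified in this abstract); the sketch only shows the structural pattern: one fused representation feeding several task-specific output layers, combined by a weighted sum of losses.

```python
def shared_layer(features, weights):
    """Fuse source features into one shared representation (a single
    linear map here, standing in for the paper's fusion layer)."""
    return [sum(w * x for w, x in zip(row, features)) for row in weights]

def task_head(shared, head_weights):
    """Each task (e.g. gender, age, region) gets its own output layer
    on top of the shared representation."""
    return sum(w * h for w, h in zip(head_weights, shared))

def weighted_loss(task_losses, task_weights):
    """Combine per-task losses with task-specific weights, so easier or
    more important tasks can be emphasized during joint training."""
    return sum(w * l for w, l in zip(task_weights, task_losses))
```

In a real multi-task setup the shared layer's parameters receive gradients from every task head, which is what lets the heterogeneous sources regularize one another; the per-task weights then control how much each attribute-prediction task steers the shared representation.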