Search criteria: "Institution = MicroMasters Program in Statistics and Data Science"
3 records
Enhancing Information Freshness: An AoI Optimized Markov Decision Process Dedicated in The Underwater Task
arXiv, 2024
Authors: Xu, Jingzehua; Ding, Yimian; Yang, Yiyuan; Xie, Guanwen; Zhang, Shuai
Affiliations: MicroMasters Program in Statistics and Data Science, Massachusetts Institute of Technology, United States; Department of Computer Science, University of Oxford, United Kingdom; Department of Data Science, New Jersey Institute of Technology, United States
Ocean exploration utilizing autonomous underwater vehicles (AUVs) via reinforcement learning (RL) has emerged as a significant research focus. However, underwater tasks have mostly failed due to the observation delay ...
Large Language Models as Efficient Reward Function Searchers for Custom-Environment Multi-Objective Reinforcement Learning
arXiv, 2024
Authors: Xie, Guanwen; Xu, Jingzehua; Yang, Yiyuan; Ding, Yimian; Zhang, Shuai
Affiliations: MicroMasters Program in Statistics and Data Science, Massachusetts Institute of Technology, United States; Department of Computer Science, University of Oxford, United Kingdom; Department of Data Science, New Jersey Institute of Technology, United States
Achieving the effective design and improvement of reward functions in reinforcement learning (RL) tasks with complex custom environments and multiple requirements presents considerable challenges. In this paper, we pr...
A Dynamical Clipping Approach with Task Feedback for Proximal Policy Optimization
arXiv, 2023
Authors: Zhang, Ziqi; Xu, Jingzehua; Zhuang, Zifeng; Zhang, Hongyin; Liu, Jinxin; Wang, Donglin; Zhang, Shuai; Lyu, Shangke
Affiliations: School of Engineering, WestLake University, China; MicroMasters Program in Statistics and Data Science, Massachusetts Institute of Technology, United States; Department of Data Science, New Jersey Institute of Technology, United States
Proximal Policy Optimization (PPO) has been broadly applied to robotics learning, showcasing stable training performance. However, the fixed clipping bound setting may limit the performance of PPO. Specifically, there...