Search query: "Subject term = PPO Algorithm"
27 records; results 21-30 shown below
A policy optimization algorithm based on sample adaptive reuse and dual-clipping for robotic action control
APPLIED SOFT COMPUTING, 2023, vol. 134
Authors: Zhao, Li-yang; Chang, Tian-qing; Zhang, Jie; Zhang, Lei; Chu, Kai-xuan; Guo, Li-bin; Kong, De-peng
Affiliations: Army Acad Armored Forces, Dept Weaponry & Control, Beijing 100072, Peoples R China; Naval Res Inst, Beijing 100161, Peoples R China; Unit 63963 PLA, Beijing 100072, Peoples R China
When applying deep reinforcement learning in the real physical environment for decision-making, how to improve the sample efficiency while ensuring training stability is an urgent problem that needs to be solved. In o...
Optimization of Classroom Interaction Strategies Based on Deep Reinforcement Learning
Proceedings of the 2025 International Conference on Big Data and Informatization Education
Authors: Ying Li; Jiaqi Liu; Wei Ji; Yantao Li
Affiliations: School of Marxism, Guangdong University of Science and Technology, Dongguan, Guangdong, China; School of Creative Design, Dongguan City University, Dongguan, Guangdong, China; Guangdong University of Science and Technology, Dongguan, Guangdong, China; Scientific Research Office, Guangdong University of Science and Technology, Dongguan, Guangdong, China
With the rapid development of information technology, intelligent education has gradually become an important direction of modern education reform. Classroom interaction, as a key factor in improving students' lea...
Pressure control of Once-through steam generator using Proximal policy optimization algorithm
ANNALS OF NUCLEAR ENERGY, 2022, vol. 175
Authors: Li, Cheng; Yu, Ren; Yu, Wenmin; Wang, Tianshu
Affiliations: Naval Univ Engn, Wuhan 430033, Peoples R China; China Nucl Power Operat Technol Corp LTD, Wuhan 430000, Peoples R China
Due to the strong coupling characteristics of the once-through steam generator (OTSG), the outlet pressure control is difficult. The control system using the Proximal Policy Optimization (PPO) algorithm is designed to c...
Robotic Arm Motion Planning Based on Residual Reinforcement Learning
13th International Conference on Computer and Automation Engineering (ICCAE)
Authors: Zhou, Dongxu; Jia, Ruiqing; Yao, Haifeng; Xie, Mingzuo
Affiliations: China Univ Min & Technol Beijing, Sch Mech Elect & Informat Engn, Beijing, Peoples R China
The application of reinforcement learning algorithms to motion planning is a research hotspot in robotics in recent years. However, training reinforcement learning agents from scratch has low training efficiency and d...
Multi-Agent Path Planning Based on Deep Reinforcement Learning
32nd Chinese Process Control Conference (CPCC2021)
Authors: Bai Weisong; Zhang Chunmei; Guo Hongge; Shao Yang
Affiliations: Taiyuan University of Science and Technology
Aiming at the problem of Multi-Agent Path Planning (MAPP), the current algorithms have the disadvantages of large data dimensions and complex *** this paper, A* algorithm and Proximal Policy Optimization (PPO) algorithm a...
A PID Gain Adjustment Scheme Based on Reinforcement Learning algorithm for a Quadrotor
39th Chinese Control Conference (CCC)
Authors: Zheng Qingqing; Tang Renjie; Gou Siyuan; Zhang Weizhong
Affiliations: Beijing Inst Technol, Sch Aerosp Engn, Beijing 100081, Peoples R China
In this paper, a PID gain adjustment scheme with the basis on Reinforcement Learning algorithm is proposed, the validity of the scheme is demonstrated with the application to the control of a quadrotor. Specifically, ...
A PID Gain Adjustment Scheme Based on Reinforcement Learning algorithm for a Quadrotor
39th Chinese Control Conference (Chinese-language proceedings record)
Authors: Zheng Qingqing; Tang Renjie; Gou Siyuan; Zhang Weizhong
Affiliations: School of Aerospace Engineering, Beijing Institute of Technology
In this paper, a PID gain adjustment scheme with the basis on Reinforcement Learning algorithm is proposed, the validity of the scheme is demonstrated with the application to the control of a quadrotor. Specifically, ...