Search query: "Subject = Proximal Policy Optimization Algorithm"
24 records; showing 11-20

A dynamic flexible job shop scheduling method based on collaborative agent reinforcement learning
FLEXIBLE SERVICES AND MANUFACTURING JOURNAL, 2024, pp. 1-33
Authors: Shao, Changshun; Yu, Zhenglin; Ding, Hongchang; Cao, Guohua; Ding, Kaifang; Duan, Jingsong
Affiliations: Changchun Univ Sci & Technol Coll Mech & Elect Engn Changchun 130022 Jilin Peoples R China; Changchun Univ Sci & Technol Chongqing Res Inst Chongqing 401135 Peoples R China
This paper presents an innovative approach to solve the Dynamic Flexible Job Shop Scheduling Problem (DFJSP). Our method aims to enhance production efficiency by minimizing the average total tardiness. To achieve this...

System-Level Predictive Maintenance Optimization for No-Wait Production Machine-Robot Collaborative Environment under Economic Dependency and Hybrid Fault Mode
PROCESSES, 2024, vol. 12, no. 8, p. 1690
Authors: Hu, Bing; Chen, Zhaoxiang; Zhen, Mengzi; Chen, Zhen; Pan, Ershun
Affiliations: Shanghai Jiao Tong Univ Dept Ind Engn & Management State Key Lab Mech Syst & Vibrat Shanghai 200240 Peoples R China; Shanghai Baosight Software Co Ltd Shanghai 201203 Peoples R China
For manufacturing systems such as hot rolling, where there is no wait in the production process, breaks between adjacent production batches provide "opportunities" for predictive maintenance. With the extens...

Research on Data-Driven Optimal Scheduling of Power System
ENERGIES, 2023, vol. 16, no. 6, p. 2926
Authors: Luo, Jianxun; Zhang, Wei; Wang, Hui; Wei, Wenmiao; He, Jinpeng
Affiliations: Qilu Univ Technol Shandong Acad Sci Sch Informat & Automat Jinan Peoples R China; Shandong Univ Dept Elect Engn Jinan 250061 Peoples R China; Huazhong Univ Sci & Technol Automat Acad Wuhan 430074 Peoples R China
The uncertainty of output makes it difficult to effectively solve the economic security dispatching problem of the power grid when a high proportion of renewable energy generating units are integrated into the power g...

Online Altitude Control and Scheduling Policy for Minimizing AoI in UAV-Assisted IoT Wireless Networks
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, vol. 21, no. 7, pp. 2493-2505
Authors: Samir, Moataz; Assi, Chadi; Sharafeddine, Sanaa; Ghrayeb, Ali
Affiliations: Concordia Univ Concordia Inst Informat Syst Engn CIISE Montreal PQ H3G 1M8 Canada; Lebanese Amer Univ Sch Arts & Sci SAS Beirut 11022801 Lebanon; Texas A&M Univ Qatar Elect & Comp Engn ECE Dept Doha 23874 Qatar
This article considers unmanned aerial vehicle (UAV) assisted Internet of Things (IoT) networks, where low resource IoT devices periodically sample a stochastic process and need to upload more recent information to a ...

AUV Dynamic Obstacle Avoidance Method Based on Improved PPO Algorithm
IEEE ACCESS, 2022, vol. 10, pp. 121340-121351
Authors: Zhu, Guohao; Shen, Zhou; Liu, Laiyuan; Zhao, Sicong; Ji, Fangzheng; Ju, Zixia; Sun, Jialong
Affiliations: Jiangsu Ocean Univ Sch Geomat & Marine Informat Lianyungang 222001 Peoples R China; Jiangsu Marine Resources Dev Res Inst Lianyungang 222005 Peoples R China; Jiangsu Ocean Univ Coinnovat Ctr Jiangsu Marine Bioind Technol Lianyungang 222001 Peoples R China; Jiangsu Ocean Univ Jiangsu Key Lab Marine Bioresources & Environm Jiangsu Key Lab Marine Biotechnol Lianyungang 222001 Peoples R China; Minist Nat Resources Marine Informat Technol Innovat Ctr Tianjin 300171 Peoples R China
Designing a reasonable obstacle avoidance method for AUV 3D path planning is difficult, and existing obstacle avoidance methods have certain drawbacks. For example, they are only applicable to 2D planar applications a...

Research on 3D Observation Path Planning Method for Mobile Platforms Based on Near-End Strategy Optimization
2024 International Conference on Guidance, Navigation and Control
Authors: Zhang, Jing Jing; Dong, Peng Da; Shi, Wen; Liu, Xin Yu; Yu, Cong Rui
Affiliations: Harbin Engn Univ Coll Intelligent Syst Sci & Engn Harbin 150000 Peoples R China; Naval Equipment Dept Peoples Liberat Army Chinese Equipment Project Management Ctr Project Management Ctr Beijing 100000 Peoples R China; China Ship Dev & Design Cente Underwater Part Hubei 430000 Peoples R China
It has been challenging for mobile observation platforms to solve the path planning problem in a three-dimensional dynamic marine environment. On the one hand, traditional path planning algorithms are highly dependent...

Dynamic flexible job shop scheduling algorithm based on deep reinforcement learning
35th Chinese Control and Decision Conference (CCDC)
Authors: Zhao, Tianrui; Wang, Yanhong; Tan, Yuanyuan; Zhang, Jun
Affiliations: Shenyang Univ Technol Coll Artificial Intelligence Shenyang 145558 Peoples R China; Shenyang Univ Technol Coll Artificial Intelligence Shenyang 12326 Peoples R China; Shenyang Univ Technol Coll Artificial Intelligence Shenyang 52429 Peoples R China
The dynamic scheduling problem is a hot topic of current research. To solve the dynamic flexible job shop scheduling problem, an improved composite scheduling rule algorithm based on proximal policy optimization is pr...

A Futures Quantitative Trading Strategy Based on a Deep Reinforcement Learning Algorithm
IEEE 8th International Conference on Big Data Analytics (ICBDA)
Authors: Chen, Xuemei; Guo, Haoran
Affiliations: Wuhan Univ Sch Informat Management Wuhan Peoples R China; Shanghai Jiao Tong Univ Ningbo Inst Artificial Intelligence Ningbo Peoples R China
Deep reinforcement learning (DRL) is a type of machine learning algorithm that has gained a lot of attention for its application in the financial field. Based on the proximal policy optimization algorithm (PPO) in dee...

Multi-device cooperative reactive power optimization control strategy for high percentage distributed photovoltaic distribution grid based on proximal strategy optimization algorithm
9th International Conference on Energy System, Electricity, and Power, ESEP 2024
Authors: Zhang, Liyuan; Ren, Guitian; Chen, Shangyue; Zhang, Xiaotong
Affiliations: Chengxi Power Supply Branch of State Grid Tianjin Electric Power Company Tianjin 300190 China
A high proportion of strongly intermittent distributed PV is connected to the distribution network in a decentralized and disordered manner, which leads to the difficulty of its reactive power balance. This paper prop...

Application of Deep Reinforcement Learning in Guandan Game
34th Chinese Control and Decision Conference (CCDC)
Authors: Pan, Jiahong; Zhang, Zhongtian; Shen, Hengheng; Zeng, Yi; Wu, Lei
Affiliations: Anhui Univ Sch Comp Sci & Technol Hefei 230601 Peoples R China
In recent years, imperfect information game has become an important touchstone to test the level of artificial intelligence. There are many imperfect information game scenarios in the real-world, such as economic tran...