咨询与建议

限定检索结果

文献类型

  • 15 篇 会议
  • 10 篇 期刊文献

馆藏范围

  • 25 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 24 篇 工学
    • 10 篇 电气工程
    • 10 篇 计算机科学与技术...
    • 9 篇 控制科学与工程
    • 4 篇 机械工程
    • 4 篇 仪器科学与技术
    • 3 篇 信息与通信工程
    • 3 篇 石油与天然气工程
    • 3 篇 软件工程
    • 2 篇 动力工程及工程热...
    • 2 篇 电子科学与技术(可...
    • 2 篇 交通运输工程
    • 2 篇 生物医学工程(可授...
    • 1 篇 安全科学与工程
    • 1 篇 公安技术
    • 1 篇 网络空间安全
  • 6 篇 理学
    • 3 篇 数学
    • 2 篇 化学
    • 2 篇 统计学(可授理学、...
    • 1 篇 系统科学
  • 4 篇 管理学
    • 4 篇 管理科学与工程(可...
  • 2 篇 经济学
    • 2 篇 应用经济学
  • 1 篇 医学
    • 1 篇 基础医学(可授医学...

主题

  • 25 篇 sarsa algorithm
  • 16 篇 reinforcement le...
  • 3 篇 path planning
  • 3 篇 q-learning
  • 2 篇 function approxi...
  • 2 篇 dynamic role ass...
  • 2 篇 markov decision ...
  • 2 篇 robocup
  • 1 篇 self-learning al...
  • 1 篇 q-learning algor...
  • 1 篇 genetic algorith...
  • 1 篇 state discretiza...
  • 1 篇 adaptive control...
  • 1 篇 traffic signal
  • 1 篇 maximal control ...
  • 1 篇 diabetes
  • 1 篇 power grids
  • 1 篇 inspect
  • 1 篇 grid integration
  • 1 篇 intelligent body...

机构

  • 2 篇 suzhou univ sci ...
  • 2 篇 college of autom...
  • 2 篇 suzhou univ sci ...
  • 2 篇 suzhou univ sci ...
  • 1 篇 guangxi univ tec...
  • 1 篇 chongqing univ t...
  • 1 篇 donghua univ eng...
  • 1 篇 soochow univ sch...
  • 1 篇 vellore institut...
  • 1 篇 vellore institut...
  • 1 篇 key laboratory o...
  • 1 篇 wuhan univ sch c...
  • 1 篇 pes university b...
  • 1 篇 key laboratory o...
  • 1 篇 college of compu...
  • 1 篇 shanghai jiao to...
  • 1 篇 swiss fed inst t...
  • 1 篇 jiangsu univ ins...
  • 1 篇 key lab smart en...
  • 1 篇 xinjiang univ sc...

作者

  • 2 篇 fu qiming
  • 2 篇 chen jianping
  • 2 篇 hu lingyao
  • 2 篇 hu wen
  • 2 篇 yang yongyi
  • 2 篇 cui xuanyu
  • 2 篇 liu haoran
  • 2 篇 liang zhiwei
  • 2 篇 wang jiawen
  • 1 篇 huang chen
  • 1 篇 jia liruizhi
  • 1 篇 fan huahao
  • 1 篇 hoda nasereddin
  • 1 篇 renjith p.n.
  • 1 篇 fan kai
  • 1 篇 lin wei
  • 1 篇 kang han
  • 1 篇 shanta rangaswam...
  • 1 篇 zhang yuxin
  • 1 篇 kai fan

语言

  • 25 篇 英文
检索条件"主题词=SARSA algorithm"
25 条 记 录,以下是11-20 订阅
排序:
Optimal Decision-Making Method for a Plug-In Electric Taxi in Uncertain Environment
收藏 引用
IEEE ACCESS 2021年 9卷 62467-62477页
作者: You, Yang Zhu, Jisong Huang, Yichuan Jing, Zhaoxia South China Univ Technol Sch Elect Power Engn Guangzhou 510640 Peoples R China
This paper studies the optimal decision-making problem for a plug-in electric taxi (PET) in a time-varying complex environment, i.e., a passenger environment, charging station environment, traffic environment, and tax... 详细信息
来源: 评论
A Case Study: Characterization of Performance Inconsistency for Reinforcement Learning on Flappy Bird Game  12
A Case Study: Characterization of Performance Inconsistency ...
收藏 引用
12th International Conference on ICT Convergence (ICTC) - Beyond the Pandemic Era with ICT Convergence Innovation
作者: Shakerimov, Aidar Li, Dmitriy Park, Jurn-Gyu Nazarbayev Univ Comp Sci Sch Engn & Digital Sci Nur Sultan Kazakhstan
One of the serious problems in Reinforcement Learning (RL) algorithms is that their performance usually varies when the same experiment is repeated or reproduced. Although RL results are hard to reproduce due to algor... 详细信息
来源: 评论
Power Control Research for Device-to-Device Wireless Network Underlying Reinforcement Learning
Power Control Research for Device-to-Device Wireless Network...
收藏 引用
作者: Kang Han Chengyin Ye College of Computer and Communication LiaoNing Petrochemical University
Aiming at the problem that co-channel interference leads to decrease of system data throughput when reusing cellular user spectrum resources, a D2D communication link power control method combined with reinforcement l... 详细信息
来源: 评论
AGV Path Planning Model based on Reinforcement Learning
AGV Path Planning Model based on Reinforcement Learning
收藏 引用
Chinese Automation Congress (CAC)
作者: Liao, Xiaofei Wang, Yang Xuan, Yiliang Wu, Dequan Donghua Univ Coll Informat Sci & Technol Shanghai 201620 Peoples R China Donghua Univ Engn Res Ctr Digitized Text & Fash Technol Minist Educ Shanghai 201620 Peoples R China
With the rapid growth of logistics transportation, automated guided vehicle (AGV) technologY has developed speedily. Path planning is one of the key research topics of AGV. It is difficult to plan an optimal path from... 详细信息
来源: 评论
A Texas Hold'em decision model based on Reinforcement Learning  32
A Texas Hold'em decision model based on Reinforcement Learni...
收藏 引用
32nd Chinese Control And Decision Conference (CCDC)
作者: Zhang, XiaoChuan Li, Yi Chongqing Univ Technol Dept Comp Sci Chongqing Peoples R China
Texas Hold 'em is a typical example of computer incomplete information game. The traditional machine learning method has been unable to deal with the huge search state space of Texas Hold 'em. In this paper, t... 详细信息
来源: 评论
Backward Q-learning: The combination of sarsa algorithm and Q-learning
收藏 引用
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE 2013年 第9期26卷 2184-2193页
作者: Wang, Yin-Hao Li, Tzuu-Hseng S. Lin, Chih-Jui Natl Cheng Kung Univ Dept Elect Engn AiRobots Lab Tainan 70101 Taiwan
Reinforcement learning (RI) has been applied to many fields and applications, but there are still some dilemmas between exploration and exploitation strategy for action selection policy. The well-known areas of reinfo... 详细信息
来源: 评论
Reinforcement Learning algorithms in Global Path Planning for Mobile Robot
Reinforcement Learning Algorithms in Global Path Planning fo...
收藏 引用
International Conference on Industrial Engineering, Applications and Manufacturing (ICIEAM)
作者: Sichkar, Valentyn N. ITMO Univ Dept Control Syst & Robot St Petersburg Russia
The paper is devoted to the research of two approaches for global path planning for mobile robots, based on Q-Learning and sarsa algorithms. The study has been done with different adjustments of two algorithms that ma... 详细信息
来源: 评论
A sarsa-based adaptive controller for building energy conservation
收藏 引用
JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING 2018年 第2期18卷 329-338页
作者: Fu, Qiming Hu, Lingyao Wu, Hongjie Hu, Fuyuan Hu, Wen Chen, Jianping Suzhou Univ Sci & Technol Inst Elect & Informat Engn Suzhou 215009 Jiangsu Peoples R China Suzhou Univ Sci & Technol Jiangsu Key Lab Intelligent Bldg Energy Efficienc Suzhou 215009 Jiangsu Peoples R China Suzhou Univ Sci & Technol Suzhou Key Lab Mobile Networking & Appl Technol Suzhou 215009 Jiangsu Peoples R China
In the field of building equipment control, the traditional methods have some problems - instability and slow convergence. To deal with these problems, a new sarsa-based adaptive controller, SAC (sarsa-based adaptive ... 详细信息
来源: 评论
Hybrid Robotic Reinforcement Learning for Inspection/Correction Tasks
收藏 引用
Procedia Manufacturing 2019年 39卷 406-413页
作者: Hoda Nasereddin Gerald M. Knapp Louisiana State University 3277 Patrick F. Taylor Hall Baton Rouge 70803 USA Louisiana State University 3240-A Patrick F. Taylor Hall Baton Rouge 70803 USA
The ability to rapidly program robots for complex tasks is an important precursor to wider adoption of robotics in industry. Robot programming is often time consuming and brittle to unanticipated variations in process... 详细信息
来源: 评论
Online Energy Management and Heterogeneous Task Scheduling for Smart Communities with Residential Cogeneration and Renewable Energy
收藏 引用
ENERGIES 2018年 第8期11卷 2104-2104页
作者: Cao, Yongsheng Zhang, Guanglin Li, Demin Wang, Lin Li, Zongpeng Donghua Univ Minist Educ Coll Informat Sci & Technol Engn Res Ctr Digitized Text & Fash Technol Shanghai 201620 Peoples R China Shanghai Jiao Tong Univ Dept Automat Shanghai 200240 Peoples R China Wuhan Univ Sch Comp Sci Wuhan 430072 Hubei Peoples R China
With the development of renewable energy technology and communication technology in recent years, many residents now utilize renewable energy devices in their residences with energy storage systems. We have full confi... 详细信息
来源: 评论