Search criteria: "Subject term = SARSA algorithm"
25 records; showing 21-30
Glucose Level Control Using Temporal Difference Methods
25th Iranian Conference on Electrical Engineering (ICEE)
Authors: Noori, Amin; Sadrnia, Mohammad Ali; Sistani, Mohammad Bagher Naghibi
Affiliations: Shahrood Univ Technol, Control Engn, Shahrood, Iran; Ferdowsi Univ Mashhad, Control Engn, Mashhad, Iran
Control theory has been widely used in various fields; one of these areas is medical issues. Diabetes is one of the new topics of interest in control. Obtaining the rates for the injection of insulin automatically alwa...
Multi-robot collaboration based on Markov decision process in Robocup3D soccer simulation game
27th Chinese Control and Decision Conference, CCDC 2015
Authors: Cui, Xuanyu; Liang, Zhiwei; Yang, Yongyi; Ping, Shen; Wang, Jiawen; Liu, Haoran; Kai, Fan
Affiliation: College of Automation, Nanjing University of Posts and Telecommunications, Nanjing, China
Close collaboration and desired strategy is indispensable for humanoid robots in the RoboCup soccer competition. In order to solve the problem that the convergence rate is too low in training local strategies, this pa...
Multi-robot Collaboration Based on Markov Decision Process in Robocup3D Soccer Simulation Game
27th Chinese Control and Decision Conference
Authors: Cui Xuanyu; Liang Zhiwei; Yang Yongyi; Shen Ping; Wang Jiawen; Liu Haoran; Fan Kai
Affiliation: College of Automation, Nanjing University of Posts and Telecommunications
Close collaboration and desired strategy is indispensable for humanoid robots in the RoboCup soccer competition. In order to solve the problem that the convergence rate is too low in training local strategies, this paper mainly pr...
Urban Traffic Signal Learning Control Using SARSA Algorithm Based on Adaptive RBF Network
International Conference on Measuring Technology and Mechatronics Automation
Authors: Li Chun-gui; Wang Meng; Yang Shu-Hong; Zhang Zeng-fang
Affiliation: Guangxi Univ Technol, Dept Comp Engn, Liuzhou, Peoples R China
Urban traffic control is very complicated, so building a precise mathematical model for it is very difficult. In this paper, we use the SARSA reinforcement learning algorithm to control the traffic signal, thus the dec...
Blackjack as a test bed for learning strategies in neural networks
2nd IEEE World Congress on Computational Intelligence (WCCI 98)
Authors: Perez-Uribe, A; Sanchez, E
Affiliation: Swiss Fed Inst Technol, Dept Comp Sci, Log Syst Lab, CH-1015 Lausanne, Switzerland
Blackjack or twenty-one is a card game where the player attempts to beat the dealer, by obtaining a sum of card values that is equal to or less than 21 so that his total is higher than the dealer's. The probabilis...