咨询与建议

限定检索结果

文献类型

  • 15 篇 会议
  • 10 篇 期刊文献

馆藏范围

  • 25 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 24 篇 工学
    • 10 篇 电气工程
    • 10 篇 计算机科学与技术...
    • 9 篇 控制科学与工程
    • 4 篇 机械工程
    • 4 篇 仪器科学与技术
    • 3 篇 信息与通信工程
    • 3 篇 石油与天然气工程
    • 3 篇 软件工程
    • 2 篇 动力工程及工程热...
    • 2 篇 电子科学与技术(可...
    • 2 篇 交通运输工程
    • 2 篇 生物医学工程(可授...
    • 1 篇 安全科学与工程
    • 1 篇 公安技术
    • 1 篇 网络空间安全
  • 6 篇 理学
    • 3 篇 数学
    • 2 篇 化学
    • 2 篇 统计学(可授理学、...
    • 1 篇 系统科学
  • 4 篇 管理学
    • 4 篇 管理科学与工程(可...
  • 2 篇 经济学
    • 2 篇 应用经济学
  • 1 篇 医学
    • 1 篇 基础医学(可授医学...

主题

  • 25 篇 sarsa algorithm
  • 16 篇 reinforcement le...
  • 3 篇 path planning
  • 3 篇 q-learning
  • 2 篇 function approxi...
  • 2 篇 dynamic role ass...
  • 2 篇 markov decision ...
  • 2 篇 robocup
  • 1 篇 self-learning al...
  • 1 篇 q-learning algor...
  • 1 篇 genetic algorith...
  • 1 篇 state discretiza...
  • 1 篇 adaptive control...
  • 1 篇 traffic signal
  • 1 篇 maximal control ...
  • 1 篇 diabetes
  • 1 篇 power grids
  • 1 篇 inspect
  • 1 篇 grid integration
  • 1 篇 intelligent body...

机构

  • 2 篇 suzhou univ sci ...
  • 2 篇 college of autom...
  • 2 篇 suzhou univ sci ...
  • 2 篇 suzhou univ sci ...
  • 1 篇 guangxi univ tec...
  • 1 篇 chongqing univ t...
  • 1 篇 donghua univ eng...
  • 1 篇 soochow univ sch...
  • 1 篇 vellore institut...
  • 1 篇 vellore institut...
  • 1 篇 key laboratory o...
  • 1 篇 wuhan univ sch c...
  • 1 篇 pes university b...
  • 1 篇 key laboratory o...
  • 1 篇 college of compu...
  • 1 篇 shanghai jiao to...
  • 1 篇 swiss fed inst t...
  • 1 篇 jiangsu univ ins...
  • 1 篇 key lab smart en...
  • 1 篇 xinjiang univ sc...

作者

  • 2 篇 fu qiming
  • 2 篇 chen jianping
  • 2 篇 hu lingyao
  • 2 篇 hu wen
  • 2 篇 yang yongyi
  • 2 篇 cui xuanyu
  • 2 篇 liu haoran
  • 2 篇 liang zhiwei
  • 2 篇 wang jiawen
  • 1 篇 huang chen
  • 1 篇 jia liruizhi
  • 1 篇 fan huahao
  • 1 篇 hoda nasereddin
  • 1 篇 renjith p.n.
  • 1 篇 fan kai
  • 1 篇 lin wei
  • 1 篇 kang han
  • 1 篇 shanta rangaswam...
  • 1 篇 zhang yuxin
  • 1 篇 kai fan

语言

  • 25 篇 英文
检索条件"主题词=Sarsa algorithm"
25 条 记 录,以下是1-10 订阅
排序:
Backward Q-learning: The combination of sarsa algorithm and Q-learning
收藏 引用
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE 2013年 第9期26卷 2184-2193页
作者: Wang, Yin-Hao Li, Tzuu-Hseng S. Lin, Chih-Jui Natl Cheng Kung Univ Dept Elect Engn AiRobots Lab Tainan 70101 Taiwan
Reinforcement learning (RI) has been applied to many fields and applications, but there are still some dilemmas between exploration and exploitation strategy for action selection policy. The well-known areas of reinfo... 详细信息
来源: 评论
Residual sarsa algorithm with function approximation
收藏 引用
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS 2019年 第1-Sup期22卷 795-807页
作者: Fu Qiming Hu Wen Liu Quan Luo Heng Hu Lingyao Chen Jianping Suzhou Univ Sci & Technol Inst Elect & Informat Engn Suzhou 215009 Peoples R China Suzhou Univ Sci & Technol Jiangsu Key Lab Intelligent Bldg Energy Efficienc Suzhou 215009 Jiangsu Peoples R China Suzhou Univ Sci & Technol Suzhou Key Lab Mobile Networking & Appl Technol Suzhou 215009 Jiangsu Peoples R China Soochow Univ Sch Comp Sci & Technol Suzhou 215000 Jiangsu Peoples R China Jilin Univ Minist Educ Key Lab Symbol Computat & Knowledge Engn Changchun 130012 Jilin Peoples R China Collaborat Innovat Ctr Novel Software Technol & I Nanjing 210000 Jiangsu Peoples R China
In this work, we proposed an efficient algorithm named the residual sarsa algorithm with function approximation (FARS) to improve the performance of the traditional sarsa algorithm, and we use the gradient-descent met... 详细信息
来源: 评论
Urban Traffic Signal Learning Control Using sarsa algorithm Based on Adaptive RBF Network
Urban Traffic Signal Learning Control Using SARSA Algorithm ...
收藏 引用
International Conference on Measuring Technology and Mechatronics Automation
作者: Li Chun-gui Wang Meng Yang Shu-Hong Zhang Zeng-fang Guangxi Univ Technol Dept Comp Engn Liuzhou Peoples R China
Urban traffic control is very complicated, so to build a precise mathematical model for it is very difficult. In this paper, we use the sarsa reinforcement leaning algorithm to control the traffic signal, thus the dec... 详细信息
来源: 评论
Research for UAV Path Planning Method Based on Guided sarsa algorithm  2
Research for UAV Path Planning Method Based on Guided Sarsa ...
收藏 引用
2nd IEEE International Conference on Software Engineering and Artificial Intelligence (SEAI) / 7th International Workshop on Pattern Recognition (IWPR)
作者: He Boming Lin Wei Mei Fuzeng Fan Huahao Wuhan Univ Technol Wuhan Peoples R China
As unmanned aerial vehicles (UAV) are widely used in unknown and complex environments, their path planning capabilities face higher requirements. In many cases, UAV cannot obtain the environmental information of the t... 详细信息
来源: 评论
Real-Time Network Optimization in 6G minimizing End-to-End Delay Using sarsa algorithm
Real-Time Network Optimization in 6G minimizing End-to-End D...
收藏 引用
2024 IEEE International Students' Conference on Electrical, Electronics and Computer Science, SCEECS 2024
作者: Chauhan, Tania Renjith, P.N. Vellore Institute of Technology Chennai India Vellore Institute of Technology School of Computer Science and Engineering Chennai India
As 6G networks take off, there is an increasing need for effective system optimization strategies that can decrease end-to-end (e2e) delay and improve the general efficiency of the network. The sarsa (State-Action-Rew... 详细信息
来源: 评论
Time-Sensitive Network Simulation for In-Vehicle Ethernet Using sarsa algorithm
收藏 引用
WORLD ELECTRIC VEHICLE JOURNAL 2024年 第1期15卷 21页
作者: Huang, Chen Wang, Yiqi Zhang, Yuxin Jilin Univ Coll Automot Engn Changchun 130012 Peoples R China Jiangsu Univ Inst Automot Engn Zhenjiang 212013 Peoples R China
In order to more accurately analyze the problem of time delay simulation and calculation in the time-sensitive network (TSN) of vehicular Ethernet, a TSN reservation class data delay analysis model improved based on t... 详细信息
来源: 评论
An Online Reinforcement Learning-Based Energy Management Strategy for Microgrids With Centralized Control
收藏 引用
IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS 2025年 第1期61卷 1501-1510页
作者: Meng, Qinglin Hussain, Sheharyar Luo, Fengzhang Wang, Zhongguan Jin, Xiaolong State Grid Tianjin Elect Power Co Comprehens Serv Ctr Tianjin 300010 Peoples R China Tianjin Univ Sch Elect & Informat Engn Tianjin 300072 Peoples R China Tiangong Univ Sch Elect Engn Tianjin 300387 Peoples R China Minist Educ Key Lab Smart Grid Tianjin 300072 Peoples R China Key Lab Smart Energy & Informat Technol Tianjin Mu Tianjin 300072 Peoples R China Zhejiang Univ Inst Marine Elect & Intelligent Syst Ocean Coll Zhoushan 316000 Peoples R China
To address the issue of significant unpredictability and intermittent nature of renewable energy sources, particularly wind and solar power, this paper introduces a novel optimization model based on online reinforceme... 详细信息
来源: 评论
Peer-to-peer Electricity Transaction Decisions of the User-side Smart Energy System Based on the sarsa Reinforcement Learning
收藏 引用
CSEE Journal of Power and Energy Systems 2022年 第3期8卷 826-837页
作者: Dan Wang Bo Liu Hongjie Jia Ziyang Zhang Jingcheng Chen Deyu Huang Key Laboratory of Smart Grid of Ministry of Education Tianjin UniversityTianjin 300072China Key Laboratory of Smart Energy&Information Technology of Tianjin Municipality Tianjin 30072China State Grid Jiangsu Power Co.Ltd. Zhenjiang Power Supply CompanyZhenjiang 212000Jiangsu ProvinceChina State Grid Tianjin Electric Power Company Hebei DistrictTianjin 300010China
With the deep integration of advanced information technologies,such as artificial intelligence and traditional energy technologies,smart energy systems have been proposed as a method to provide the best solution for t... 详细信息
来源: 评论
Optimal Decision-Making Method for a Plug-In Electric Taxi in Uncertain Environment
收藏 引用
IEEE ACCESS 2021年 9卷 62467-62477页
作者: You, Yang Zhu, Jisong Huang, Yichuan Jing, Zhaoxia South China Univ Technol Sch Elect Power Engn Guangzhou 510640 Peoples R China
This paper studies the optimal decision-making problem for a plug-in electric taxi (PET) in a time-varying complex environment, i.e., a passenger environment, charging station environment, traffic environment, and tax... 详细信息
来源: 评论
A sarsa-based adaptive controller for building energy conservation
收藏 引用
JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING 2018年 第2期18卷 329-338页
作者: Fu, Qiming Hu, Lingyao Wu, Hongjie Hu, Fuyuan Hu, Wen Chen, Jianping Suzhou Univ Sci & Technol Inst Elect & Informat Engn Suzhou 215009 Jiangsu Peoples R China Suzhou Univ Sci & Technol Jiangsu Key Lab Intelligent Bldg Energy Efficienc Suzhou 215009 Jiangsu Peoples R China Suzhou Univ Sci & Technol Suzhou Key Lab Mobile Networking & Appl Technol Suzhou 215009 Jiangsu Peoples R China
In the field of building equipment control, the traditional methods have some problems - instability and slow convergence. To deal with these problems, a new sarsa-based adaptive controller, SAC (sarsa-based adaptive ... 详细信息
来源: 评论