咨询与建议

限定检索结果

文献类型

  • 47 篇 期刊文献
  • 22 篇 会议

馆藏范围

  • 69 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 63 篇 工学
    • 31 篇 电气工程
    • 30 篇 计算机科学与技术...
    • 17 篇 控制科学与工程
    • 11 篇 信息与通信工程
    • 5 篇 仪器科学与技术
    • 5 篇 软件工程
    • 4 篇 电子科学与技术(可...
    • 4 篇 交通运输工程
    • 3 篇 机械工程
    • 3 篇 石油与天然气工程
    • 2 篇 材料科学与工程(可...
    • 2 篇 动力工程及工程热...
    • 2 篇 化学工程与技术
    • 1 篇 土木工程
    • 1 篇 水利工程
    • 1 篇 航空宇航科学与技...
    • 1 篇 环境科学与工程(可...
    • 1 篇 生物医学工程(可授...
    • 1 篇 公安技术
    • 1 篇 网络空间安全
  • 5 篇 管理学
    • 5 篇 管理科学与工程(可...
  • 2 篇 理学
    • 2 篇 数学
    • 1 篇 系统科学
  • 2 篇 医学
    • 1 篇 基础医学(可授医学...
    • 1 篇 临床医学
    • 1 篇 特种医学
  • 1 篇 艺术学
    • 1 篇 设计学(可授艺术学...

主题

  • 69 篇 reinforcement le...
  • 11 篇 learning (artifi...
  • 5 篇 reinforcement le...
  • 4 篇 multi-agent syst...
  • 3 篇 q-learning algor...
  • 3 篇 road traffic con...
  • 3 篇 computational in...
  • 3 篇 optimisation
  • 3 篇 q-learning
  • 3 篇 adaptive control
  • 3 篇 traffic engineer...
  • 2 篇 bilateral contra...
  • 2 篇 flexible hinged ...
  • 2 篇 automated negoti...
  • 2 篇 road vehicles
  • 2 篇 partially observ...
  • 2 篇 model identifica...
  • 2 篇 power markets
  • 2 篇 decision support...
  • 2 篇 control engineer...

机构

  • 2 篇 south china univ...
  • 2 篇 inesc tec porto
  • 2 篇 northeastern uni...
  • 2 篇 inner mongolia u...
  • 2 篇 northeastern uni...
  • 2 篇 bohai univ coll ...
  • 1 篇 univ sci & techn...
  • 1 篇 minist educ engn...
  • 1 篇 nanyang technol ...
  • 1 篇 hubei key labora...
  • 1 篇 polytech porto i...
  • 1 篇 chang gung univ ...
  • 1 篇 univ virginia de...
  • 1 篇 southeast univ s...
  • 1 篇 college of infor...
  • 1 篇 electric power c...
  • 1 篇 univ sci & techn...
  • 1 篇 northeastern uni...
  • 1 篇 univ tecnol fed ...
  • 1 篇 simulation labor...

作者

  • 2 篇 zhou jiantao
  • 2 篇 seyyedabbasi ami...
  • 2 篇 yu lei
  • 2 篇 tejer mateusz
  • 2 篇 qiu zhi-cheng
  • 1 篇 xiaolong han
  • 1 篇 hosseinian seyed...
  • 1 篇 kelkar atul
  • 1 篇 driessens kurt
  • 1 篇 biao zou
  • 1 篇 hawbani ammar
  • 1 篇 teng fang
  • 1 篇 renato duarte
  • 1 篇 zhang tianbao
  • 1 篇 tao yu
  • 1 篇 kang shengyang
  • 1 篇 chun jie
  • 1 篇 wang xingwei
  • 1 篇 zheng pengjun
  • 1 篇 gao weimin

语言

  • 67 篇 英文
  • 2 篇 其他
检索条件"主题词=Reinforcement learning algorithm"
69 条 记 录,以下是41-50 订阅
排序:
6Search: A reinforcement learning-based traceroute approach for efficient IPv6 topology discovery
收藏 引用
COMPUTER NETWORKS 2023年 第1期235卷
作者: Liu, Ning Jia, Chunbo Hou, Bingnan Hou, Changsheng Chen, Yingwen Cai, Zhiping Natl Univ Def Technol Coll Comp Changsha 410073 Hunan Peoples R China
Topology discovery can infer the interconnection relationship between network entities. A complete network topology is of great significance for network security analysis, application research, etc. However, due to th... 详细信息
来源: 评论
Automatic voltage control considering demand response: Approximatively completed observed Markov decision process-based reinforcement learning scheme
收藏 引用
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS 2024年 161卷
作者: Gu, Yaru Huang, Xueliang Southeast Univ Sch Elect Engn Nanjing Peoples R China
To fully utilize the voltage regulation capacity of flexible load and distributed generations (DGs), we propose a novel Approximatively Completed Observed Markov Decision Process-based (ACOMDP-based) reinforcement Lea... 详细信息
来源: 评论
Burden Control Strategy Based on reinforcement learning for Gas Utilization Rate in Blast Furnace ⁎
收藏 引用
IFAC-PapersOnLine 2020年 第2期53卷 11704-11709页
作者: Xiaoling Shen Jianqi An Min Wu Jinhua She School of Automation China University of Geosciences Wuhan 430074 China Hubei Key Laboratory of Advanced Control and Intelligent Automation for Complex Systems Wuhan 430074 China School of Engineering Tokyo University of Technology HachiojiTokyo 192-0982 Japan
Gas utilization rate (GUR) is an important state parameter to reflect the energy consumption, the quality and production of the pig iron, and the distribution of the gas flow in a blast furnace. The GUR is mainly adju... 详细信息
来源: 评论
Two-level decision-making model for a distribution company in day-ahead market
收藏 引用
IET GENERATION TRANSMISSION & DISTRIBUTION 2015年 第12期9卷 1308-1315页
作者: Khazaei, Hossein Vahidi, Behrooz Hosseinian, Seyed Hossein Rastegar, Hasan Amirkabir Univ Technol Dept Elect Engn Tehran *** Iran
This study presents a two-level decision-making (TLDM) model for a distribution company (Disco) in the day-ahead market (DAM), where Disco has two additional resources, interruptible load (IL) and distribution generat... 详细信息
来源: 评论
learning to construct a solution for UAV path planning problem with positioning error correction
收藏 引用
KNOWLEDGE-BASED SYSTEMS 2024年 304卷
作者: Chun, Jie Chen, Ming Liu, Xiaolu Xiang, Shang Du, Yonghao Wu, Guohua Xing, Lining Natl Univ Def Technol Coll Syst Engn Changsha 410073 Hunan Peoples R China XiangTan Univ Sch Publ Adm Xiangtan 411100 Hunan Peoples R China Cent South Univ Sch Automat Changsha 410075 Hunan Peoples R China Xidian Univ Coll Elect Engn Xian 710126 Shanxi Peoples R China
Unmanned aerial vehicles (UAVs) are advanced flight systems. However, their positioning systems cause distance-dependent errors during flight. This study seeks to solve the UAV path planning problem with positioning e... 详细信息
来源: 评论
Optimized adaptive event-triggered tracking control for multi-agent systems with full-state constraints
收藏 引用
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL 2022年 第18期32卷 10101-10124页
作者: Yang, Xiaoyu Pan, Yingnan Sun, Jize Tan, Lihua Bohai Univ Coll Control Sci & Engn Jinzhou 121013 Liaoning Peoples R China Shenyang Aircraft Design & Res Inst Shenyang Liaoning Peoples R China Southwest Univ Coll Elect & Informat Engn Chongqing Peoples R China
In this article, the event-triggered optimized adaptive tracking control problem is investigated for a class of multi-agent systems subject to full-state constraints. To address the full-state constraints problem, a n... 详细信息
来源: 评论
Adaptive integral sliding mode control fault tolerant control for a class of uncertain nonlinear systems
收藏 引用
IET CONTROL THEORY AND APPLICATIONS 2018年 第13期12卷 1864-1872页
作者: Li, Yuan-Xin Yang, Guang-Hong Northeastern Univ Coll Informat Sci & Engn Shenyang 110819 Liaoning Peoples R China Liaoning Univ Technol Coll Math Jinzhou 121001 Peoples R China Northeastern Univ State Key Lab Synthet Automat Proc Ind Shenyang 110819 Liaoning Peoples R China
This study considers the problem of adaptive sliding mode control for a class of uncertain non-linear systems with actuator faults and external disturbances. First, a novel reinforcement learning algorithm is first in... 详细信息
来源: 评论
Context aware Q-learning-based model for decision support in the negotiation of energy contracts
收藏 引用
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS 2019年 104卷 489-501页
作者: Rodriguez-Fernandez, J. Pinto, T. Silva, F. Praca, I Vale, Z. Corchado, J. M. Polytech Porto ISEP IPP GECAD Res Grp Porto Portugal Univ Salamanca BISITE Res Grp Salamanca Spain Polytech Porto IPP Porto Portugal Osaka Inst Technol Osaka Japan
Automated negotiation plays a crucial role in the decision support for bilateral energy transactions. In fact, an adequate analysis of past actions of opposing negotiators can improve the decision-making process of ma... 详细信息
来源: 评论
Adaptive optimisation of timeout policy for dynamic power management based on semi-Markov control processes
收藏 引用
IET CONTROL THEORY AND APPLICATIONS 2010年 第10期4卷 1945-1958页
作者: Jiang, Q. Xi, H. -S. Yin, B. -Q. Hefei Univ Technol Sch Elect Engn & Automat Hefei 230009 Peoples R China Univ Sci & Technol China Dept Automat Hefei 230027 Peoples R China
Timeout policy is an industry standard for dynamic power management (DPM), and thus is easy and safe to implement in many power-managed systems. The optimisation of timeout policy suffered from the lack of effective a... 详细信息
来源: 评论
Design of a Digital Exhibition Service System Under the Deep Belief Network Models
收藏 引用
IEEE ACCESS 2024年 12卷 108786-108796页
作者: Song, Qixin Tourism Coll Changchun Univ Sch Tourism & Culture Changchun 130607 Peoples R China Jilin Prov Res Ctr Cultural Tourism Educ & Enterp Changchun 130607 Peoples R China Northeast Asia Res Ctr Leisure Econ Changchun 130607 Peoples R China
This work aims to optimize the classification efficiency of the digital exhibition service system and achieve optimization of booth layout and visitor route planning. This work combines the Deep Belief Network (DBN) m... 详细信息
来源: 评论