咨询与建议

限定检索结果

文献类型

  • 46 篇 期刊文献
  • 22 篇 会议

馆藏范围

  • 68 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 62 篇 工学
    • 30 篇 电气工程
    • 30 篇 计算机科学与技术...
    • 16 篇 控制科学与工程
    • 11 篇 信息与通信工程
    • 5 篇 软件工程
    • 4 篇 仪器科学与技术
    • 4 篇 电子科学与技术(可...
    • 4 篇 交通运输工程
    • 3 篇 机械工程
    • 3 篇 石油与天然气工程
    • 2 篇 材料科学与工程(可...
    • 2 篇 动力工程及工程热...
    • 2 篇 化学工程与技术
    • 1 篇 土木工程
    • 1 篇 水利工程
    • 1 篇 航空宇航科学与技...
    • 1 篇 环境科学与工程(可...
    • 1 篇 生物医学工程(可授...
    • 1 篇 公安技术
    • 1 篇 网络空间安全
  • 5 篇 管理学
    • 5 篇 管理科学与工程(可...
  • 2 篇 理学
    • 2 篇 数学
    • 1 篇 系统科学
  • 2 篇 医学
    • 1 篇 基础医学(可授医学...
    • 1 篇 临床医学
    • 1 篇 特种医学
  • 1 篇 艺术学
    • 1 篇 设计学(可授艺术学...

主题

  • 68 篇 reinforcement le...
  • 10 篇 learning (artifi...
  • 5 篇 reinforcement le...
  • 4 篇 multi-agent syst...
  • 3 篇 q-learning algor...
  • 3 篇 road traffic con...
  • 3 篇 computational in...
  • 3 篇 q-learning
  • 3 篇 traffic engineer...
  • 2 篇 bilateral contra...
  • 2 篇 flexible hinged ...
  • 2 篇 automated negoti...
  • 2 篇 road vehicles
  • 2 篇 partially observ...
  • 2 篇 model identifica...
  • 2 篇 power markets
  • 2 篇 decision support...
  • 2 篇 control engineer...
  • 2 篇 neural networks
  • 2 篇 fault-tolerant c...

机构

  • 2 篇 south china univ...
  • 2 篇 inesc tec porto
  • 2 篇 northeastern uni...
  • 2 篇 inner mongolia u...
  • 2 篇 northeastern uni...
  • 2 篇 bohai univ coll ...
  • 1 篇 minist educ engn...
  • 1 篇 nanyang technol ...
  • 1 篇 hubei key labora...
  • 1 篇 polytech porto i...
  • 1 篇 chang gung univ ...
  • 1 篇 univ virginia de...
  • 1 篇 southeast univ s...
  • 1 篇 college of infor...
  • 1 篇 electric power c...
  • 1 篇 univ sci & techn...
  • 1 篇 northeastern uni...
  • 1 篇 univ tecnol fed ...
  • 1 篇 simulation labor...
  • 1 篇 univ vaasa sch t...

作者

  • 2 篇 zhou jiantao
  • 2 篇 seyyedabbasi ami...
  • 2 篇 yu lei
  • 2 篇 tejer mateusz
  • 2 篇 qiu zhi-cheng
  • 1 篇 xiaolong han
  • 1 篇 hosseinian seyed...
  • 1 篇 kelkar atul
  • 1 篇 driessens kurt
  • 1 篇 biao zou
  • 1 篇 hawbani ammar
  • 1 篇 teng fang
  • 1 篇 renato duarte
  • 1 篇 zhang tianbao
  • 1 篇 tao yu
  • 1 篇 kang shengyang
  • 1 篇 chun jie
  • 1 篇 wang xingwei
  • 1 篇 zheng pengjun
  • 1 篇 gao weimin

语言

  • 66 篇 英文
  • 2 篇 其他
检索条件"主题词=Reinforcement Learning algorithm"
68 条 记 录,以下是51-60 订阅
排序:
Adaptive Group-based Signal Control Using reinforcement learning with Eligibility Traces
Adaptive Group-based Signal Control Using Reinforcement Lear...
收藏 引用
International IEEE Conference on Intelligent Transportation Systems
作者: Junchen Jin Xiaoliang Ma Div. of Transp. Planning Econ. & Eng. KTH R. Inst. of Technol. Stockholm Sweden
Group-based signal controllers are widely deployed on urban networks in the European countries. However, group-based signal controls are usually implemented with rather simple timing logics, e.g. vehicle actuated timi... 详细信息
来源: 评论
Using hybrid multiobjective machine learning to optimise sonobuoy placement patterns
收藏 引用
IET RADAR SONAR AND NAVIGATION 2023年 第3期17卷 374-387页
作者: Taylor, Christopher M. Maskell, Simon Ralph, Jason F. Univ Liverpool Dept Elect & Elect Engn Brownlow Hill Liverpool L69 3GJ Merseyside England
This paper presents a new approach to finding optimal patterns for the placement of fields of sonobuoys in a complex undersea environment. We model the problem as a biobjective one, where the aim is to minimise both s... 详细信息
来源: 评论
Optimizing QoS routing in hierarchical ATM networks using computational intelligence techniques
收藏 引用
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS 2003年 第3期33卷 297-312页
作者: Vasilakos, A Saltouros, MP Atlassis, AF Pedrycz, W FORTH Fdn Res & Technol Hellas Inst Comp Sci Iraklion 15410 Greece Natl Tech Univ Athens Dept Elect & Comp Engn GR-15773 Athens Greece Univ Alberta Dept Elect & Comp Engn Edmonton AB T6G 2G7 Canada
In this paper, the use of a computational intelligence approach -a reinforcement learning algorithm (RLA)-for optimizing the routing in asynchronous transfer mode (ATM) networks based on the private network-to-network... 详细信息
来源: 评论
Research on coordinated scheduling of straddle carriers and quay cranes in automated container terminals based on reinforcement learning
Research on coordinated scheduling of straddle carriers and ...
收藏 引用
作者: Zhenyu Fang Xiaolong Han Shanghai Maritime University Institute of Logistics Science & Engineering
Aiming at the coordination and scheduling problem between Automated Straddle Carrier(Automated Straddle Carrier) and Quay Crane(QC) in automated container terminals,consider that the quay crane cannot cross the stradd... 详细信息
来源: 评论
Data-driven optimal control of operational indices for a class of industrial processes
收藏 引用
IET CONTROL THEORY AND APPLICATIONS 2016年 第12期10卷 1348-1356页
作者: Lu, Xinglong Kiumarsi, Bahare Chai, Tianyou Lewis, Frank L. Northeastern Univ State Key Lab Synthet Automat Proc Ind Shenyang 110819 Peoples R China Univ Texas Arlington UTA Res Inst Ft Worth TX 76118 USA
In this study, a data-driven optimisation solution for operational index control for a class of industrial processes is presented. First, the operational index control problem is formulated as an optimal tracking cont... 详细信息
来源: 评论
Online two-timescale service placement for time-sensitive applications in MEC-assisted network: A TMAGRL approach
收藏 引用
COMPUTER NETWORKS 2024年 244卷
作者: Du, An Jia, Jie Chen, Jian Guo, Liang Wang, Xingwei Northeastern Univ Sch Comp Sci & Engn Shenyang 110819 Liaoning Peoples R China Minist Educ Engn Res Ctr Secur Technol Complex Network Syst Shenyang 110819 Liaoning Peoples R China Northeastern Univ Key Lab Intelligent Comp Med Image Minist Educ Shenyang 110819 Liaoning Peoples R China
Mobile edge computing (MEC) integrated with the Network Functions Virtualization (NFV) technique has been regarded as a promising solution for flexible services provision and user service experience improvement. Howev... 详细信息
来源: 评论
Using a Collaborative Robot to the Upper Limb Rehabilitation  4th
Using a Collaborative Robot to the Upper Limb Rehabilitation
收藏 引用
4th Iberian Robotics Conference (Robot) - Advances in Robotics
作者: Fernandes, Lucas de Azevedo Lima, Jose Luis Leitao, Paulo Nakano, Alberto Yoshiro Univ Tecnol Fed Parana Curitiba Parana Brazil Polytech Inst Braganca CeDRI Res Ctr Digitalizat & Intelligent Robot Porto Portugal INESC TEC Porto Portugal
Rehabilitation is a relevant process for the recovery from dysfunctions and improves the realization of patient's Activities of Daily Living (ADLs). Robotic systems are considered an important field within the dev... 详细信息
来源: 评论
Multiagent-based Market Simulator for the Wholesale Electricity Spot Market
Multiagent-based Market Simulator for the Wholesale Electric...
收藏 引用
IEEE Region 10 Conference (TENCON) - Sustainable Development through Humanitarian Technology
作者: Pacaba, Dominic Dave P. Nerves, Allan C. Univ Philippines Diliman Elect & Elect Engn Inst Quezon City Philippines
As electricity markets develop into more complex structures, new modeling and simulation techniques are required to simulate the market and to identify strategic behavior that can profitably influence electricity pric... 详细信息
来源: 评论
Simultaneous learning of Spatial Visual Attention and Physical Actions
Simultaneous Learning of Spatial Visual Attention and Physic...
收藏 引用
IEEE/RSJ International Conference on Intelligent Robots and Systems
作者: Borji, Ali Ahmadabadi, Majid Nili Araabi, Babak Nadjar Univ So Calif Dept Comp Sci Hedco Neurosci BldgRoom 93641 Watt Way Los Angeles CA 90089 USA Univ Tehran Sch Elect & Comp Engn Sch Cognit Sci Tehran Iran
This paper introduces a new method for learning top-down and task-driven visual attention control along with physical actions in interactive environments. Our method is based on the reinforcement learning of Visual Cl... 详细信息
来源: 评论
Using the Online Cross-Entropy Method to Learn Relational Policies for Playing Different Games
Using the Online Cross-Entropy Method to Learn Relational Po...
收藏 引用
IEEE Conference on Computational Intelligence and Games (CIG)
作者: Sarjant, Samuel Pfahringer, Bernhard Driessens, Kurt Smith, Tony Univ Waikato Fac Comp & Math Sci Dunedin New Zealand Maastricht Univ Dept Knowledge Engn NL-6200 MD Maastricht Netherlands
By defining a video-game environment as a collection of objects, relations, actions and rewards, the relational reinforcement learning algorithm presented in this paper generates and optimises a set of concise, human-... 详细信息
来源: 评论