咨询与建议

限定检索结果

文献类型

  • 50 篇 期刊文献
  • 22 篇 会议

馆藏范围

  • 72 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 64 篇 工学
    • 32 篇 电气工程
    • 30 篇 计算机科学与技术...
    • 17 篇 控制科学与工程
    • 11 篇 信息与通信工程
    • 5 篇 仪器科学与技术
    • 5 篇 软件工程
    • 4 篇 电子科学与技术(可...
    • 4 篇 交通运输工程
    • 3 篇 机械工程
    • 3 篇 石油与天然气工程
    • 2 篇 材料科学与工程(可...
    • 2 篇 化学工程与技术
    • 1 篇 动力工程及工程热...
    • 1 篇 土木工程
    • 1 篇 水利工程
    • 1 篇 航空宇航科学与技...
    • 1 篇 环境科学与工程(可...
    • 1 篇 生物医学工程(可授...
    • 1 篇 公安技术
    • 1 篇 网络空间安全
  • 5 篇 管理学
    • 5 篇 管理科学与工程(可...
  • 2 篇 理学
    • 2 篇 数学
    • 1 篇 系统科学
  • 2 篇 医学
    • 1 篇 基础医学(可授医学...
    • 1 篇 临床医学
    • 1 篇 特种医学
  • 1 篇 艺术学
    • 1 篇 设计学(可授艺术学...

主题

  • 72 篇 reinforcement le...
  • 11 篇 learning (artifi...
  • 6 篇 reinforcement le...
  • 4 篇 multi-agent syst...
  • 3 篇 q-learning algor...
  • 3 篇 road traffic con...
  • 3 篇 computational in...
  • 3 篇 optimisation
  • 3 篇 q-learning
  • 3 篇 power system man...
  • 3 篇 decision making
  • 3 篇 adaptive control
  • 3 篇 optimisation tec...
  • 3 篇 traffic engineer...
  • 2 篇 bilateral contra...
  • 2 篇 flexible hinged ...
  • 2 篇 automated negoti...
  • 2 篇 road vehicles
  • 2 篇 partially observ...
  • 2 篇 model identifica...

机构

  • 2 篇 south china univ...
  • 2 篇 inesc tec porto
  • 2 篇 northeastern uni...
  • 2 篇 inner mongolia u...
  • 2 篇 northeastern uni...
  • 2 篇 bohai univ coll ...
  • 1 篇 univ sci & techn...
  • 1 篇 minist educ engn...
  • 1 篇 nanyang technol ...
  • 1 篇 hubei key labora...
  • 1 篇 polytech porto i...
  • 1 篇 chang gung univ ...
  • 1 篇 univ virginia de...
  • 1 篇 southeast univ s...
  • 1 篇 college of infor...
  • 1 篇 electric power c...
  • 1 篇 univ sci & techn...
  • 1 篇 northeastern uni...
  • 1 篇 univ tecnol fed ...
  • 1 篇 simulation labor...

作者

  • 2 篇 zhou jiantao
  • 2 篇 seyyedabbasi ami...
  • 2 篇 yu lei
  • 2 篇 tejer mateusz
  • 2 篇 qiu zhi-cheng
  • 1 篇 xiaolong han
  • 1 篇 alami reda
  • 1 篇 hosseinian seyed...
  • 1 篇 kelkar atul
  • 1 篇 driessens kurt
  • 1 篇 biao zou
  • 1 篇 hawbani ammar
  • 1 篇 teng fang
  • 1 篇 renato duarte
  • 1 篇 zhang tianbao
  • 1 篇 tao yu
  • 1 篇 kang shengyang
  • 1 篇 chun jie
  • 1 篇 wang xingwei
  • 1 篇 zheng pengjun

语言

  • 70 篇 英文
  • 2 篇 其他
检索条件"主题词=reinforcement learning algorithm"
72 条 记 录,以下是11-20 订阅
排序:
reinforcement learning algorithm for One-warehouse Multi-retailer Inventory Problem
Reinforcement Learning Algorithm for One-warehouse Multi-ret...
收藏 引用
2015 International Conference on Automation,Mechanical and Electrical Engineering(AMEE 2015)
作者: C.Y.Li X.T.Wang T.W.Zhang School of Computer Science and Technology Harbin University of Science and Technology School of Computer Science and Technology Harbin Institute of Technology
We considered a supply chain inventory scheduling problem in which a central warehouse serves n-retailers under. Mathematical model was developed to obtain the optimal revenue for the proposed policy and the objective... 详细信息
来源: 评论
A Memory-based reinforcement learning algorithm for Partially Observable Markovian Decision Processes
A Memory-based Reinforcement Learning Algorithm for Partiall...
收藏 引用
International Joint Conference on Neural Networks
作者: Zheng, Lei Cho, Siu-Yeung Quek, Chai Nanyang Technol Univ Sch Comp Engn Singapore Singapore
This paper presents a modified version of U-Tree [1], a memory-based reinforcement learning (RL) algorithm that uses selective perception and short-term memory to handle partially observable Markovian decision process... 详细信息
来源: 评论
Design of Human Resources Management Decision System Based on Multi-Agent System and reinforcement learning algorithm  24
Design of Human Resources Management Decision System Based o...
收藏 引用
3rd International Conference on Cyber Security, Artificial Intelligence and Digital Economy (CSAIDE)
作者: Yao, Wenyan Zhang, Tianbao Nanning Univ Nanning 530200 Guangxi Peoples R China Univ Sains Malaysia Sch Management George Town Malaysia Qinzhou Tobacco Monopoly Bur Qinzhou 535000 Guangxi Peoples R China
This study aims to address the lack of scientific and systematic decision systems in the field of Human Resources Management (HRM). By designing a HRM decision support system based on Multi-Agent systems and reinforce... 详细信息
来源: 评论
Research on Computer Aided learning System Based On reinforcement learning algorithm
收藏 引用
Procedia Computer Science 2024年 243卷 472-481页
作者: Haiyan Lu School of Education and Foreign Languages Wuhan Donghu University Wuhan 430212 Hubei China
In today's information-based education era, the computer-aided instruction system under the background of "Internet +" is ushering in unprecedented development opportunities. Based on VARK model, this pa... 详细信息
来源: 评论
Construction of Automatic Scheduling and Visualization System for Power Grid Space Operation Based on reinforcement learning algorithm  23
Construction of Automatic Scheduling and Visualization Syste...
收藏 引用
Proceedings of the 2023 International Conference on Big Data Mining and Information Processing
作者: Xiaokang Zhu Ning Wang Biao Zou Songtao Zhu Teng Fang Yubo Gao State Grid Electric Power Space Technology Company Limited China
With the complexity and increasing demand of the power grid, more efficient scheduling methods are needed. reinforcement learning, as an artificial intelligence technology, provides adaptive decision-making solutions ... 详细信息
来源: 评论
Combination of reinforcement learning and bee algorithm for controlling two-link arm with six muscle: simplified human arm model in the horizontal plane
收藏 引用
PHYSICAL AND ENGINEERING SCIENCES IN MEDICINE 2020年 第1期43卷 135-142页
作者: Rahatabad, Fereidoun Nowshiravan Rangraz, Parisa Islamic Azad Univ Dept Biomed Engn Sci & Res Branch Tehran Iran
The aim of this study was to improve reinforcement learning algorithm by combining artificial bee colony algorithm. The traditional method of reinforcement learning algorithm has a very low convergence rate due to ran... 详细信息
来源: 评论
Smart Scheduling Strategy for Islanded Microgrid Based on reinforcement learning algorithm
Smart Scheduling Strategy for Islanded Microgrid Based on Re...
收藏 引用
上海市研究生“新能源与智能电网”学术论坛
作者: Lingxiao Gan Tao Yu Jing Li Electric Power College South China University of TechnologyGuangzhou 510640China
This paper investigates a hierarchical Automatic Generation Control (AGC) strategy for an islanded microgrid, including wind power, solar photovoltaic, micro turbines, small hydropower and energy storage *** upper AGC... 详细信息
来源: 评论
Dynamic Coordination of Energy and Hops in WSNs Using reinforcement learning Routing algorithm
Dynamic Coordination of Energy and Hops in WSNs Using Reinfo...
收藏 引用
International Conference on Information Sciences,Machinery,Materials and Energy(ICISMME 2015)
作者: Jianyong Li Huang Wei Department of Computer and Information Science Southwest University
In wireless sensor network,the existing reinforcement learning routing algorithm usually optimize single goal and the process of route establishment is *** also has problem of data forwarding control *** this paper,we... 详细信息
来源: 评论
Innovation and Evaluation of Machine Translation Models Combining reinforcement learning algorithms and RNN
收藏 引用
Procedia Computer Science 2025年 261卷 821-828页
作者: Ni Xiao School of Public Basic Courses Wuhan Institute of Design and Sciences Wuhan 430205 Hubei China
With the rapid acceleration of global integration, machine translation serves as a crucial conduit for cross - cultural interaction. Nevertheless, the existing conventional models demonstrate insufficient resilience w... 详细信息
来源: 评论
Neural network-based adaptive reinforcement learning for optimized backstepping tracking control of nonlinear systems with input delay
收藏 引用
APPLIED INTELLIGENCE 2025年 第2期55卷 1-16页
作者: Zhu, Boyan Karimi, Hamid Reza Zhang, Liang Zhao, Xudong Bohai Univ Coll Control Sci & Engn Jinzhou 121013 Liaoning Peoples R China Politecn Milan Dept Mech Engn Via Masa 1 I-20156 Milan Italy DaLian Univ Technol Fac Elect Informat & Elect Engn Dalian 116024 Liaoning Peoples R China
In this paper, the problem of adaptive optimized tracking control design is addressed for a class of nonlinear systems in strict-feedback form. The system under consideration contains input delay and has unmeasurable ... 详细信息
来源: 评论