Search query: any field = "IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning"
307 records; showing results 21-30

Optimal Tracking Control of the Boiler-turbine System Based on Adaptive Dynamic Programming
International Joint Conference on Neural Networks (IJCNN)
Authors: Liu, Yujia; Gao, Aiguo; Wei, Qinglai
Affiliations: Chinese Acad Sci, Inst Automat, Beijing, Peoples R China; North China Elect Power Res Inst Co Ltd, Beijing, Peoples R China
To guarantee the efficient performance of the power plant, an adaptive tracking controller for the nonlinear boiler-turbine system based on offline policy iteration adaptive dynamic programming (ADP) method is proposed...

Adaptive railway traffic control using approximate dynamic programming
Transportation Research Part C-Emerging Technologies, 2020, vol. 113, pp. 91-107
Authors: Ghasempour, Taha; Heydecker, Benjamin
Affiliations: UCL, Ctr Transport Studies, London WC1E 6BT, England
This study presents an adaptive railway traffic controller for real-time operations based on approximate dynamic programming (ADP). By assessing requirements and opportunities, the controller aims to limit consecutive...

Solving Unit Commitment Problems with Multi-step Deep Reinforcement Learning
2021 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids, SmartGridComm 2021
Authors: Qin, Jingtao; Yu, Nanpeng; Gao, Yuanqi
Affiliations: University of California, Department of Electrical and Computer Engineering, Riverside, CA 92507, United States
Solving the unit commitment (UC) problem in a computationally efficient manner is a critical issue of electricity market operations. Optimization-based methods such as heuristics, dynamic programming, and mixed-intege...

A Review of Safe Online Learning for Nonlinear Control Systems
2021 International Conference on Unmanned Aircraft Systems, ICUAS 2021
Authors: Osborne, Matthew; Shin, Hyo-Sang; Tsourdos, Antonios
Affiliations: Centre for Autonomous and Cyber-Physical Systems, School of Aerospace, Transport and Manufacturing, Cranfield University, Cranfield MK43 0AL, United Kingdom
Learning for autonomous dynamic control systems that can adapt to unforeseen environmental changes are of great interest but the realisation of a practical and safe online learning algorithm is incredibly challenging....

Multi-agent Deep Reinforcement Learning based Information-Energy Collaboration in Vehicle Edge Computing Networks
IEEE International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)
Authors: Yaoyu Feng; Biling Zhang; Jung-Lang Yu
Affiliations: School of Network Education, Beijing University of Posts and Telecommunications, P. R. China; Department of Electrical Engineering, Fu Jen Catholic University, New Taipei City, Taiwan
In the vehicle edge computing network (VECN), how to deal with the computation resources and energy resources shortage problem the roadside units (RSUs) encounter when they are performing delay sensitive computation t...

Deep Reinforcement Learning for Perishable Inventory Optimization Problem
IEEE International Conference on Industrial Engineering and Engineering Management
Authors: Yusuke Nomura; Ziang Liu; Tatsushi Nishi
Affiliations: Graduate School of Environmental Life Natural Science and Technology, Okayama University, Okayama City, Okayama, Japan
While global attention on reducing food waste has increased, the demand for perishable commodities such as food and pharmaceuticals is growing. This emphasizes the need for effective perishable inventory management, w...

Policy Iteration Algorithm for Constrained Cost Optimal Control of Discrete-Time Nonlinear System
International Joint Conference on Neural Networks (IJCNN)
Authors: Li, Tao; Wei, Qinglai; Li, Hongyang; Song, Ruizhuo
Affiliations: Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing, Peoples R China; Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China; Univ Sci & Technol Beijing, Sch Automat, Beijing, Peoples R China
In this paper, optimal control problems with constraints on summation of auxiliary utility function are called constrained cost optimal control problems and a constrained cost policy iteration adaptive dynamic program...

Bayesian Sequential Optimal Experimental Design for Linear Regression with Reinforcement Learning
International Conference on Machine Learning and Applications (ICMLA)
Authors: Fadil Santosa; Loren Anderson
Affiliations: Dept. of Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, MD, USA; School of Mathematics, University of Minnesota Twin Cities, Minneapolis, MN, USA
We perform a comparison study on Bayesian sequential optimal experimental design algorithms applied to linear regression in two unknowns. We transform the Bayesian sequential optimal experimental design problem into a...

Safe Adaptive Dynamic Programming Method for Nonlinear Safety-Critical Systems with Disturbance
6th International Conference on Robotics and Automation Engineering, ICRAE 2021
Authors: Wang, Jinguang; Zhang, Dehua; Zhang, Jishi; Zhu, Heyang; Hu, Shaolin; Qin, Chunbin
Affiliations: Henan University, School of Artificial Intelligence, Kaifeng, China; Guangdong University of Petrochemical Technology, School of Automation, Maoming, China
In this paper, a safe adaptive dynamic programming (SADP) method based on the barrier function (BF) is proposed for the optimal control problem of nonlinear safety-critical systems with the safety constraints and exte...

DATE: Disturbance-Aware Traffic Engineering with Reinforcement Learning in Software-Defined Networks
29th IEEE/ACM International Symposium on Quality of Service (IWQoS)
Authors: Ye, Minghao; Zhang, Junjie; Guo, Zehua; Chao, H. Jonathan
Affiliations: NYU, Dept Elect & Comp Engn, New York, NY 11201, USA; Fortinet Inc, Sunnyvale, CA 94086, USA; Beijing Inst Technol, Beijing 100081, Peoples R China
Traffic Engineering (TE) has been applied to optimize network performance by routing/rerouting flows based on traffic loads and network topologies. To cope with network dynamics from emerging applications, it is essen...