咨询与建议

限定检索结果

文献类型

  • 748 篇 会议
  • 271 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,023 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 712 篇 工学
    • 520 篇 计算机科学与技术...
    • 381 篇 电气工程
    • 278 篇 控制科学与工程
    • 153 篇 软件工程
    • 79 篇 信息与通信工程
    • 40 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 7 篇 土木工程
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 航空宇航科学与技...
    • 3 篇 安全科学与工程
  • 118 篇 理学
    • 98 篇 数学
    • 32 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 66 篇 管理学
    • 63 篇 管理科学与工程(可...
    • 14 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 313 篇 reinforcement le...
  • 216 篇 dynamic programm...
  • 206 篇 optimal control
  • 107 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 88 篇 neural networks
  • 78 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 41 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 13 篇 guangdong univ t...
  • 12 篇 northeastern uni...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 7 篇 beijing univ tec...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 22 篇 wang ding
  • 21 篇 xu xin
  • 19 篇 jiang zhong-ping
  • 17 篇 lewis frank l.
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 derong liu
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 10 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 992 篇 英文
  • 25 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1023 条 记 录,以下是351-360 订阅
A dynamic Consensus Scheme for Unlicensed Spectrum Sharing in Heterogeneous Networks
A Dynamic Consensus Scheme for Unlicensed Spectrum Sharing i...
收藏 引用
International Conference on Communication Technology (ICCT)
作者: Hanwen Zhang Supeng Leng Pengcheng Zhao Jianhua He School of Information and Communication Enginering University of Electronic Science and Technology of China Chengdu China School of Computer Science and Electronic Engineering University of Essex UK
Exploiting and sharing unlicensed spectrum resources among cellular and WiFi networks is critical for the fifth-generation (5G) and beyond networks due to the severe spectrum shortage and huge traffic demands. While d...
来源: 评论
learning Run-time Compositions of Interacting Adaptations  15
Learning Run-time Compositions of Interacting Adaptations
收藏 引用
ieee/ACM 15th International symposium on Software Engineering for adaptive and Self-Managing Systems (SEAMS)
作者: Cardozo, Nicolas Dusparic, Ivana Univ Los Andes Syst & Comp Engn Dept Bogota Colombia Trinity Coll Dublin Sch Comp Sci & Stat Dublin Ireland
Self-adaptive systems continuously adapt to internal and external changes in their execution environment. In context-based self-adaptation, adaptations take place in response to the characteristics of the execution en... 详细信息
来源: 评论
adaptive dynamic programming Based on Multi-dimensional Taylor Network for Time-Delay Nonlinear System with Uncertainties  2
Adaptive Dynamic Programming Based on Multi-dimensional Tayl...
收藏 引用
2nd World symposium on Artificial Intelligence, WSAI 2020
作者: Duan, Zheng-Yi Yan, Hong-Sen Southeast University School of Automation Nanjing Jiangsu210096 China
For the uncertain time-delay system, this paper investigates a novel robust adaptive dynamic programming (ADP) to guarantee the stability and performance of the system. By devising a novel cost function which integrat... 详细信息
来源: 评论
Editorial Special Issue on adaptive dynamic programming and reinforcement learning
收藏 引用
ieee Transactions on Systems, Man, and Cybernetics: Systems 2020年 第11期50卷 3944-3947页
作者: Liu, Derong Lewis, Frank L. Wei, Qinglai School of Automation Guangdong University of Technology Guangzhou510006 China Uta Research Institute University of Texas at Arlington Fort WorthTX76118 United States State Key Laboratory of Management and Control for Complex Systems Istitute of Automation Chinese Academy of Sciences Beijing100190 China University of Chinese Academy of Sciences Beijing100049 China
The past decade has witnessed a surge in research activities related to adaptive dynamic programming (ADP) and reinforcement learning (RL), particularly for control applications. Several books [item 1)–5) in the Appe... 详细信息
来源: 评论
Sparse learning-Based Approximate dynamic programming With Barrier Constraints
收藏 引用
ieee CONTROL SYSTEMS LETTERS 2020年 第3期4卷 743-748页
作者: Greene, Max L. Deptula, Patryk Nivison, Scott Dixon, Warren E. Univ Florida Dept Mech & Aerosp Engn Gainesville FL 32611 USA Charles Stark Draper Lab Inc Percept & Auton Grp Cambridge MA 02139 USA Air Force Res Lab Munit Directorate Eglin AFB FL 32542 USA
This letter provides an approximate online adaptive solution to the infinite-horizon optimal control problem for control-affine continuous-time nonlinear systems while formalizing system safety using barrier certifica... 详细信息
来源: 评论
adaptive dynamic programming Based on Parallel Control Theory for Underwater Vehicles
Adaptive Dynamic Programming Based on Parallel Control Theor...
收藏 引用
Digital Twins and Parallel Intelligence (DTPI), ieee International Conference on
作者: Peng Bo Xingbin Tu Fengzhong Qu Fei-Yue Wang Key Laboratory of Ocean Observation-Imaging Testbed of Zhejiang Province Zhejiang University Zhoushan China Institute of Automation Chinese Academy of Sciences Beijing China
Parallel control theory can provide an effective solution for the control problem of complex system with unknown models and time-varying characteristics. The adaptive dynamic programming (ADP) method, which combines r... 详细信息
来源: 评论
reinforcement learning for Linear Continuous-time Systems: an Incremental learning Approach
收藏 引用
ieee/CAA Journal of Automatica Sinica 2019年 第2期6卷 433-440页
作者: Tao Bian Zhong-Ping Jiang Bank of America Merrill Lynch IEEE the Control and Networks Lab Department of Electrical and Computer Engineering Tandon School of Engineering New York University
In this paper, we introduce a novel reinforcement learning(RL) scheme for linear continuous-time dynamical systems. Different from traditional batch learning algorithms,an incremental learning approach is developed, w... 详细信息
来源: 评论
Revisiting Maximum Entropy Inverse reinforcement learning: New Perspectives and Algorithm
Revisiting Maximum Entropy Inverse Reinforcement Learning: N...
收藏 引用
ieee symposium Series on Computational Intelligence (ieee SSCI)
作者: Snoswell, Aaron J. Singh, Surya P. N. Ye, Nan Univ Queensland Sch Informat Technol & Elect Engn Brisbane Qld Australia Intuit Surg Sunnyvale CA USA Univ Queensland Sch Math & Phys Brisbane Qld Australia
We provide new perspectives and inference algorithms for Maximum Entropy (MaxEnt) Inverse reinforcement learning (IRL), which provides a principled method to find a most non-committal reward function consistent with g... 详细信息
来源: 评论
Toward autonomous adaptive embedded systems for sustainable services using reinforcement learning (WiP report)  8
Toward autonomous adaptive embedded systems for sustainable ...
收藏 引用
8th International symposium on Computing and Networking Workshops, CANDARW 2020
作者: Nakamoto, Yukikazu Kumalija, Elhard Zhang, Menglei University of Hyogo Graduate School of Applied Informatics Kobe Japan
A connected space comprises embedded systems that are attached to the physical space and cloud systems through the Internet. Using the connected space, various services can be continuously provided. These services can... 详细信息
来源: 评论
Distributed Optimal Coordination Control for Continuous-Time Nonlinear Multi-Agent Systems With Input Constraints  9
Distributed Optimal Coordination Control for Continuous-Time...
收藏 引用
9th ieee Data Driven Control and learning Systems Conference (DDCLS)
作者: Deng, Yunhong Xiao, Jun Wei, Qinglai Univ Chinese Acad Sci Beijing 100049 Peoples R China
U This paper is concerned with an optimal coordination control problem for nonlinear multi-agent systems (MASs) with constraints of the control inputs. The idea of daptive dynamic programming (ADP) algorithm is to use... 详细信息
来源: 评论