咨询与建议

限定检索结果

文献类型

  • 28 篇 期刊文献
  • 7 篇 会议
  • 2 篇 学位论文

馆藏范围

  • 37 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 24 篇 工学
    • 11 篇 电气工程
    • 11 篇 控制科学与工程
    • 10 篇 计算机科学与技术...
    • 3 篇 仪器科学与技术
    • 3 篇 信息与通信工程
    • 1 篇 机械工程
    • 1 篇 安全科学与工程
  • 14 篇 理学
    • 12 篇 数学
    • 2 篇 化学
    • 2 篇 生物学
    • 2 篇 系统科学
    • 1 篇 物理学
  • 13 篇 管理学
    • 12 篇 管理科学与工程(可...
    • 1 篇 工商管理

主题

  • 37 篇 value iteration ...
  • 5 篇 dynamic programm...
  • 5 篇 optimal control
  • 4 篇 iterative method...
  • 4 篇 markov decision ...
  • 4 篇 optimal policy
  • 4 篇 policy iteration...
  • 3 篇 reinforcement le...
  • 3 篇 markov processes
  • 2 篇 feedback
  • 2 篇 numerical comple...
  • 2 篇 game theory
  • 2 篇 polynomials
  • 2 篇 markov decision ...
  • 1 篇 saddle-point equ...
  • 1 篇 approximate dyna...
  • 1 篇 limited inventor...
  • 1 篇 asynchronous tra...
  • 1 篇 markov
  • 1 篇 vehicle dynamics

机构

  • 2 篇 univ sci & techn...
  • 2 篇 univ shanghai sc...
  • 1 篇 univ adelaide sc...
  • 1 篇 hakim sabzevari ...
  • 1 篇 inrs-energie ver...
  • 1 篇 tel aviv univ sc...
  • 1 篇 ferdowsi univ ma...
  • 1 篇 jiangnan univ sc...
  • 1 篇 southern methodi...
  • 1 篇 hefei comprehens...
  • 1 篇 3501 daxue rd pe...
  • 1 篇 univ pretoria de...
  • 1 篇 cent univ kerala...
  • 1 篇 nanyang technol ...
  • 1 篇 shandong prov ke...
  • 1 篇 dalian maritime ...
  • 1 篇 texas a&m univ e...
  • 1 篇 islamic azad uni...
  • 1 篇 islamic azad uni...
  • 1 篇 university of ou...

作者

  • 2 篇 wang chaoli
  • 2 篇 hao longyan
  • 2 篇 jing chonglin
  • 2 篇 guo xianping
  • 2 篇 herzberg m
  • 2 篇 yechiali u
  • 1 篇 niyato dusit
  • 1 篇 chafik sanaa
  • 1 篇 balochian saeed
  • 1 篇 zhou peixin
  • 1 篇 huang yonghui
  • 1 篇 daoui cherki
  • 1 篇 lan wei
  • 1 篇 heng zhang
  • 1 篇 wang jin-yuan
  • 1 篇 holzbaur u
  • 1 篇 shi yibo
  • 1 篇 anahtarci berkay
  • 1 篇 wen xian
  • 1 篇 xu yujing

语言

  • 33 篇 英文
  • 4 篇 其他
检索条件"主题词=Value iteration algorithm"
37 条 记 录,以下是11-20 订阅
排序:
Optimal decision strategy for discrete-time Markovian jump linear systems
收藏 引用
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE 2023年 第3期54卷 565-582页
作者: Zhu, Jin Zhang, Qingkun Univ Sci & Technol China Dept Automat Hefei Peoples R China Hefei Comprehens Natl Sci Ctr Inst Artificial Intelligence Hefei Peoples R China
This paper investigates the discrete-time Markovian jump linear systems (MJLSs) whose mode transition probability matrix (MTPM) can be adjusted by decisions. Motivated by switching law design in switched systems, the ... 详细信息
来源: 评论
Convergence and Numerical Complexity of Policy and value iterations in Linear-Quadratic Discrete-Time Reinforcement Learning  4
Convergence and Numerical Complexity of Policy and Value Ite...
收藏 引用
4th Modeling, Estimation, and Control Conference (MECC)
作者: Xu, Lingyi Gajic, Zoran Rutgers State Univ Dept Elect & Comp Engn Piscataway NJ 08854 USA
This paper demonstrates that the value iteration (VI) algorithm of reinforcement learning of discrete -time (DT) linear-quadratic (LQ) optimal control problem converges very slowly mostly linearly, compared to the qua... 详细信息
来源: 评论
Prospect-theoretic DRL Approach for Container Provisioning in Energy-constrained Edge Platforms  97
Prospect-theoretic DRL Approach for Container Provisioning i...
收藏 引用
97th IEEE Vehicular Technology Conference (VTC-Spring)
作者: Hlophe, M. C. Maharaj, B. T. Univ Pretoria Dept Elect Elect & Comp Engn Pretoria South Africa
Due to the increase in resource-constrained internet of things (IoT) devices, the multi-access edge computing (MEC) have become very competitive environments in terms of successful data offloading and allocation of co... 详细信息
来源: 评论
A Hybrid Handover Scheme for Vehicular VLC/RF Communication Networks
收藏 引用
SENSORS 2024年 第13期24卷 4323页
作者: Jia, Linqiong Feng, Shicheng Zhang, Yijin Wang, Jin-Yuan Nanjing Univ Sci & Technol Sch Elect & Opt Engn Nanjing 210094 Peoples R China Nanjing Univ Posts & Telecommun Sch Commun & Informat Engn Nanjing 210003 Peoples R China
Visible light communication (VLC) is a promising complementary technology to its radio frequency (RF) counterpart to satisfy the high quality-of-service (QoS) requirements of intelligent vehicular communications by re... 详细信息
来源: 评论
value iteration Networks with Double Estimator for Planetary Rover Path Planning
收藏 引用
SENSORS 2021年 第24期21卷 8418页
作者: Jin, Xiang Lan, Wei Wang, Tianlin Yu, Pengyao Dalian Maritime Univ Sch Naval Architecture & Ocean Engn Dalian 116026 Peoples R China
Path planning technology is significant for planetary rovers that perform exploration missions in unfamiliar environments. In this work, we propose a novel global path planning algorithm, based on the value iteration ... 详细信息
来源: 评论
Convergence and Numerical Complexity of Policy and value iterations in Linear-Quadratic Discrete-Time Reinforcement Learning
收藏 引用
IFAC-PapersOnLine 2024年 第28期58卷 96-101页
作者: Lingyi Xu Zoran Gajić Department of Electrical & Computer Engineering Rutgers The State University of New Jersey Piscataway NJ 08854 USA
This paper demonstrates that the value iteration (VI) algorithm of reinforcement learning of discrete-time (DT) linear-quadratic (LQ) optimal control problem converges very slowly mostly linearly, compared to the quad... 详细信息
来源: 评论
value iteration Solver Networks  3
Value Iteration Solver Networks
收藏 引用
3rd International Conference on Intelligent Autonomous Systems (ICoIAS)
作者: Urtans, Evalds Vecins, Valters Riga Tech Univ Riga Latvia
value iteration algorithm is iterative and can't be parallelized. Computation time grows exponentially when the size of the input maps is increased. We propose UNet-RNN-Skip artificial neural network architecture ... 详细信息
来源: 评论
Model-free optimal tracking policies for Markov jump systems by solving non-zero-sum games
收藏 引用
INFORMATION SCIENCES 2023年 第1期647卷
作者: Zhou, Peixin Xue, Huiwen Wen, Jiwei Shi, Peng Luan, Xaoli Jiangnan Univ Sch Internet Things Engn Key Lab Adv Proc Control Light Ind Minist Educ Wuxi 214122 Peoples R China Univ Adelaide Sch Elect & Mech Engn Adelaide SA 5005 Australia Obuda Univ Res & Innovat Ctr H-1034 Budapest Hungary
This paper develops model-free optimal tracking policies for Markov jump systems by solving nonzero-sum games (NZSGs). First, coupled action and mode-dependent value functions (CAMDVFs) are built for solving a two-pla... 详细信息
来源: 评论
Optimal rearrangement and preventive maintenance policies for heterogeneous balanced systems with three failure modes
收藏 引用
RELIABILITY ENGINEERING & SYSTEM SAFETY 2023年 第1期238卷
作者: Wang, Jingjing Liu, Huimin Lin, Tianran Qingdao Univ Technol Sch Management Engn Qingdao 266525 Peoples R China Qingdao Univ Technol Ctr Struct Acoust & Machine Fault Diag Qingdao 266525 Peoples R China
This paper studies a heterogeneous balanced system composed of multiple interchangeable components. The degradation process of components is described by a gamma process, and deterioration rates in different positions... 详细信息
来源: 评论
Reinforcement learning approach to the control of heavy material for robots
收藏 引用
COMPUTERS & ELECTRICAL ENGINEERING 2022年 第PartB期104卷
作者: Wu, Xiaoming Chi, Jing Jin, Xiao-Zheng Deng, Chao Qilu Univ Technol Shandong Acad Sci Sch Comp Sci & Technol Jinan 250353 Shandong Peoples R China Nat Supercomp Ctr Jinan Shandong Comp Sci Ctr Jinan 250014 Shandong Peoples R China Shandong Prov Key Lab Comp Networks Jinan 250014 Shandong Peoples R China Shandong Univ Finance & Econ Dept Comp Sci & Technol Jinan 250014 Shandong Peoples R China 3501 Daxue Rd Jinan Shandong Peoples R China
In this paper, we consider the optimal control problem of heavy material handling manipulators for agricultural robots. Unlike the existing results on agricultural robots, the robot parameters may be unknown for the d... 详细信息
来源: 评论