Search query: subject term = "Value Iteration Algorithm"
37 records; showing 1-10
Value iteration algorithm for continuous-time linear quadratic stochastic optimal control problems
Science China (Information Sciences), 2024, vol. 67, no. 2, pp. 170-180
Authors: Guangchen WANG; Heng ZHANG
Affiliation: School of Control Science and Engineering, Shandong University
In this study, we investigate a continuous-time infinite-horizon linear quadratic stochastic optimal control problem with multiplicative noise in control and state variables. Using the techniques of stochastic stabili...
Data-driven optimal tracking control of discrete-time linear systems with multiple delays via the value iteration algorithm
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2022, vol. 53, no. 14, pp. 2845-2859
Authors: Hao, Longyan; Wang, Chaoli; Zhang, Guang; Jing, Chonglin; Shi, Yibo
Affiliations: Univ Shanghai Sci & Technol, Dept Control Sci & Engn, Shanghai 200093, Peoples R China; Univ Shanghai Sci & Technol, Business Sch, Shanghai, Peoples R China; Univ Shanghai Sci & Technol, Coll Sci, Shanghai, Peoples R China
In this paper, the optimal tracking problem for discrete-time linear systems with multiple delays is studied without system dynamics. It is known that the total state of a system without specific dynamic characteristi...
Value iteration algorithm for mean-field games
SYSTEMS & CONTROL LETTERS, 2020, vol. 143, p. 104744
Authors: Anahtarci, Berkay; Kariksiz, Can Deha; Saldi, Naci
Affiliation: Ozyegin Univ, Istanbul, Turkey
In the literature, existence of mean-field equilibria has been established for discrete-time mean field games under both the discounted cost and the average cost optimality criteria. In this paper, we provide a value ...
Two-Player Stackelberg Game for Linear System via Value Iteration Algorithm
28th IEEE International Symposium on Industrial Electronics (IEEE-ISIE)
Authors: Li, Man; Qin, Jiahu; Ding, Lei
Affiliations: Univ Sci & Technol China, Dept Automat, Hefei 230027, Anhui, Peoples R China; Nanjing Univ Posts & Telecommun, Inst Adv Technol, Nanjing 210023, Jiangsu, Peoples R China
This paper investigates a hierarchical decision-making problem for two players governed by a continuous-time linear system. Such a problem is formulated as a Stackelberg game, in which one player, called leader, has t...
A Modified Value Iteration Algorithm for Discounted Markov Decision Processes
JOURNAL OF ELECTRONIC COMMERCE IN ORGANIZATIONS, 2015, vol. 13, no. 3, pp. 47-57
Authors: Chafik, Sanaa; Daoui, Cherki
Affiliation: Univ Sultan Moulay Slimane, Lab Informat Proc & Decis Support, Beni Mellal, Morocco
As many real applications need a large amount of states, the classical methods are intractable for solving large Markov Decision Processes. The decomposition technique basing on the topology of each state in the assoc...
Incremental value iteration for optimal output regulation of linear systems with unknown exosystems
NEUROCOMPUTING, 2025, vol. 626
Authors: Jing, Chonglin; Wang, Chaoli; Liang, Dong; Xu, Yujing; Hao, Longyan
Affiliations: Univ Shanghai Sci & Technol, Dept Control Sci & Engn, Shanghai 200093, Peoples R China; High Tech Inst, Fan Gong Ting South St 12th, Weifang 261000, Peoples R China
This paper addresses the optimal output regulation problem for discrete-time linear systems with completely unknown dynamics and unmeasurable exosystem states. The primary objective is to design incremental dataset-ba...
Zero-sum risk-sensitive continuous-time stochastic games with unbounded reward and transition rates in Borel spaces
AUTOMATICA, 2025, vol. 177
Authors: Zhang, Junyu; Guo, Xianping; Xia, Li
Affiliations: Sun Yat Sen Univ, Sch Math, Guangzhou 510275, Peoples R China; Sun Yat Sen Univ, Sch Business, Guangzhou 510275, Peoples R China; Sun Yat Sen Univ, Guangdong Prov Key Lab Computat Sci, Guangzhou, Peoples R China
This paper investigates a finite-horizon two-player zero-sum risk-sensitive stochastic game in continuous-time Markov chains with Borel state and action spaces. The model accommodates unbounded reward rates, transitio...
Risk probability optimization of finite horizon piecewise deterministic Markov decision processes
OPTIMIZATION, 2025, vol. 74, no. 7, pp. 1697-1721
Authors: Huo, Haifeng; Wen, Xian
Affiliation: Guangxi Univ Sci & Technol, Sch Sci, Liuzhou, Peoples R China
This paper investigates the piecewise deterministic Markov decision processes (PDMDPs) under the risk probability criterion. The optimality problem is to minimize the probability that the finite horizon total costs ar...
Dynamic soft-kill weapon-target assignment in naval environments
COMPUTERS & INDUSTRIAL ENGINEERING, 2024, vol. 197
Authors: Tashakori, Sadegh; Ranjbar, Mohammad; Balochian, Saeed; Sharif-Razavian, Javad; Peymankar, Mahboobeh
Affiliations: Ferdowsi Univ Mashhad, Fac Engn, Dept Ind Engn, Mashhad, Iran; Islamic Azad Univ, Dept Elect Engn, Mashhad Branch, Mashhad, Iran; Islamic Azad Univ, Dept Comp Engn, Mashhad Branch, Mashhad, Iran; Hakim Sabzevari Univ, Dept Ind Engn, Sabzevar, Iran
One of the most significant threats faced by ships is anti-ship missiles. Nowadays, these missiles, equipped with diverse guidance systems, can locate their trajectory and attack the ship. Consequently, ships need to ...
Optimal control of a dynamic production-inventory system with various cost criteria
ANNALS OF OPERATIONS RESEARCH, 2024, vol. 337, no. 1, pp. 75-103
Authors: Golui, Subrata; Pal, Chandan; Manikandan, R.; Sobhanan, Abhay
Affiliations: Indian Inst Technol Guwahati, Dept Math, Gauhati 781039, Assam, India; Cent Univ Kerala, Dept Math, Kasaragod 671320, Kerala, India; Univ S Florida, Dept Ind & Management Syst Engn, Tampa, FL 33620, USA
In this article, we investigate the dynamic control problem of a production-inventory system. Here, demands arrive at the production unit according to a Poisson process and are processed in an FCFS manner. The process...