检索结果-内蒙古大学图书馆

Multi-static radar power allocation for multi-stage stochastic task of missile interception

IET RADAR SONAR AND NAVIGATION 2018年第5期12卷 540-548页

作者： Yang, Yichuan Zhang, Tianxian Yi, Wei Kong, Lingjiang Li, Xiaolong Yang, Xiaobo Univ Elect Sci & Technol China Sch Elect Engn Chengdu Sichuan Peoples R China

Considering a multi-stage stochastic task, in which a multi-static radar system (MSRS) is applied to assist with missile interception, the authors study an optimisation problem of radar resource management. Specifically, under restriction of a fixed energy budget, the authors devote to minimise the loss, which is caused by the unsuccessfully intercepted missiles, through optimal power allocation (OPA) of MSRS within multiple stages. The design of OPA can be translated into a sequential decision-making problem. The authors formulate the problem through variable definition and modelling the missile interception procedure. As the authors need to consider the randomness of multiple coupled stages and jointly allocation power between multiple radar nodes, to solve the proposed problem is of huge computational load. The authors propose a solution that combines with reinforcement learning and particle swarm optimisation. Comparing with the uniform power allocation scheme, the simulation results demonstrate that the OPA scheme designed by the proposed method is capable to achieve preferable and more stable performance for the whole missile interception. The authors' contributions include a novel optimisation resource management model for a multi-stage stochastic task and an effective solution for the optimal resource management scheme.

关键词： missiles military radar military computing stochastic processes minimisation decision making learning (artificial intelligence) particle swarm optimisation multistatic radar power allocation missile interception multi-stage stochastic task MSRS radar resource management optimisation problem fixed energy budget loss minimisation optimal power allocation OPA sequential decision-making problem multiple coupled stages computational load reinforcement learning particle swarm optimisation uniform power allocation scheme optimal resource management scheme

来源：评论

学校读者我要写书评

暂无评论

Reinforcement-Learning-Informed Prescriptive Analytics for Air Traffic Flow Management

引用

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING 2024年第3期21卷 4188-4202页

作者： Wang, Yuan Cai, Weilin Tu, Yilei Mao, Jianfeng Chinese Univ Hong Kong Shenzhen CUHK Shenzhen Sch Data Sci Shenzhen 518172 Peoples R China Univ Sci & Technol China Sch Informat Sci & Technol Hefei 230026 Peoples R China Shenzhen Res Inst Big Data Shenzhen 110004 Peoples R China Swiss Fed Inst Technol Dept Comp Sci CH-8092 Zurich Switzerland

Air Traffic Flow Management (ATFM) is a complex sequential decision-making problem that involves dynamically matching flights with sectors under changing environmental conditions. Finding an optimal solution for ATFM is challenging due to its dynamic nature and operational constraints. Reinforcement learning is a well-suited approach for sequential decision-making problems. However, ATFM poses three potential challenges: 1) large state space, 2) combinatorial action space, and 3) variational feasible action set, resulting from numerous agents with tightly-coupled constraints. These challenges can hinder the effectiveness of direct application of reinforcement learning methods. While prescriptive analytics can readily handle hard constraints via a mathematical optimization model, but it is computationally intractable for online sequential decision-making problems under changing environments. To address these challenges, we propose a novel framework, Reinforcement-Learning-Informed Prescriptive Analytics (RLIPA), in which an "informing" scheme is devised to integrate reinforcement learning and prescriptive analytics and leverage their strengths in predicting future reward and coping with hard constraints respectively. RLIPA is a general framework that can be adapted to other problems beyond ATFM, which typically involves many agents with tightly-coupled hard constraints. We demonstrate the usage and performance of RLIPA using numerical results and a real case study in comparison to two baseline *** to Practitioners-To improve Air Traffic Flow Management (ATFM) and reduce flight congestion, we propose a new method called reinforcement-learning-informed prescriptive analytics (RLIPA). RLIPA is a general framework that facilitates online sequential decision-making problems with multiple agents coupled with hard constraints. The approach consists of two stages: first, estimating future potential rewards for each agent via reinforcement learning, and second, in

关键词： Air traffic flow management reinforcement learning prescriptive analytics sequential decision-making problem

来源：评论

学校读者我要写书评

暂无评论

A practical implementation of stochastic programming: an application to the evaluation of option contracts in supply chains

引用

AUTOMATICA 2004年第5期40卷 743-756页

作者： van Delft, C Vial, JP Univ Geneva Dept Management Studies HEC CH-1211 Geneva 4 Switzerland Dept Ind Management & Log Grp HEC F-78351 Jouy En Josas France

Stochastic programming is a powerful analytical method in order to solve sequential decision-making problems under uncertainty. We describe an approach to build such stochastic linear programming models. We show that algebraic modeling languages make it possible for non-specialist users to formulate complex problems and have solved them by powerful commercial solvers. We illustrate our point in the case of option contracts in supply chain management and propose a numerical analysis of performance. We propose easy-to-implement discretization procedures of the stochastic process in order to limit the size of the event tree in a multi-period environment. (C) 2003 Elsevier Ltd. All rights reserved.

关键词： linear stochastic programming algebraic modeling language sequential decision-making problem event tree

来源：评论

学校读者我要写书评

暂无评论

Online/offline evolutionary algorithms for dynamic urban green space allocation problems

引用

JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE 2017年第4期29卷 843-867页

作者： Vallejo, M. Corne, D. Vargas, P. Heriot Watt Univ Intelligent Syst Lab Edinburgh Midlothian Scotland Heriot Watt Univ Robot Lab Edinburgh Midlothian Scotland

Urban-planning authorities continually face the problem of optimising the allocation of green space over time in developing urban environments. The problem is essentially a sequential decision-making task involving several interconnected and non-linear uncertainties, and requires time-intensive computation to evaluate the potential consequences of individual decisions. We explore the application of two very distinct frameworks incorporating evolutionary algorithm approaches for this problem: (i) an offline' approach, in which a candidate solution encodes a complete set of decisions, which is then evaluated by full simulation and (ii) an online' approach which involves a sequential series of optimisations, each making only a single decision, and starting its simulations from the endpoint of the previous run. We study the outcomes, in each case, in the context of a simulated urban development model, and compare their performance in terms of speed and quality. Our results show that the online version is considerably faster than the offline counterpart, without significant loss in performance.

关键词： Optimisation green spaces allocation evolutionary algorithms planning uncertainty sequential decision-making problem

来源：评论

学校读者我要写书评

暂无评论

A reinforcement learning-based algorithm for the aircraft maintenance routing problem

引用

EXPERT SYSTEMS WITH APPLICATIONS 2021年 169卷

作者： Ruan, J. H. Wang, Z. X. Chan, Felix T. S. Patnaik, S. Tiwari, M. K. Northwest A&F Univ Coll Econ & Management Yangling Shaanxi Peoples R China Hong Kong Polytech Univ Dept Ind & Syst Engn Hung Hom Hong Kong Peoples R China Dalian Univ Technol Inst Syst Engn Dalian Peoples R China Dongbei Univ Finance & Econ Sch Business Adm Dalian Peoples R China Indian Inst Technol Kharagpur Dept Ind & Syst Engn Kharagpur W Bengal India Natl Inst Ind Engn NITIE Mumbai 400087 Maharashtra India

With recent developments in the airline industry worldwide, the competition among the industry has increased largely with many key players in the market. In order to generate profits, the industry has paid much attention to generate optimal routes that are maintenance feasible. The main aim of operational aircraft maintenance routing problem (OAMRP) is to generate these optimal routes for each aircraft that are maintenance feasible and follow the constraints defined by the Federal Aviation Administration (FAA). In this paper, the OAMRP is studied with two main objectives. First, to propose a formulation of a network flow-based Integer Linear Programming (ILP) framework for the OAMRP that considers three main maintenance constraints simultaneously: maximum flyinghour, limit on the number of take-offs between two consecutive maintenance checks and the work-force capacity. Second, to develop a new reinforcement learning-based algorithm which can be used to solve the problem, quickly and efficiently, as compared to commonly available optimization software. Finally, the evaluation of the proposed algorithm on real case datasets obtained from a major airline located in the Middle East verifies that the algorithm generates high-quality solutions quickly for both medium and large-scale flight schedule dataset.

关键词： Aircraft routing problem sequential decision-making problem Markov decision Process (MDP) Reinforcement Learning

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：