In this paper, the inverse linear quadratic(LQ) problem over finite time-horizon is *** the output observations of a dynamic process, the goal is to recover the corresponding LQ cost function. Firstly, by considering ...
详细信息
In this paper, the inverse linear quadratic(LQ) problem over finite time-horizon is *** the output observations of a dynamic process, the goal is to recover the corresponding LQ cost function. Firstly, by considering the inverse problem as an identification problem, its model structure is shown to be strictly globally identifiable under the assumption of system invertibility. Next, in the noiseless case a necessary and sufficient condition is proposed for the solvability of a positive semidefinite weighting matrix and its unique solution is obtained with two proposed algorithms under the condition of persistent excitation. Furthermore, a residual optimization problem is also formulated to solve a best-fit approximate cost function from sub-optimal observations. Finally, numerical simulations are used to demonstrate the effectiveness of the proposed methods.
This paper presents a computationally-efficient method for evaluating the feasibility of Quadratic Programs (QPs) for online constrained control. Based on the duality principle, we first show that the feasibility of a...
详细信息
This work presents a Fleet Manager for a fleet of Autonomous Mobile Robots (AMRs) that perform material handling tasks in a shared environment. The Fleet Manager assigns AMRs to newly released tasks, computes paths fo...
详细信息
ISBN:
(数字)9798350358513
ISBN:
(纸本)9798350358520
This work presents a Fleet Manager for a fleet of Autonomous Mobile Robots (AMRs) that perform material handling tasks in a shared environment. The Fleet Manager assigns AMRs to newly released tasks, computes paths for them to travel to the task’s locations, and schedules their travel along the computed paths so that conflicts with other AMRs are avoided. The objective is for each AMR to complete its task as quickly as possible, to then be assigned a new *** Fleet Manager works online, assigning a released task to the AMR closest to the task’s location, and then computing the path and schedule to fit in with the already assigned and executing AMRs. Conflicts occur when, in order to reach their targets, AMRs would have to simultaneously occupy the same space. Resolving this is done by appropriate scheduling, or by moving idle AMRs out of the way. For fleet management to be practicable, the computation time for assigning an AMR to a task and computing its path and schedule must be negligible compared to other system *** were conducted to evaluate the performance of the Fleet Manager on a number of benchmark problem instances, counting up to hundreds of AMRs. The results show that the presented Fleet Manager can handle these systems quickly enough to be practically useful in real industrial scenarios.
This paper introduces a novel control framework to address the satisfaction of multiple time-varying output constraints in uncertain high-order MIMO nonlinear controlsystems. Unlike existing methods, which often assu...
详细信息
With increasing adoption of residential PV systems, net load forecasting is gradually shifting from forecasting pure load to forecasting pure load with PV generation. This paper explicitly compares two methods of net ...
详细信息
We investigate the equilibrium stability and robustness in a class of moving target defense problems, in which players have both incomplete information and asymmetric cognition. We first establish a Bayesian Stackelbe...
详细信息
We demonstrate a novel bending structure on silicon nitride towards a monolithically integrated laser platform. This bending structure has a deep etched groove along the outer side, and can greatly reduce the footprin...
详细信息
Offline safe reinforcement learning (RL) aims to train a constraint satisfaction policy from a fixed dataset. Current state-of-the-art approaches are based on supervised learning with a conditioned policy. However, th...
详细信息
With the rising complexity of our electricity infrastructure, smart grid simulations increasingly rely on co-simulation, which involves jointly executing independent subsystem simulations. However, in large-scale simu...
详细信息
Building on our recent research on neural heuristic quantization systems, results on learning quantized motions and resilience to channel dropouts are reported. We propose a general emulation problem consistent with t...
Building on our recent research on neural heuristic quantization systems, results on learning quantized motions and resilience to channel dropouts are reported. We propose a general emulation problem consistent with the neuromimetic paradigm. This optimal quantization problem can be solved by model predictive control (MPC), but because the optimization step involves integer programming, the approach suffers from combinatorial complexity when the number of input channels becomes large. Even if we collect data to train a neural network, collection of training data and the training itself are still time-consuming. Therefore, we propose a general Deep Q Network (DQN) algorithm that not only learns trajectories but also exhibits the advantages of resilience to channel dropout. Furthermore, to transfer the model to other emulation problems, a mapping-based transfer learning approach can be used directly on the current model to obtain the optimal trajectory steps for the new emulation problems.
暂无评论