Short term scheduling of multipurpose batch processes has received growing attention over past decades. It concerns the optimal allocation of a set of limited resources to tasks over time in order to enhance the reven...
详细信息
Short term scheduling of multipurpose batch processes has received growing attention over past decades. It concerns the optimal allocation of a set of limited resources to tasks over time in order to enhance the revenue of plants. This paper addresses the short term scheduling of batch processes through a continuous-time mixed integer linear programming (MILP) formulation based on the state-task network (STN) representation that allows to consider multiple intermediate due dates for market requirements. The proposed formulation can be classified as a slot-based approach that views the time horizon as a set of ordered blocks of unknown and variable lengths. Compared to previous similar approaches, it is simpler and leads to a smaller mathematical model without decoupling tasks from units. A few benchmark problems are used to illustrate the computational advantages of the proposed optimization approach.
In this paper, we study the adaptive optimal consensus control of leaderless multi-agent systems (MASs) with heterogeneous dynamics. First, the consensus control problem is converted into a graphical minimax game prob...
详细信息
ISBN:
(数字)9781728190938
ISBN:
(纸本)9781728190945
In this paper, we study the adaptive optimal consensus control of leaderless multi-agent systems (MASs) with heterogeneous dynamics. First, the consensus control problem is converted into a graphical minimax game problem and the corresponding algebraic Riccati equation (ARE) for each agent is obtained. Then, an on-policy reinforcement learning algorithm is proposed to online learn the optimal control policy without requiring the system dynamics. A certain rank condition is established to guarantee the convergence of the proposed online learning algorithm to the unique solution of the ARE. Finally, the effectiveness of the proposed algorithm is demonstrated through a numerical simulation.
For robust control and iterative optimization of industrial batch processes with polytopic uncertainties, this paper proposes a robust output feedback based iterative learning control (ILC) design in terms of finite f...
详细信息
For robust control and iterative optimization of industrial batch processes with polytopic uncertainties, this paper proposes a robust output feedback based iterative learning control (ILC) design in terms of finite frequency range stability specifications. Robust stability conditions for the closed-loop ILC system along both time and batch directions are first established based on the generalized Kalman-Yakubovich-Popov lemma and linear repetitive system theory. To facilitate the ILC controller design with respect to process uncertainties described in a polytopic form, extended sufficient conditions for the system stability are then derived in terms of matrix inequalities. Correspondingly, a two-stage heuristic approach is developed to iteratively compute feasible ILC controller gains for implementation. An illustrative example is given to demonstrate the effectiveness of the proposed control design.
This paper studies the distributed average tracking problem pertaining to a discrete-time linear time-invariant multi-agent network, which is subject to, concurrently, input delays, random packet-drops, and reference ...
详细信息
This paper develops a novel off-policy game Q-learning algorithm to solve the anti-interference control problem for discrete-time linear multi-player systems using only data without requiring system matrices to be kno...
详细信息
This paper develops a novel off-policy game Q-learning algorithm to solve the anti-interference control problem for discrete-time linear multi-player systems using only data without requiring system matrices to be known. The primary contribution of this paper lies in that the Q-learning strategy employed in the proposed algorithm is implemented in an off-policy policy iteration approach other than on-policy learning due to the well-known advantages of off-policy Q-learning over on-policy Q-learning. All of the players work hard together for the goal of minimizing their common performance index meanwhile defeating the disturbance that tries to maximize the specific performance index, and finally they reach the Nash equilibrium of the game resulting in satisfying disturbance attenuation condition. In order to find the solution to the Nash equilibrium, the anti-interference control problem is first transformed into an optimal control problem. Then an off-policy Q-learning algorithm is proposed in the framework of typical adaptive dynamic programming (ADP) and game architecture, such that control policies of all players can be learned using only measured data. Comparative simulation results are provided to verify the effectiveness of the proposed method.
The results of theoretical studies and experiments of the developed exhaust ventilation system with direct evaporative air-cooling with using "wet mats"are given. It has shown sufficient efficiency with incr...
详细信息
This work studies the control energy for directed networks which are exactly controllable. By employing Jordan transformation, we derive a simplified formula to calculate the control energy of directed networks. We sh...
详细信息
This work studies the control energy for directed networks which are exactly controllable. By employing Jordan transformation, we derive a simplified formula to calculate the control energy of directed networks. We show that the control energy of directed networks is a decreasing function of the terminal time, indicating that the more time to control, the less energy to cost. Additionally, we investigate the control energy for the Price network model. We show that for Price networks the control energy is a decreasing function of network heterogeneity and an increasing function of network size.
We consider the distributed average tracking(DAT) problem for multiple time-varying signals subject to both process and measurement noise under a directed time-varying communication network. In order to attenuate the ...
详细信息
We consider the distributed average tracking(DAT) problem for multiple time-varying signals subject to both process and measurement noise under a directed time-varying communication network. In order to attenuate the effect of the noise, we construct a Kalman filter for each agent, based on which we design a discrete-time consensus-based DAT algorithm. It is shown that the proposed algorithm can track the average of the reference signals with small steady-state error. A numerical example is presented to validate the theoretical results.
This paper studies a time-varying unconstrained convex optimization problem under the prediction-correction *** primary objective is to devise a simplified prediction-correction algorithm, which can solve the time-var...
详细信息
This paper studies a time-varying unconstrained convex optimization problem under the prediction-correction *** primary objective is to devise a simplified prediction-correction algorithm, which can solve the time-varying unconstrained convex optimization problem effectively without the need to compute the inverse of the Hessian matrix of the cost *** this aim, we propose a simplified prediction step and provide the theoretical analysis on the convergence of the resulting algorithm. The results show that, under suitable conditions, the error bound of our algorithm is O(h), which is at the same level as those reported in the literature. We demonstrate the validity of the proposed algorithm by a numerical example.
暂无评论