The purpose of this paper is to generalize the differential dynamic programming technique to such an extent that it can tackle multicriteria optimization problems with the same advantage of overcoming the dimensionali...
详细信息
The purpose of this paper is to generalize the differential dynamic programming technique to such an extent that it can tackle multicriteria optimization problems with the same advantage of overcoming the dimensionality difficulty. Fundamental concepts, underlying ideas and basic formulas which form the basis of a multicriteria differential dynamic programming algorithm are introduced and derived. The algorithm is described in detail. A fundamental convergence theorem is shown which can be applied to a wide class of multicriteria optimization problems.
This paper proposes a new computational guidance algorithm using differential dynamic programming and sparse Gauss-Hermite quadrature rule. By the application of sparse Gauss-Hermite quadrature rule, numerical differe...
详细信息
This paper proposes a new computational guidance algorithm using differential dynamic programming and sparse Gauss-Hermite quadrature rule. By the application of sparse Gauss-Hermite quadrature rule, numerical differentiation in the calculation of Hessian matrices and gradients in differential dynamic programming is avoided. Based on the new differential dynamic programming approach developed, a three-dimensional computational algorithm is proposed to control the impact angle and impact time for an air-to-surface interceptor. Extensive numerical simulations are performed to show the effectiveness of the proposed approach.
In this paper, we present a nonlinear H-infinity-optimal control algorithm for a systemwhose dynamics can be described by the summation of two terms: a known func-tion obtained from system modeling and an unknown func...
详细信息
In this paper, we present a nonlinear H-infinity-optimal control algorithm for a systemwhose dynamics can be described by the summation of two terms: a known func-tion obtained from system modeling and an unknown function that represents themodel error induced by the disturbance and the noise that are not captured by theoriginal model. A Gaussian Process (GP) is utilized as an alternative to a super-vised artificial neural network to update the nominal dynamics of the system andprovide disturbance estimates based on data gathered through interaction with thesystem. A soft-constrained two-player zero-sum differential game that is equivalentto the disturbance attenuation problem in nonlinear H-infinity-optimal control is then for-mulated to synthesis the H(infinity )controller. The differential game is solved through theGame-Theoretic differential dynamic programming (GT-DDP) algorithm in contin-uous time. In addition we provide a proof of quadratic convergence of the proposedGT-DDP algorithm. Simulation results on a quadcopter system demonstrate the effi-ciency of the learning-based control algorithm in handling model uncertainties andexternal disturbances.
This paper presents a contact-implicit model predictive control (MPC) framework for the real-time discovery of multi-contact motions, without predefined contact mode sequences or foothold positions. This approach util...
详细信息
This paper presents a contact-implicit model predictive control (MPC) framework for the real-time discovery of multi-contact motions, without predefined contact mode sequences or foothold positions. This approach utilizes the contact-implicit differential dynamic programming (DDP) framework, merging the hard contact model with a linear complementarity constraint. We propose the analytical gradient of the contact impulse based on relaxed complementarity constraints to further the exploration of a variety of contact modes. By leveraging a hard contact model-based simulation and computation of search direction through a smooth gradient, our methodology identifies dynamically feasible state trajectories, control inputs, and contact forces while simultaneously unveiling new contact mode sequences. However, the broadened scope of contact modes does not always ensure real-world applicability. Recognizing this, we implemented differentiable cost terms to guide foot trajectories and make gait patterns. Furthermore, to address the challenge of unstable initial roll-outs in an MPC setting, we employ the multiple shooting variant of DDP. The efficacy of the proposed framework is validated through simulations and real-world demonstrations using a 45 kg HOUND quadruped robot, performing various tasks in simulation and showcasing actual experiments involving a forward trot and a front-leg rearing motion.
We propose a design of data-driven Model Predictive Control (MPC) using a suboptimal trajectory and the linear time-varying (LTV) models from data-driven trajectory optimisation that achieves offset-free tracking. Dat...
详细信息
We propose a design of data-driven Model Predictive Control (MPC) using a suboptimal trajectory and the linear time-varying (LTV) models from data-driven trajectory optimisation that achieves offset-free tracking. Data-driven constrained differential dynamic programming (CDDP) is exploited to improve the trajectory iteratively without the knowledge of the nonlinear model. A trajectory is divided to the transient and steady state regions, controlled by the Linear time-varying MPC (LTVMPC) and the offset-free linear MPC (LMPC), respectively. We prove the feasibility of the proposed LTVMPC in the transient region, and the offset-free tracking property of LMPC. The proposed scheme is validated to a continuous stirred tank reactor (CSTR) process. Simulation studies show that the suboptimal trajectory and LTV models are generated by CDDP, and the proposed MPC achieves offset-free tracking and disturbance rejection for a set of initial conditions and set points in the operating region.
This study integrates the genetic algorithm (GA) and constrained differential dynamic programming (CDDP) to design the pump-treat-inject system. The proposed model. considers both the cost of installing wells (fixed c...
详细信息
This study integrates the genetic algorithm (GA) and constrained differential dynamic programming (CDDP) to design the pump-treat-inject system. The proposed model. considers both the cost of installing wells (fixed cost) and the operating cost of pumping, injection and water treatment. To minimize the total cost white meeting the water quality constraints, the model can compute the optimal number and locations of wells, as welt as the associated optimal pumping and injection schemes. Various numerical cases reveal that the requirement to balance the total volume between pumping and injection can significantly influence the final optimal design. (C) 2007 Elsevier B.V. All rights reserved.
With regard to the dynamic obstacles current unmanned aerial vehicles encountered in practical applications, an integral suboptimal trajectory programming method was proposed. It tackled with multiple constraints simu...
详细信息
With regard to the dynamic obstacles current unmanned aerial vehicles encountered in practical applications, an integral suboptimal trajectory programming method was proposed. It tackled with multiple constraints simultaneously while guiding the unmanned aerial vehicle to execute autonomous avoidance maneuver. The kinetics of both unmanned aerial vehicle and dynamic obstacles were established with appropriate hypotheses. Then it was assumed that the unmanned aerial vehicle was faced with terminal constraints and control constraints in the whole duration. Meanwhile, the performance index was established as minimum control efforts. The initial trajectory was generated according to optimized model predictive static programming. Next, the slack variables were introduced to transform the inequality constraints arising from dynamic obstacle avoidance into equality constraints. In addition, sliding mode control theory was utilized to determine these slack variables' dynamics by designing the approaching law of sliding mode. Then the avoidance trajectory for single or multiple dynamic obstacles was developed by this combined method. At last, a further trajectory optimization was conducted by differential dynamic programming. Consequently, the integral problem was solved step by step and numerical simulations demonstrated that the integral method possessed high computational efficiency.
The optimal control of manipulators is a key to the success of automated manufacturing. The problem considered here is an energy minimisation problem with given dynamics and is subject to actuator constraints. A diffe...
详细信息
The optimal control of manipulators is a key to the success of automated manufacturing. The problem considered here is an energy minimisation problem with given dynamics and is subject to actuator constraints. A differential dynamic programming algorithm is developed to solve the discrete-time optimal control problem. This method allows calculation of the joint reference trajectories and the design of a proportional derivative regulator. The characteristics of this new method are exposed and the simulation results shown.
The objective of this study is to apply the differential dynamic programming (DDP) technique of optimal control to heating, ventilating and air-conditioning (HVAC) systems and to compare its performance with a non-lin...
详细信息
The objective of this study is to apply the differential dynamic programming (DDP) technique of optimal control to heating, ventilating and air-conditioning (HVAC) systems and to compare its performance with a non-linear programming (NLP) technique using the sequential quadratic programming method. The DDP technique is briefly described and studied. Limitations of the technique are noted. Three cases of a system that has been treated previously in the literature are optimized by the two techniques and the computational times compared, The study shows DDP to be efficient compared with NLP for the example problems. NLP is, however, more robust and general and can treat constraints on the state variables directly. Further investigation is needed for larger-scale problems to fully explore the features of the two methods.
We formulate a locally superlinearly convergent projected Newton method for constrained minimization in a Cartesian product of balls. For discrete-time,N-stage, input-constrained optimal control problems with Bolza ob...
详细信息
We formulate a locally superlinearly convergent projected Newton method for constrained minimization in a Cartesian product of balls. For discrete-time,N-stage, input-constrained optimal control problems with Bolza objective functions, we then show how the required scaled tangential component of the objective function gradient can be approximated efficiently with a differential dynamic programming scheme; the computational cost and the storage requirements for the resulting modified projected Newton algorithm increase linearly with the number of stages. In calculations performed for a specific control problem with 10 stages, the modified projected Newton algorithm is shown to be one to two orders of magnitude more efficient than a standard unscaled projected gradient method.
暂无评论