作者:
Atheupe, Gael P.Martinez, DidierMonsuez, Bruno
Renault Technical Centre Renault Group & Ensta Paris Paris France Renault Technical Centre
Renault Group Dept. Chassis Control & Adas Systems Guyancourt France
Ensta Paris Dept. Computer Science & Systems Engineering Paris France
The transition to vehicle electrification introduces new demands on chassis dynamics, paving the way for advances in driving dynamics, safety, and efficiency. A key consideration arises: how are driving torque impulse...
In this work, a novel reinforcement learning-based adaptive fault-tolerant control (FTC) scheme with actuator redundancy is presented for a nonlinear strict-feedback system with nonlinear dynamics and uncertainties. A...
详细信息
In this work, a novel reinforcement learning-based adaptive fault-tolerant control (FTC) scheme with actuator redundancy is presented for a nonlinear strict-feedback system with nonlinear dynamics and uncertainties. A learning-based switching function technique is established to steer different groups of actuators automatically and successively to mitigate the impact of faulty actuators by observing a switching performance index. The optimal tracking control problem (OTCP) of strict-feedback nonlinear systems is transformed into an equivalent optimal regulation problem of each affine subsystem via adaptive feedforward controllers. Subsequently, the designed objective functions associated with Hamilton–Jacobi–Bellman (HJB) estimate errors caused by neural network (NN) approximations can be minimized by the reinforcement learning algorithm without value or policy iterations. It is proved that the tracking objective can be achieved and all signals in the closed-loop system can be guaranteed to be bounded, as long as the minimum time interval between two successive failures is bounded. Theoretical results are verified by simulations.
A hidden Markov model(HMM)comprises a state with Markovian dynamics that can only be observed via noisy *** paper considers three problems connected to HMMs,namely,inverse filtering,belief estimation from actions,and ...
详细信息
A hidden Markov model(HMM)comprises a state with Markovian dynamics that can only be observed via noisy *** paper considers three problems connected to HMMs,namely,inverse filtering,belief estimation from actions,and privacy enforcement in such a ***,the authors discuss how HMM parameters and sensor measurements can be reconstructed from posterior distributions of an HMM ***,the authors consider a rational decision-maker that forms a private belief(posterior distribution)on the state of the world by filtering private *** authors show how to estimate such posterior distributions from observed optimal actions taken by the *** the setting of adversarial systems,the authors finally show how the decision-maker can protect its private belief by confusing the adversary using slightly sub-optimal *** range from financial portfolio investments to life science decision systems.
Subsystem reduction of interconnected systems provides a computationally cheap and structure-preserving alternative to direct model order reduction techniques of large-scale systems. Unfortunately, for this method, er...
详细信息
Motion systems are a vital part of many industrial processes. However, meeting the increasingly stringent demands of these systems, especially concerning precision and throughput, requires novel control design methods...
详细信息
ISBN:
(数字)9798350316339
ISBN:
(纸本)9798350316346
Motion systems are a vital part of many industrial processes. However, meeting the increasingly stringent demands of these systems, especially concerning precision and throughput, requires novel control design methods that can go beyond the capabilities of traditional solutions. Traditional control methods often struggle with the complexity and position-dependent effects inherent in modern motion systems, leading to compromises in performance and a laborious task of controller design. This paper addresses these challenges by introducing a novel structured feedback control auto-tuning approach for multiple-input multiple-output (MIMO) motion systems. By leveraging frequency response function (FRF) estimates and the linear-parameter-varying (LPV) control framework, the proposed approach automates the controller design, while providing local stability and performance guarantees. Key innovations include norm-based magnitude optimization of the sensitivity functions, an automated stability check through a novel extended factorized Nyquist criterion, a modular structured MIMO LPV controller parameterization, and a controller discretization approach which preserves the continuous-time (CT) controller parameterization. The proposed approach is validated through experiments using a state-of-the-art moving-magnet planar actuator prototype.
The high programmability provided by Software Defined Networking (SDN) paradigm facilitated the integration of Machine Learning (ML) methods to design a new family of network management schemes. Among them, we can cit...
详细信息
The high programmability provided by Software Defined Networking (SDN) paradigm facilitated the integration of Machine Learning (ML) methods to design a new family of network management schemes. Among them, we can cite self-driving networks, where ML is used to analyze data and define strategies that are then translated into network configurations by the SDN controllers, making the networks autonomous and capable of auto-scaling decisions based on the network’s needs. Despite their attractiveness, the centralized design of the majority of proposed solutions cannot keep up with the increasing size of the network. To this end, this paper investigates the use of a multi-agent reinforcement learning (MARL) model for auto-scaling decisions in an SDN environment. In particular, we study two possible alternatives for distributing operations: a collaborative one, where controllers share the same observations, and an individual one, where controllers make decisions according to their own logic and share only some basic information, such as the network topology. After an experimental campaign performed both on Mininet and GENI, results showed that both approaches can guarantee high throughput while minimizing the set of active resources.
This paper consider solving a class of nonconvex-strongly-convex distributed stochastic bilevel optimization (DSBO) problems with personalized inner-level objectives. Most existing algorithms require computational loo...
详细信息
Reinforcement learning (RL) has gained wide attention, but its implementation in autonomous vehicles is still limited by insufficient sample efficiency and heavy training costs. The training efficiency of RL agents is...
详细信息
Most companies relay on centralized servers, which are considered as a fast and easy to deploy solution for file sharing, but they have many downsides, like security and trust issues, which can be solved using DLT (bl...
详细信息
Deep learning frameworks promote the development of artificial intelligence and demonstrate considerable potential in numerous ***,the security issues of deep learning frameworks are among the main risks preventing th...
详细信息
Deep learning frameworks promote the development of artificial intelligence and demonstrate considerable potential in numerous ***,the security issues of deep learning frameworks are among the main risks preventing the wide application of *** on deep learning frameworks by malicious internal or external attackers would exert substantial effects on society and *** start with a description of the framework of deep learning algorithms and a detailed analysis of attacks and vulnerabilities in *** propose a highly comprehensive classification approach for security issues and defensive approaches in deep learning frameworks and connect different attacks to corresponding defensive ***,we analyze a case of the physical-world use of deep learning security *** addition,we discuss future directions and open issues in deep learning *** hope that our research will inspire future developments and draw attention from academic and industrial domains to the security of deep learning frameworks.
暂无评论