To solve the problem of noise interference during satellite operation, a satellite attitude control method based on a fully actuated system method and active disturbance rejection technology is proposed. Firstly, a st...
详细信息
In this paper, we address a data-driven linear quadratic optimal control problem in which the regulator design is performed on-policy by resorting to approaches from reinforcement learning and model reference adaptive...
ISBN:
(纸本)9798350301243
In this paper, we address a data-driven linear quadratic optimal control problem in which the regulator design is performed on-policy by resorting to approaches from reinforcement learning and model reference adaptive control. In particular, a continuous-time identifier of the value function is used to generate online a reference model for the adaptive stabilizer. By introducing a suitably selected dithering signal, the resulting policy is shown to achieve asymptotic convergence to the optimal gain while the controlled plant reaches asymptotically the behavior of the optimal closed-loop system.
In Federated learning (FL), network end-nodes use private data for the local training of classification models that are periodically synchronized with a remote server to update a global model. A key limitation in FL i...
详细信息
ISBN:
(纸本)9798350377217;9798350377200
In Federated learning (FL), network end-nodes use private data for the local training of classification models that are periodically synchronized with a remote server to update a global model. A key limitation in FL is the high network traffic required to achieve acceptable accuracy, particularly when each client has samples of only a few classes, a problem known as pathological labels skew. Under such constraints, local models become biased toward the available classes, and more synchronization rounds are needed to converge, saturating the communication budget. We address this issue with Draft & Refine, a new resource management strategy to optimize accuracy by dynamically controlling clients' participation based on their remaining budget. Our approach consists of two learning phases: a low-effort drafting phase with only a few clients selected for synchronization, followed by a refinement phase with increased client participation. The available data traffic is used as the control variable to set the duration of the two phases. The proposed strategy outperforms state-of-the-art FL schemes with up to 12.20% higher accuracy within the same data traffic.
While real-world problems are often challenging to analyze analytically, deep learning excels in modeling complex processes from data. Existing optimization frameworks like CasADi facilitate seamless usage of solvers ...
详细信息
While real-world problems are often challenging to analyze analytically, deep learning excels in modeling complex processes from data. Existing optimization frameworks like CasADi facilitate seamless usage of solvers but face challenges when integrating learned process models into numerical optimizations. To address this gap, we present the learning for CasADi (L4CasADi) framework, enabling the seamless integration of PyTorch-learned models with CasADi for efficient and potentially hardware-accelerated numerical optimization. The applicability of L4CasADi is demonstrated with two tutorial examples: First, we optimize a fish's trajectory in a turbulent river for energy efficiency where the turbulent flow is represented by a PyTorch model. Second, we demonstrate how an implicit Neural Radiance Field environment representation can be easily leveraged for optimal control with L4CasADi. L4CasADi, along with examples and documentation, is available under MIT license at https://***/Tim-Salzmann/l4casadi
Whip targeting (to hit) is a challenging motor control task owing to movement rapidness (72% reductions of joint motion tracking control errors, compared to traditional learning and control algorithms. This integratio...
详细信息
ISBN:
(纸本)9798350386523;9798350386530
Whip targeting (to hit) is a challenging motor control task owing to movement rapidness (< 1.5 s) and accuracy, as well as tool flexibility. A learning-based arm control framework is proposed for manipulating a whip to hit targets in this paper. The framework combines a joint motion planner learned by the algorithm soft actor-critic (SAC) with online impedance adaptation control (OIAC). It was validated on a simulated arm and whip system with 54 degrees-of-freedom (DOFs), whose objective is to minimize the distance between the whip tip and a target in a 3D space. As a result, our integration of data-driven and physics-based algorithms allows for 89% improvement of hitting effectiveness and >72% reductions of joint motion tracking control errors, compared to traditional learning and control algorithms. This integration provides a novel proof-of-concept solution to fast targeting control in flexible object manipulation(1).
In the era of the Internet of Things (IoT), decentralized paradigms for machine learning are gaining prominence. In this paper, we introduce a federated learning model that capitalizes on the Euclidean distance betwee...
详细信息
ISBN:
(纸本)9781728190549
In the era of the Internet of Things (IoT), decentralized paradigms for machine learning are gaining prominence. In this paper, we introduce a federated learning model that capitalizes on the Euclidean distance between device model weights to assess their similarity and disparity. This is foundational for our system, directing the formation of coalitions among devices based on the closeness of their model weights. Furthermore, the concept of a barycenter, representing the average of model weights, helps in the aggregation of updates from multiple devices. We evaluate our approach using homogeneous and heterogeneous data distribution, comparing it against traditional federated learning averaging algorithm. Numerical results demonstrate its potential in offering structured, outperformed and communication-efficient model for IoT-based machine learning.
The robust iterative learningcontrol (RILC) can deal with the systems with unknown time-varying uncertainty to track a repeated reference signal. However, the existing robust designs consider all the possibilities of...
详细信息
The robust iterative learningcontrol (RILC) can deal with the systems with unknown time-varying uncertainty to track a repeated reference signal. However, the existing robust designs consider all the possibilities of uncertainty, which makes the design conservative and causes the controlled process converging to the reference trajectory slowly. To eliminate this weakness, a data-driven method is proposed. The new design intends to employ more information from the past input-output data to compensate for the robust control law and then to improve performance. The proposed control law is proved to guarantee convergence and accelerate the convergence rate. Ultimately, the experiments on a robot manipulator have been conducted to verify the good convergence of the trajectory errors under the control of the proposed method.
This article addresses the problem of finite-time interception against a high-speed maneuvering target formulated as a two-player zero-sum differential game framework with input constraints. The interceptor seeks to m...
详细信息
This article addresses the problem of finite-time interception against a high-speed maneuvering target formulated as a two-player zero-sum differential game framework with input constraints. The interceptor seeks to minimize the performance index and the maneuvering target seeks to maximize it. A nonquadratic value function is constructed to incorporate the control constraints into the differential game problem for both sides of the players. A finite-time convergent algorithm is proposed to solve the optimal value function of the differential game from the nonlinear Hamilton-Jacobi-Isaacs equation within finite time. The optimal game strategies are consequently learned using collected data of the engagement states based on the reinforcement learning. Uniform and global finite-time convergent property is confirmed via Lyapunov analysis with the obtained Nash strategies. Simulation results of an interceptor against a high-speed maneuvering target are provided to show the efficacy of the proposed method.
Massive integration of photovoltaic (PV) and energy storage systems (ESS) in the energy system increases the utilization of renewable energy and system complexity. Optimal and centralized scheduling and management of ...
详细信息
ISBN:
(纸本)9798350377958;9798350377941
Massive integration of photovoltaic (PV) and energy storage systems (ESS) in the energy system increases the utilization of renewable energy and system complexity. Optimal and centralized scheduling and management of these sources can make them more effective and acceptable in the energy community. In this regard, this paper proposes a virtual scheduling of the PV-ESS system based on a deep learning-based data-driven approach. The proposed system divides the time span into two segments based on the availability of PV generation. A deep learning model is applied to predict the day-ahead PV generation and demand during the two-time segments. An optimization algorithm is then applied to schedule the PV and ESS based on the predictive outcomes. For validating the proposed system, a community microgrid framework is considered where multiple PV and ESS systems are connected. In addition, the user-owned PV-ESS systems are controlled by the microgrid. The simulation results demonstrate the proposed system's performance in terms of reliability and sustainability.
Fast tool servo (FTS) systems using piezoelectric technology show great promise for ultra-precise machining due to their high precision and responsiveness. However, during machining, these systems often struggle with ...
详细信息
ISBN:
(纸本)9798331516246;9798331516239
Fast tool servo (FTS) systems using piezoelectric technology show great promise for ultra-precise machining due to their high precision and responsiveness. However, during machining, these systems often struggle with tracking accuracy due to factors like system complexity and external disturbances. To address this, we propose a robust control method specifically for piezo-driven FTS. Our approach aims to improve tracking accuracy and disturbance rejection during machining. We use a robust repetitive control scheme with a disturbance observer (DOB) to minimize the effect of interference on tracking. However, in practice, solving the inverse model of the controlled object can introduce uncertainty. To enhance the disturbance observer's performance, we add iterative learningcontrol (ILC). This helps refine control actions based on past errors, improving overall system performance.
暂无评论