In recent years, the rise of online Knowledge Management Systems (KMSs) has significantly improved work efficiency in enterprises. Knowledge development prediction, as a critical application within these online platfo...
详细信息
The popularity of quadrotor Unmanned Aerial Vehicles(UAVs)stems from their simple propulsion systems and structural ***,their complex and nonlinear dynamic behavior presents a significant challenge for control,necessi...
详细信息
The popularity of quadrotor Unmanned Aerial Vehicles(UAVs)stems from their simple propulsion systems and structural ***,their complex and nonlinear dynamic behavior presents a significant challenge for control,necessitating sophisticated algorithms to ensure stability and accuracy in *** strategies have been explored by researchers and control engineers,with learning-based methods like reinforcement learning,deep learning,and neural networks showing promise in enhancing the robustness and adaptability of quadrotor control *** paper investigates a Reinforcement Learning(RL)approach for both high and low-level quadrotor control systems,focusing on attitude stabilization and position tracking tasks.A novel reward function and actor-critic network structures are designed to stimulate high-order observable states,improving the agent’s understanding of the quadrotor’s dynamics and environmental *** address the challenge of RL hyper-parameter tuning,a new framework is introduced that combines Simulated Annealing(SA)with a reinforcement learning algorithm,specifically Simulated Annealing-Twin Delayed Deep Deterministic Policy Gradient(SA-TD3).This approach is evaluated for path-following and stabilization tasks through comparative assessments with two commonly used control methods:Backstepping and Sliding Mode Control(SMC).While the implementation of the well-trained agents exhibited unexpected behavior during real-world testing,a reduced neural network used for altitude control was successfully implemented on a Parrot Mambo mini *** results showcase the potential of the proposed SA-TD3 framework for real-world applications,demonstrating improved stability and precision across various test scenarios and highlighting its feasibility for practical deployment.
作者:
Sun, HanbingSun, Fuchun
State Key Lab on Intelligence Technology and Systems Department of Computer Science and Technology Tsinghua University Beijing China
This article introduces pedestrian trajectory prediction, which is a crucial step in the perception of autonomous driving. The controller system should predict the person’s motion before making a decision. Pedestrian...
详细信息
Data publishing methods can provide available information for analysis while preserving *** multiple sensitive attributes data publishing,which preserves the relationship between sensitive attributes,may keep many rec...
详细信息
Data publishing methods can provide available information for analysis while preserving *** multiple sensitive attributes data publishing,which preserves the relationship between sensitive attributes,may keep many records from being grouped and bring in a high record suppression *** category of multiple sensitive attributes data publishing,which reduces the possibility of record suppression by breaking the relationship between sensitive attributes,cannot provide the sensitive attributes association for ***,the existing multiple sensitive attributes data publishing fails to fully account for the comprehensive information *** acquire a guaranteed information utility,this article defines comprehensive information loss that considers both the suppression of records and the relationship between sensitive attributes.A heuristic method is leveraged to discover the optimal anonymity scheme that has the lowest comprehensive information *** experimental results verify the practice of the proposed data publishing method with multiple sensitive *** proposed method can guarantee information utility when compared with previous ones.
Recent studies have shown that the multi-encoder models are agnostic to the choice of context, and the context encoder generates noise which helps improve the models in terms of BLEU score. In this paper, we further e...
详细信息
The COVID-19 pandemic has led to a global medical crisis and significant disruptions to daily life since its emergence in December 2019. Rapidly, it spread to 218 countries affecting more than 754 million people. The ...
详细信息
In the field of aquaponics, where fish and plants coexist in a symbiotic environment, closely monitoring nitrate levels in the water is crucial due to their profound impact on aquatic and plant well-being. Traditional...
详细信息
Wireless federated learning (FL) is a collaborative machine learning (ML) framework in which wireless client-devices independently train their ML models and send the locally trained models to the FL server for aggrega...
详细信息
Wireless federated learning (FL) is a collaborative machine learning (ML) framework in which wireless client-devices independently train their ML models and send the locally trained models to the FL server for aggregation. In this paper, we consider the coexistence of privacy-sensitive client-devices and privacy-insensitive yet computing-resource constrained client-devices, and propose an FL framework with a hybrid centralized training and local training. Specifically, the privacy-sensitive client-devices perform local ML model training and send their local models to the FL server. Each privacy-insensitive client-device can have two options, i.e., (i) conducting a local training and then sending its local model to the FL server, and (ii) directly sending its local data to the FL server for the centralized training. The FL server, after collecting the data from the privacy-insensitive client-devices (which choose to upload the local data), conducts a centralized training with the received datasets. The global model is then generated by aggregating (i) the local models uploaded by the client-devices and (ii) the model trained by the FL server centrally. Focusing on this hybrid FL framework, we firstly analyze its convergence feature with respect to the client-devices' selections of local training or centralized training. We then formulate a joint optimization of client-devices' selections of the local training or centralized training, the FL training configuration (i.e., the number of the local iterations and the number of the global iterations), and the bandwidth allocations to the client-devices, with the objective of minimizing the overall latency for reaching the FL convergence. Despite the non-convexity of the joint optimization problem, we identify its layered structure and propose an efficient algorithm to solve it. Numerical results demonstrate the advantage of our proposed FL framework with the hybrid local and centralized training as well as our proposed alg
With recent advancements in industrial robots, educating students in new technologies and preparing them for the future is imperative. However, access to industrial robots for teaching poses challenges, such as the hi...
详细信息
Finite-sum optimization has wide applications in machine learning, covering important problems such as support vector machines, regression, *** this paper, we initiate the study of solving finite-sum optimization prob...
详细信息
Finite-sum optimization has wide applications in machine learning, covering important problems such as support vector machines, regression, *** this paper, we initiate the study of solving finite-sum optimization problems by quantum ***, let f1, ..., fn : d → be -smooth convex functions and ψ: d → be a µ-strongly convex proximal *** goal is to find an ϵ-optimal point for F(x) = n1 Pni=1 fi(x) + ψ(x).We give a quantum algorithm with complexity Õ(Equation presented) 1 improving the classical tight bound (Equation presented).We also prove a quantum lower bound Ω˜(n + n3/4(/µ)1/4) when d is large *** our quantum upper and lower bounds can extend to the cases where ψ is not necessarily strongly convex, or each fi is Lipschitz but not necessarily *** addition, when F is nonconvex, our quantum algorithm can find_an ϵ-critial point using (Equation presented) queries. Copyright 2024 by the author(s)
暂无评论