We introduce a constructive function approximation approach as a general tool, particularly useful in adaptive and data-driven methods for perception and control. The key idea is to estimate of a collection of simple ...
详细信息
Pneumonia is one of the top causes of death in Romania and early detection of this disease improves the recovery chances and shortens the length of hospitalization. In this work, we develop a solution for automatic pn...
详细信息
The present paper introduces a mathematical model for the cross-talking between microRNA and Protein. Studying the qualitative properties of the proposed model, we infer that the microRNA is an inhibitor for the Prote...
详细信息
Trajectory planning method is a research hotspot in autonomous driving. Existing reinforcement learning-based trajectory planning methods suffer from unstable performance due to the strong randomness of network weight...
详细信息
Trajectory planning method is a research hotspot in autonomous driving. Existing reinforcement learning-based trajectory planning methods suffer from unstable performance due to the strong randomness of network weight parameter updates during the training process. Therefore, this paper proposes a novel trajectory planning method based on deep reinforcement learning trust region policy optimization (TRPO). Firstly, in order to enhance the robustness of the trajectory planning method based on deep reinforcement learning TRPO, a TRPO-LSTM based decision model was proposed. More specifically, a long short term memory (LSTM) based state feature extraction network was designed and embeded into a TRPO-based decision model to enhance the ability of TRPO to extract information from the environmental state space. Secondly, in order to make the planned trajectory adaptive to the dynamic changes of traffic environment, we presented a novel TRPO-LSTM trajectory fitting algorithm. To the best of our knowledge, this is the first work aiming at applying the TRPO-LSTM based decision model in the trajectory fitting process to search the optimal longitudinal trajectory speed. Finally, the proposed trajectory planning method was implemented and simulated on the CARLA simulator. The experimental results show that, compared with existing trajectory planning methods based on deep reinforcement learning algorithms, our proposed method achieves a cumulative reward improvement of over 28.9% in the scenario of four lane highway, and has better robustness. Meanwhile, the proposed method can achieve a lower collision rate of 0.93% while improving the average speed and comfort of vehicle driving. IEEE
Quadrotors play a significant role in our lives and are transforming our *** cable-suspended loads is an unavoidable quadrotor application trend and a hot research topic in the control ***,the load swing and unpredict...
详细信息
Quadrotors play a significant role in our lives and are transforming our *** cable-suspended loads is an unavoidable quadrotor application trend and a hot research topic in the control ***,the load swing and unpredictability pose significant challenges to the quadrotor's *** this paper,an anti-swing controller with an inner-outer control strategy for the quadrotor-slung load transportation system is *** facilitate the controller design,the outer position dynamics are restructured in the form of ***,a virtual controller is created to force the underactuated states to the dynamic surface to ensure the position subsystem's *** improve robustness,an adaptive law is used to eliminate the effects of uncertain cable ***,a dynamic surface controller for the inner attitude subsystem is presented to drive the actual force to the virtual *** is demonstrated that the control strategy can stabilize the quadrotor despite mass and cable length *** results are provided to demonstrate the efficacy and durability of the proposed method.
Accurate state of charge (SOC) estimation is crucial for the safe operation of lithium-ion batteries (LIBs), yet existing methods are limited by sensitivity to initial SOC guess or high computational complexity. To ad...
详细信息
Regularized system identification has become the research frontier of system identification in the past *** related core subject is to study the convergence properties of various hyper-parameter estimators as the samp...
详细信息
Regularized system identification has become the research frontier of system identification in the past *** related core subject is to study the convergence properties of various hyper-parameter estimators as the sample size goes to *** this paper,we consider one commonly used hyper-parameter estimator,the empirical Bayes(EB).Its convergence in distribution has been studied,and the explicit expression of the covariance matrix of its limiting distribution has been ***,what we are truly interested in are factors contained in the covariance matrix of the EB hyper-parameter estimator,and then,the convergence of its covariance matrix to that of its limiting distribution is *** general,the convergence in distribution of a sequence of random variables does not necessarily guarantee the convergence of its covariance ***,the derivation of such convergence is a necessary complement to our theoretical analysis about factors that influence the convergence properties of the EB hyper-parameter *** this paper,we consider the regularized finite impulse response(FIR)model estimation with deterministic inputs,and show that the covariance matrix of the EB hyper-parameter estimator converges to that of its limiting ***,we run numerical simulations to demonstrate the efficacy of ourtheoretical results.
Motion systems are a vital part of many industrial processes. However, meeting the increasingly stringent demands of these systems, especially concerning precision and throughput, requires novel control design methods...
详细信息
The third generation of neural networks is called spiking neural networks. Spiking neural networks can not only answer all the problems that can be solved by common neural networks, they can also be computationally mo...
详细信息
We consider word-of-mouth social learning involving m Kalman filter agents that operate sequentially. The first Kalman filter receives the raw observations, while each subsequent Kalman filter receives a noisy measure...
详细信息
暂无评论