During the COVID-19 coronavirus epidemic, people usually wear masks to prevent the spread of the virus, which has become a major obstacle when we use face-based computer vision techniques such as face recognition and ...
详细信息
In actor-critic reinforcement learning(RL)algorithms,function estimation errors are known to cause ineffective random exploration at the beginning of training,and lead to overestimated value estimates and suboptimal *...
详细信息
In actor-critic reinforcement learning(RL)algorithms,function estimation errors are known to cause ineffective random exploration at the beginning of training,and lead to overestimated value estimates and suboptimal *** this paper,we address the problem by executing advantage rectification with imperfect demonstrations,thus reducing the function estimation *** with expert demonstrations has been widely adopted to accelerate the learning process of deep reinforcement learning when simulations are expensive to ***,existing methods,such as behavior cloning,often assume the demonstrations contain other information or labels with regard to performances,such as optimal assumption,which is usually incorrect and useless in the real *** this paper,we explicitly handle imperfect demonstrations within the actor-critic RL frameworks,and propose a new method called learning from imperfect demonstrations with advantage rectification(LIDAR).LIDAR utilizes a rectified loss function to merely learn from selective demonstrations,which is derived from a minimal assumption that the demonstrating policies have better performances than our current *** learns from contradictions caused by estimation errors,and in turn reduces estimation *** apply LIDAR to three popular actor-critic algorithms,DDPG,TD3 and SAC,and experiments show that our method can observably reduce the function estimation errors,effectively leverage demonstrations far from the optimal,and outperform state-of-the-art baselines consistently in all the scenarios.
The Large Language Model (LLM) has demonstrated significant capabilities in intelligent robotics and Autonomous Driving(AD). Compared to traditional end-to-end models, decision reasoning in the form of language exhibi...
详细信息
Remote sensing image classification is a popular yet challenging field. Many researchers have combined convolutional neural networks (CNNs) and Transformers for hyperspectral imaging (HSI) classification tasks. Howeve...
详细信息
Container based microservices have been widely applied to promote the cloud elasticity. The mainstream Docker containers are structured in layers, which are organized in stack with bottom-up dependency. To start a mic...
详细信息
As an important part of CRH(China Railway High-speed) trains, the stability and stationarity of a suspension system is of great significance to the vehicle system. Based on the framework of probability relevant princi...
详细信息
Predicting the upcoming weather instances is very crucial. It depends on different climatic parameters like humidity, pressure, temperature, etc. In this paper, the historical data of the weather in the India area is ...
详细信息
Implementing precise network traffic prediction is essential for optimizing base station strategies, allocating network resources effectively, reducing costs, and ensuring service quality in network management. Curren...
详细信息
Owing to the influence of sampling loss,cavity difference and detecting source,the multi-optical parameter measurement of atmospheric aerosol cannot be detected simultaneously in the same reference *** order to solve ...
详细信息
Owing to the influence of sampling loss,cavity difference and detecting source,the multi-optical parameter measurement of atmospheric aerosol cannot be detected simultaneously in the same reference *** order to solve this problem,a new method of simultaneously detecting the aerosol optical parameters by coupling cavity ring-down spectrometer with photoacoustic spectroscopy is ***,the coupled photoacoustic cavity is formed by the organic fusion of the photoacoustic cavity and the ring-down ***,the integrated design of the coupling spectroscopy system is carried ***,the extinction coefficient and absorption coefficient of aerosol are measured simultaneously by the system,and then the aerosol scattering coefficient and single albedo are calculated *** accuracy of the system is verified by comparing with the data from the environmental quality monitoring station,which provides a new idea for the detection of multi-optical characteristics of atmospheric aerosol.
In this challenging world, social media plays a vital role as it is at the pinnacle of data sharing. The advancement in technology has made a huge amount of information available for data analysis and it is on the hot...
详细信息
暂无评论