Imitation learning, in which learning is performed by demonstration, has been studied and advanced for sequential decision-making tasks in which a reward function is not predefined. However, imitation learning methods...
详细信息
Imitation learning, in which learning is performed by demonstration, has been studied and advanced for sequential decision-making tasks in which a reward function is not predefined. However, imitation learning methods still require numerous expert demonstration samples to successfully imitate an expert's behavior. To improve sample efficiency, we utilize self-supervised representation learning, which can generate vast training signals from the given data. In this study, we propose a self-supervised representation-based adversarial imitation learning method to learn state and action representations that are robust to diverse distortions and temporally predictive, on non-image control tasks. In particular, in comparison with existing self-supervised learning methods for tabular data, we propose a different corruption method for state and action representations that is robust to diverse distortions. We theoretically and empirically observe that making an informative feature manifold with less sample complexity significantly improves the performance of imitation learning. The proposed method shows a 39% relative improvement over existing adversarial imitation learning methods on MuJoCo in a setting limited to 100 expert state-action pairs. Moreover, we conduct comprehensive ablations and additional experiments using demonstrations with varying optimality to provide insights into a range of factors.
Lauded as the world's most popular fruit, bananas are one of the main exports of the Philippines. It contains essential nutrients that can have a protective impact on health. Commonly, farmers can distinguish the ...
详细信息
ISBN:
(数字)9798350374186
ISBN:
(纸本)9798350374193
Lauded as the world's most popular fruit, bananas are one of the main exports of the Philippines. It contains essential nutrients that can have a protective impact on health. Commonly, farmers can distinguish the different types of bananas and potentially predict their shelf-life. Nevertheless, human judgment is not always reliable. It needs to have support and evidence to minimize the margin of error. Hence, this study focuses on developing an android application to determine the Philippine bananas' ripeness, variety, and shelf-life. This application identifies different bananas sold in the local market, specifically Cardava, Lakatan, and Latundan. Also, the application determines the shelf-life of bananas, such as the remaining days for unripe bananas to become ripe and ripe bananas to overripe. A Convolution Neural Network-based (CNN) model classifier is used as an architecture categorizing the Philippine local market bananas. The proponents used real-life image data to increase the accuracy of the CNN model. The application was proven effective and has reached 90% accuracy.
This paper describes the automation of a forearm prosthesis using the signal collected by a Fiber Bragg Grating (FBG) sensor. The FBG sensor is applied to one subject's forearm to measure the deformation as a resu...
详细信息
Copy-move forgeries often exploit homogeneous regions in images with large-scale attacks to either highlight or conceal target objects. These manipulations are simple to execute but challenging to notice. Forgery dete...
详细信息
While the rise of electric vehicles reflects a push for clean energy as envisioned by international agreements like the Paris Agreement, wireless charging's convenience can overcome limitations and hasten their ad...
详细信息
Vision-based neural networks as artificial intelligence models have been critical in many manufacturing industries, including automotive, food, and aerospace. Machine vision and deep learning have provided practical, ...
详细信息
Vehicle-to-grid (V2G) technology supporting bidirectional power transfer allows electric vehicles (EVs) to contribute and consume energy bidirectionally. Because the specific properties and requirements of V2G, Khan e...
详细信息
A semi-analytical finite element method(SAFEM),based on the two-scale asymptotic homogenization method(AHM)and the finite element method(FEM),is implemented to obtain the effective properties of two-phase fiber-reinfo...
详细信息
A semi-analytical finite element method(SAFEM),based on the two-scale asymptotic homogenization method(AHM)and the finite element method(FEM),is implemented to obtain the effective properties of two-phase fiber-reinforced composites(FRCs).The fibers are periodically distributed and unidirectionally aligned in a homogeneous *** framework addresses the static linear elastic micropolar problem through partial differential equations,subject to boundary conditions and perfect interface contact *** mathematical formulation of the local problems and the effective coefficients are presented by the *** local problems obtained from the AHM are solved by the FEM,which is denoted as the *** numerical results are provided,and the accuracy of the solutions is analyzed,indicating that the formulas and results obtained with the SAFEM may serve as the reference points for validating the outcomes of experimental and numerical computations.
Aberrant RNA splicing events resulting from DNA variations are common causes of genetic disorders. Two studies published in Nature Genetics independently describe methods to decipher DNA-variant-associated aberrant sp...
详细信息
Aberrant RNA splicing events resulting from DNA variations are common causes of genetic disorders. Two studies published in Nature Genetics independently describe methods to decipher DNA-variant-associated aberrant splicing using high-throughput RNA sequencing data.
In this paper, a method using deep reinforcement learning is proposed to deal with the 3D online bin packing problem. The packing objects are not limited to several specific or fixed cuboid objects, but are composed o...
详细信息
暂无评论