In this paper, we study offline-to-online Imitation Learning (IL) that pretrains an imitation policy from static demonstration data, followed by fast finetuning with minimal environmental interaction. We find that the naïve combination of existing offline IL and online IL methods tends to behave poorly in this context, because the initial discriminator (often used in online IL) operates randomly and discordantly against the policy initialization, leading to misguided policy optimization and unlearning of pretraining knowledge. To overcome this challenge, we propose a principled offline-to-online IL method, named OLLIE, that simultaneously learns a near-expert policy initialization along with an aligned discriminator initialization, which can be seamlessly integrated into online IL, achieving smooth and fast finetuning. Empirically, OLLIE consistently and significantly outperforms the baseline methods in 20 challenging tasks, from continuous control to vision-based domains, in terms of performance, demonstration efficiency, and convergence speed. This work may serve as a foundation for further exploration of pretraining and finetuning in the context of IL. Copyright 2024 by the author(s).
UAWSNs face challenges such as long propagation delays, limited bandwidth, and varying channel conditions. To solve these problems, we developed a new protocol called Multi-Hop Cross-Layer Optimized Hybrid Automatic ...
Monocular 3D object detection is an important yet challenging problem in computer vision, with applications such as autonomous driving. A key limitation in advancing this field is the scarcity of annotated training da...
One of the fundamental differences in the perception of electric (e-) vehicles is how their radiated noise is perceived with respect to classic internal combustion engines. Even though e-vehicles are usually quieter, ...
In response to the critical need for environmentally sustainable practices, green energy certificates have emerged as a key mechanism to prevent greenwashing and ensure the authenticity of renewable energy sources. Wh...
We introduce and study the weighted version of an online matching problem in the Euclidean plane with non-crossing constraints: 2n points with non-negative weights arrive online, and an algorithm can match an arriving...
In the rapidly evolving landscape of digital media, video streaming has become a ubiquitous method of content delivery. However, the increasing sophistication of cyber threats poses significant risks to the security a...
Within the context of the digital revolution, the domain of smart healthcare has emerged as a promising area with the aim of enhancing patient care, streamlining clinical operations, and facilitating prompt medical in...
One of the biggest hurdles in microLED technology is the efficiency degradation with shrinking pixel size, due to etching damage. We present the scaling of µLED from 45 to 5 µm by MacEtch, with near size-ind...
In autonomous driving, safety assessment is becoming an essential component, particularly in the perception and analysis of the vehicle's surroundings. To make safe driving judgments, autonomous cars mostly rely on t...