Imitation learning, in which learning is performed by demonstration, has been studied and advanced for sequential decision-making tasks in which a reward function is not predefined. However, imitation learning methods...
详细信息
Imitation learning, in which learning is performed by demonstration, has been studied and advanced for sequential decision-making tasks in which a reward function is not predefined. However, imitation learning methods still require numerous expert demonstration samples to successfully imitate an expert's behavior. To improve sample efficiency, we utilize self-supervised representation learning, which can generate vast training signals from the given data. In this study, we propose a self-supervised representation-based adversarial imitation learning method to learn state and action representations that are robust to diverse distortions and temporally predictive, on non-image control tasks. In particular, in comparison with existing self-supervised learning methods for tabular data, we propose a different corruption method for state and action representations that is robust to diverse distortions. We theoretically and empirically observe that making an informative feature manifold with less sample complexity significantly improves the performance of imitation learning. The proposed method shows a 39% relative improvement over existing adversarial imitation learning methods on MuJoCo in a setting limited to 100 expert state-action pairs. Moreover, we conduct comprehensive ablations and additional experiments using demonstrations with varying optimality to provide insights into a range of factors.
We present a technique for information-theoretic optimization of computational imaging systems demonstrated in snapshot 3D microscopy. By directly evaluating measurement quality and decoupling optimization from downst...
详细信息
作者:
Butola, RajatLi, YimingKola, Sekhar ReddyNational Yang Ming Chiao Tung University
Parallel and Scientific Computing Laboratory Electrical Engineering and Computer Science International Graduate Program Hsinchu300093 Taiwan National Yang Ming Chiao Tung University
Institute of Communications Engineering the Institute of Biomedical Engineering the Department of Electronics and Electrical Engineering the Institute of Pioneer Semiconductor Innovation and the Institute of Artificial Intelligence Innovation Hsinchu300093 Taiwan
Machine learning (ML) is poised to play an important part in advancing the predicting capability in semiconductor device compact modeling domain. One major advantage of ML-based compact modeling is its ability to capt...
详细信息
From the perspective of majorization theory, we study how to enhance the entanglement of a two-mode squeezed vacuum (TMSV) state by using local filtration operations. We present several schemes achieving entanglement ...
详细信息
From the perspective of majorization theory, we study how to enhance the entanglement of a two-mode squeezed vacuum (TMSV) state by using local filtration operations. We present several schemes achieving entanglement enhancement with photon addition and subtraction, and then consider filtration as a general probabilistic procedure consisting in acting with local (nonunitary) operators on each mode. From this, we identify a sufficient set of two conditions for these filtration operators to successfully enhance the entanglement of a TMSV state, namely, the operators must be Fock orthogonal (i.e., preserving the orthogonality of Fock states) and Fock amplifying (i.e., giving larger amplitudes to larger Fock states). Our results notably prove that ideal photon addition, subtraction, and any concatenation thereof always enhance the entanglement of a TMSV state in the sense of majorization theory. We further investigate the case of realistic photon addition (subtraction) and are able to upper bound the distance between a realistic photon-added (-subtracted) TMSV state and a nearby state that is provably more entangled than the TMSV, thus extending entanglement enhancement to practical schemes via the use of a notion of approximate majorization. Finally, we consider the state resulting from k-photon addition (on each of the two modes) on a TMSV state. We prove analytically that the state corresponding to k=1 majorizes any state corresponding to 2≤k≤8 and we conjecture the validity of the statement for all k≥9.
Recent advances in distribution networks, driven by the integration of renewable energy sources, have spurred the emergence of microgrids, elevating concerns regarded reliability and stability. In this context, precis...
详细信息
Transcranial Magnetic Stimulation (TMS) is a non-invasive brain stimulation technique used for the treatment of depression, as well as various neurological and psychiatric disorders. There has been ongoing interest in...
详细信息
This paper discusses a method for classification of breast cancer imaging data through the application of an adaptive neuro-fuzzy inference system (ANFIS) and particle swarm optimization (PSO) for hyperparameter optim...
详细信息
ISBN:
(数字)9798331513269
ISBN:
(纸本)9798331513276
This paper discusses a method for classification of breast cancer imaging data through the application of an adaptive neuro-fuzzy inference system (ANFIS) and particle swarm optimization (PSO) for hyperparameter optimization of the ANFIS system. A robust parameter tuning method is used to select the optimal configuration for the ANFIS and PSO components without expert knowledge of the dataset. Using these methods, high classification accuracies can be achieved for both the original and diagnostic versions of the Wisconsin Breast Cancer Dataset. These results demonstrate the flexibility and potential of a joint ANFIS-PSO system for automated diagnosis while retaining system simplicity and linguistic interpretability to support clinical decision-making.
The non-orthogonal multiple access(NOMA)method is a novel multiple access technique that aims to increase spectral efficiency(SE)and accommodate enormous user ***-user signals are superimposed and transmitted in the p...
详细信息
The non-orthogonal multiple access(NOMA)method is a novel multiple access technique that aims to increase spectral efficiency(SE)and accommodate enormous user ***-user signals are superimposed and transmitted in the power domain at the transmitting end by actively implementing controllable interference information,and multi-user detection algorithms,such as successive interference cancellation(SIC),are performed at the receiving end to demodulate the necessary user *** its basic signal waveform,like LTE baseline,could be based on orthogonal frequency division multiple access(OFDMA)or discrete Fourier transform(DFT)-spread OFDM,NOMA superimposes numerous users in the power *** contrast to the orthogonal transmission method,the nonorthogonal method can achieve higher spectrum ***,it will increase the complexity of its *** power allocation techniques will have a direct impact on the system’s *** a result,in order to boost the system capacity,an efficient power allocation mechanism must be *** research developed an efficient technique based on conjugate gradient to solve the problem of downlink power *** major goal is to maximize the users’maximum weighted sum *** suggested algorithm’s most notable feature is that it converges to the global optimal *** compared to existing methods,simulation results reveal that the suggested technique has a better power allocation capability.
In our study, we explore methods for detecting unwanted content lurking in visual datasets. We provide a theoretical analysis demonstrating that a model capable of successfully partitioning visual data can be obtained...
暂无评论