In this paper,we study the robustness property of policy optimization(particularly Gauss-Newton gradient descent algorithm which is equivalent to the policy iteration in reinforcement learning)subject to noise at each...
详细信息
In this paper,we study the robustness property of policy optimization(particularly Gauss-Newton gradient descent algorithm which is equivalent to the policy iteration in reinforcement learning)subject to noise at each *** invoking the concept of input-to-state stability and utilizing Lyapunov's direct method,it is shown that,if the noise is sufficiently small,the policy iteration algorithm converges to a small neighborhood of the optimal solution even in the presence of noise at each *** expressions of the upperbound on the noise and the size of the neighborhood to which the policies ultimately converge are *** on Willems'fundamental lemma,a learning-based policy iteration algorithm is *** persistent excitation condition can be readily guaranteed by checking the rank of the Hankel matrix related to an exploration *** robustness of the learning-based policy iteration to measurement noise and unknown system disturbances is theoretically demonstrated by the input-to-state stability of the policy *** numerical simulations are conducted to demonstrate the efficacy of the proposed method.
Android applications are becoming increasingly powerful in recent years. While their functionality is still of paramount importance to users, the energy efficiency of these applications is also gaining more and more a...
详细信息
Android applications are becoming increasingly powerful in recent years. While their functionality is still of paramount importance to users, the energy efficiency of these applications is also gaining more and more attention. Researchers have discovered various types of energy defects in Android applications, which could quickly drain the battery power of mobile devices. Such defects not only cause inconvenience to users, but also frustrate Android developers as diagnosing the energy inefficiency of a software product is a non-trivial task. In this work, we perform a literature review to understand the state of the art of energy inefficiency diagnosis for Android applications. We identified 55 research papers published in recent years and classified existing studies from four different perspectives, including power estimation method, hardware component, types of energy defects, and program analysis approach. We also did a cross-perspective analysis to summarize and compare our studied techniques. We hope that our review can help structure and unify the literature and shed light on future research, as well as drawing developers' attention to build energy-efficient Android applications.
Chronic kidney disease (CKD) is a prominent disease that causes loss of functionality in the kidney. Doctors can now more easily gather patient health status data due to the growth of the Internet of Health Things (Io...
详细信息
This article discusses the importance of cloud-based multi-tenancy in private–public-private secure cloud environments, which is achieved through the isolation of end-user data and resources into tenants to ensure da...
详细信息
Heterogeneous networks are promising solutions for enhancing network performance of LTE-A mobile networks by deploying small cells within the area of the serving macro cells. The goal of deploying such networks is to ...
详细信息
A reconfigurable buck-boost multi-ratio Charge Pump (CP) based on the cross-coupled topology is presented in this paper. The proposed architecture aims to merge different ratios and different operation modes in one co...
详细信息
The brain tumor (BT) is a severe condition caused by abnormal cell growth. If left untreated, the BT may result in a variety of harsh conditions, including death. As a consequence of the significance of automatic BT d...
详细信息
Single sample per person face recognition (SSPP FR) is one of the most challenging problems in FR due to the extreme lack of enrolment data. To date, the most popular SSPP FR methods are the generic learning methods, ...
详细信息
作者:
Huang, Po-HsunHsiao, Tzu-Chien
Hsinchu300 Taiwan Nycu
Department of Computer Science College of Cs and Institute of Biomedical Engineering College of Electrical and Computer Engineering Hsinchu300 Taiwan
The determination of appropriate parameters and an appropriate window size in most entropy-based measurements of time-series complexity is a challenging problem. Inappropriate settings can lead to the loss of intrinsi...
详细信息
Neural machine translation (NMT) has become an essential tool for breaking down language barriers and facilitating communication between different cultures and communities. However, NMT’s potential impact is limited ...
详细信息
暂无评论