In this poster session we are reporting on the results of two, three-week summer graduate teaching experiences that took place in Nanjing, China over a two-year period. A faculty exchange program was entered into betw...
详细信息
ISBN:
(纸本)9781605587653
In this poster session we are reporting on the results of two, three-week summer graduate teaching experiences that took place in Nanjing, China over a two-year period. A faculty exchange program was entered into between Southeast University of Nanjing China and Purdue University Calumet of Hammond, Indiana, USA. One of the goals of the exchange program was to expose Chinese students to the instructional methods employed by United States Universities. By understanding the cultural differences and utilizing various teaching methodologies employed by American teachers, the faculty and students involved in these three-week classroom intensive training courses were able to adapt and successfully complete the graduate level material that was presented.
Machine learning models are increasingly being adopted across various fields, such as medicine, business, autonomous vehicles, and cybersecurity, to analyze vast amounts of data, detect patterns, and make predictions ...
详细信息
Glaucoma is an ophthalmic disorder which results in permanent vision loss because high intraocular pressure damages the optic nerve in the eye. This paper proposes a two-stage network for automated glaucoma identifica...
详细信息
To handle input and output time delays that commonly exist in many networked control systems(NCSs), a new robust continuous sliding mode control(CSMC) scheme is proposed for the output tracking in uncertain single inp...
详细信息
To handle input and output time delays that commonly exist in many networked control systems(NCSs), a new robust continuous sliding mode control(CSMC) scheme is proposed for the output tracking in uncertain single input-single-output(SISO) networked control systems. This scheme consists of three consecutive steps. First, although the network-induced delay in those systems can be effectively handled by using Pade approximation(PA), the unmatched disturbance cames out as another difficulty in the control design. Second, to actively estimate this unmatched disturbance, a generalized proportional integral observer(GPIO) technique is utilized based on only one measured state. Third, by constructing a new sliding manifold with the aid of the estimated unmatched disturbance and states, a GPIO-based CSMC is synthesized, which is employed to cope with not only matched and unmatched disturbances, but also networkinduced delays. The stability of the entire closed-loop system under the proposed GPIO-based CSMC is detailedly *** promising tracking efficiency and feasibility of the proposed control methodology are verified through simulations and experiments on Quanser's servo module for motion control under various test conditions.
A brain tumor is the abnormal cells that growth in the brain, and it is considered as one of the most dangerous diseases that lead to the cause of death. Diagnosis at early is important for increasing the survival rat...
详细信息
The integration of social networks with the Internet of Things (IoT) has been explored in recent research, giving rise to the Social Internet of Things (SIoT). One promising application of SIoT is viral marketing, whi...
详细信息
Reinforcement learning(RL) has been widely adopted for intelligent decision-making in embodied agents due to its effective trial-and-error learning capabilities. However, most RL methods overlook the causal relationsh...
详细信息
Reinforcement learning(RL) has been widely adopted for intelligent decision-making in embodied agents due to its effective trial-and-error learning capabilities. However, most RL methods overlook the causal relationships among states and actions during policy exploration and lack the human-like ability to distinguish signal from noise and reason with important abstractions, resulting in poor sample efficiency. To address this issue, we propose a novel method named causal action empowerment(CAE) for efficient RL, designed to improve sample efficiency in policy learning for embodied agents. CAE identifies and leverages causal relationships among states, actions, and rewards to extract controllable state variables and reweight actions for prioritizing high-impact behaviors. Moreover, by integrating a causality-aware empowerment term, CAE significantly enhances an embodied agent's execution of causally-aware behavior for more efficient exploration via boosting controllability in complex embodied environments. Benefiting from these two improvements, CAE bridges the gap between local causal discovery and global causal empowerment. To comprehensively evaluate the effectiveness of CAE, we conduct extensive experiments across 25 tasks in 5 diverse embodied environments, encompassing both locomotion and manipulation skill learning with dense and sparse reward settings. Experimental results demonstrate that CAE consistently outperforms existing methods across this wide range of scenarios, offering a promising avenue for improving sample efficiency in RL.
GPT is widely recognized as one of the most versatile and powerful large language models, excelling across diverse domains. However, its significant computational demands often render it economically unfeasible for in...
详细信息
This study introduces an integrated real-time monitoring system to enhance driver safety. The system incorporates facial recognition, alcohol detection, and drowsiness monitoring to comprehensively analyze the driver...
详细信息
Exploration strategy design is a challenging problem in reinforcement learning(RL),especially when the environment contains a large state space or sparse *** exploration,the agent tries to discover unexplored(novel)ar...
详细信息
Exploration strategy design is a challenging problem in reinforcement learning(RL),especially when the environment contains a large state space or sparse *** exploration,the agent tries to discover unexplored(novel)areas or high reward(quality)*** existing methods perform exploration by only utilizing the novelty of *** novelty and quality in the neighboring area of the current state have not been well utilized to simultaneously guide the agent’s *** address this problem,this paper proposes a novel RL framework,called clustered reinforcement learning(CRL),for efficient exploration in *** adopts clustering to divide the collected states into several clusters,based on which a bonus reward reflecting both novelty and quality in the neighboring area(cluster)of the current state is given to the *** leverages these bonus rewards to guide the agent to perform efficient ***,CRL can be combined with existing exploration strategies to improve their performance,as the bonus rewards employed by these existing exploration strategies solely capture the novelty of *** on four continuous control tasks and six hard-exploration Atari-2600 games show that our method can outperform other state-of-the-art methods to achieve the best performance.
暂无评论