Adaptive brain stimulation can treat neurological conditions such as Parkinson's disease and post-stroke motor deficits by influencing abnormal neural activity. Because of patient heterogeneity, each patient requi...
Adaptive brain stimulation can treat neurological conditions such as Parkinson's disease and post-stroke motor deficits by influencing abnormal neural activity. Because of patient heterogeneity, each patient requires a unique stimulation policy to achieve optimal neural responses. Model-free reinforcement learning (MFRL) holds promise in learning effective policies for a variety of similar control tasks, but is limited in domains like brain stimulation by a need for numerous costly environment interactions. In this work we introduce Coprocessor Actor Critic, a novel, model-based reinforcement learning (MBRL) approach for learning neural coprocessor policies for brain stimulation. Our key insight is that coprocessor policy learning is a combination of learning how to act optimally in the world and learning how to induce optimal actions in the world through stimulation of an injured brain. We show that our approach overcomes the limitations of traditional MFRL methods in terms of sample efficiency and task success and outperforms baseline MBRL approaches in a neurologically realistic model of an injured brain.
This paper presents a study on applying on-device machine learning (ML) algorithms to enhance MAC layer protocols in wireless communications. It focuses on the MU-MIMO Grouping algorithm and explores the benefits of e...
详细信息
ISBN:
(数字)9798350303582
ISBN:
(纸本)9798350303599
This paper presents a study on applying on-device machine learning (ML) algorithms to enhance MAC layer protocols in wireless communications. It focuses on the MU-MIMO Grouping algorithm and explores the benefits of executing ML models directly on devices such as computers, smartphones, and IoT devices. This approach promises improved speed, privacy, security, and adaptability in dynamic networks. The paper evaluates the effectiveness of this strategy in Wi-Fi and Mas-sive MIMO scenarios, demonstrating significant system capacity enhancement, latency reduction, and improved user experience. Additionally, it examines the interaction between on-device ML and changing network environments, underscoring the method's adaptability and robustness. This research represents a significant advancement in MAC layer protocols using on-device ML and may inspire future innovations in wireless networks.
Mean aortic pressure (MAP) is a primary measurement for monitoring blood and O2 delivery to major organs. Prolonged periods of hypotension, low MAP, lead to low tissue perfusion and subsequent end organ damage. Patien...
详细信息
Physically Unclonable Function (PUF) is a useful and versatile lightweight hardware security primitive that takes advantage of unavoidable and unpredictable random process variations during chip manufacturing. This pa...
详细信息
We improve the best known upper bounds on the density of corner-free sets over quasirandom groups from inverse poly-logarithmic to quasi-polynomial. We make similarly substantial improvements to the best known lower b...
详细信息
Data-driven offline model-based optimization (MBO) is an established practical approach to black-box computational design problems for which the true objective function is unknown and expensive to query. However, the ...
The deployment of Reinforcement Learning to robotics applications faces the difficulty of reward engineering. Therefore, approaches have focused on creating reward functions by Learning from Observations (LfO) which i...
The deployment of Reinforcement Learning to robotics applications faces the difficulty of reward engineering. Therefore, approaches have focused on creating reward functions by Learning from Observations (LfO) which is the task of learning policies from expert trajectories that only contain state sequences. We propose new methods for LfO for the important class of continuous control problems of learning to stabilize, by introducing intermediate proxy models acting as reward functions between the expert and the agent policy based on Lyapunov stability theory. Our LfO training process consists of two steps. The first step attempts to learn a Lyapunov-like landscape proxy model from expert state sequences without access to any kinematics model, and the second step uses the learned landscape model to guide in training the learner's policy. We formulate novel learning objectives for the two steps that are important for overall training success. We evaluate our methods in real automobile robot environments and other simulated stabilization control problems in model-free settings, like Quadrotor control and maintaining upright positions of Hopper in MuJoCo. We compare with state-of-the-art approaches and show the proposed methods can learn efficiently with less expert observations.
We introduce Conformal Decision Theory, a framework for producing safe autonomous decisions despite imperfect machine learning predictions. Examples of such decisions are ubiquitous, from robot planning algorithms tha...
详细信息
ISBN:
(数字)9798350384574
ISBN:
(纸本)9798350384581
We introduce Conformal Decision Theory, a framework for producing safe autonomous decisions despite imperfect machine learning predictions. Examples of such decisions are ubiquitous, from robot planning algorithms that rely on pedestrian predictions, to calibrating autonomous manufacturing to exhibit high throughput and low error, to the choice of trusting a nominal policy versus switching to a safe backup policy at run-time. The decisions produced by our algorithms are safe in the sense that they come with provable statistical guarantees of having low risk without any assumptions on the world model whatsoever; the observations need not be I.I.D. and can even be adversarial. The theory extends results from conformal prediction to calibrate decisions directly, without requiring the construction of prediction sets. Experiments demonstrate the utility of our approach in robot motion planning around humans and robot manufacturing.
Climate change is increasingly disrupting worldwide agriculture, making global food production less reliable. To tackle the growing challenges in feeding the planet, cutting-edge management strategies, such as precisi...
详细信息
暂无评论