We study the model-based undiscounted reinforcement learning for partially observable Markov decision processes (POMDPs). We propose a learning algorithm for this problem, building on spectral method-of-moments estima...
详细信息
We propose a new regression algorithm that learns from a set of input-output pairs. Our algorithm is designed for populations where the relation between the input variables and the output variable exhibits a heterogen...
详细信息
This article aims to predict Moroccan stock trends based on financial news articles. Data are collected from ***. All news collected for a single stock are lower to fit any machine learning algorithm, thus they are al...
详细信息
We provide an information-theoretic analysis of the generalization ability of Gibbs-based transfer learning algorithms by focusing on two popular transfer learning approaches, α-weighted-ERM and two-stage-ERM. Our ke...
详细信息
Our brains extract durable, generalizable knowledge from transient experiences of the world. Artificial neural networks come nowhere close to this ability. When tasked with learning to classify objects by training on ...
详细信息
The information ratio offers an approach to assessing the efficacy with which an agent balances between exploration and exploitation. Originally, this was defined to be the ratio between squared expected regret and th...
详细信息
Spurious credit card transactions are a significant source of financial losses and urge the development of accurate fraud detection algorithms. In this paper, we use machine learning strategies for such an aim. First,...
详细信息
The equivalence of realizable and agnostic learnability is a fundamental phenomenon in learning theory. With variants ranging from classical settings like PAC learning and regression to recent trends such as adversari...
详细信息
Catastrophic forgetting in neural networks during incremental learning remains a challenging problem. Previous research investigated catastrophic forgetting in fully connected networks, with some earlier work explorin...
详细信息
暂无评论