We consider online learning in finite stochastic Markovian environments where in each time step a new reward function is chosen by an oblivious adversary. The goal of the learning agent is to compete with the best sta...
We consider online learning in finite stochastic Markovian environments where in each time step a new reward function is chosen by an oblivious adversary. The goal of the learning agent is to compete with the best stationary policy in terms of the total reward received. In each time step the agent observes the current state and the reward associated with the last transition, however, the agent does not observe the rewards associated with other state-action pairs. The agent is assumed to know the transition probabilities. The state of the art result for this setting is a no-regret algorithm. In this paper we propose a new learning algorithm and, assuming that stationary policies mix uniformly fast, we show that after T time steps, the expected regret of the new algorithm is O (T2/3 (ln T)1/3), giving the first rigorously proved regret bound for the problem.
Nearest Neighbor Classifier is one of the most classical lazy learning schemes. The basic nearest neighbor classifiers suffer from the common problem that the instances used to train the classifier are all stored indi...
详细信息
AutoDock is a widely used automated protein docking program in structure-based drug-design. Different search algorithms such as simulated annealing, traditional genetic algorithm (GA) and Lamarckian genetic algorithm ...
详细信息
AutoDock is a widely used automated protein docking program in structure-based drug-design. Different search algorithms such as simulated annealing, traditional genetic algorithm (GA) and Lamarckian genetic algorithm (LGA) are implemented in AutoDock. However, the docking performance of these algorithms is still limited by the local optima issue of simulated annealing or the premature convergence issue typical in traditional evolutionary algorithms (EA). Due to the stochastic nature of these search algorithms, users usually need to run multiple times to get reasonable docking results, which is time-consuming. We have developed a new docking program AutoDockX by applying a sustainable GA, Age-Layered Population Structure (ALPS) to the protein docking problem. We tested the docking performance over three different proteins (pr, cox and hsp90) with more than 20 candidate ligands for each protein. Our experiments showed that the sustainable GA based AutodockX achieved significantly better docking performance in terms of running time and robustness than all the existing search algorithms implemented in the latest version of AutoDock. AutodockX thus has unique advantages in large-scale virtual screening.
For the virtues such as simplicity, high generalization capability, and few training cost, the K-Nearest-Neighbor (KNN) classifier is widely used in pattern recognition and machinelearning. However, the computation c...
详细信息
For the virtues such as simplicity, high generalization capability, and few training cost, the K-Nearest-Neighbor (KNN) classifier is widely used in pattern recognition and machinelearning. However, the computation complexity of KNN classifier will become higher when dealing with large data sets classification problem. In consequence, its efficiency will be decreased greatly. This paper proposes a general two-stage training set condensing algorithm for general KNN classifier. First, we identify the noise data points and remove them from the original training set. Second, a general condensed nearest neighbor rule based on the so-called Nearest Unlike Neighbor (NUN) is presented to further eliminate the redundant samples in training set. In order to verify the performance of the proposed method, some numerical experiments are conducted on several UCI benchmark databases.
The pool-based active learning intends to collect the samples into the pool firstly, and selects the best informative sample from it which has no label to add into the training sets for updating the classifier secondl...
详细信息
Lots of researchers have been studying on how to construct radial basis function neural networks. To determine the number and location of hidden neurons, a recursive procedure is adopted with a new evaluation criterio...
详细信息
Lots of researchers have been studying on how to construct radial basis function neural networks. To determine the number and location of hidden neurons, a recursive procedure is adopted with a new evaluation criterion based on localized generalization error model (L-GEM). We derive a new sensitivity expression for Gaussian radial basis function neural network based on L-GEM, and get a new localized generalization error bound. The RBF that yields the minimal localized generalization error bound is selected. We compare our approach with minimization of cross validation, and minimization of training mean square error (MSE) methods. The experimental results show that our approach performs much better than the other two methods with reasonable number of centers.
The sensitivity analysis can help to construct a tightly neural network. There are several methods to define the sensitivity of input and weight for perturbations to the trained neural network. This paper proposed a s...
详细信息
The sensitivity analysis can help to construct a tightly neural network. There are several methods to define the sensitivity of input and weight for perturbations to the trained neural network. This paper proposed a sensitivity definition based on elastic function. This definition considers the measure of the variable of the reference network. The sensitivity calculating formulae are deduced for perceptron and MLP.
In this paper we propose a Bayesian model for multi-task feature selection. This model is based on a generalized spike and slab sparse prior distribution that enforces the selection of a common subset of features acro...
详细信息
data mining tools able to semantically interpret textual or linguistic data are acquiring a growing importance. Moreover, the development of large ontologies for general and specific domains provides new tools to incl...
详细信息
It is estimated that over 8 million cell phones are lost or stolen each year [7]; often the loss of a cell phone means the loss of personal data, time and enormous aggravation. In this paper we present machine-learnin...
详细信息
It is estimated that over 8 million cell phones are lost or stolen each year [7]; often the loss of a cell phone means the loss of personal data, time and enormous aggravation. In this paper we present machine-learning based algorithms by which a cell phone can discern that it may be lost, and take steps to enhance its chances of being successfully recovered. We use data collected from the Reality Mining project [10] to create a suite of realistic test cases that model lost cell phone behavior. On these data sets our best algorithms can identify cases of a lost mobile device, based on its behavior over the previous 3 hours, with close to 100% accuracy. In addition, the algorithm generates false positive identifications with probability less than 3%; for individuals with relatively predictable lifestyles the False Positive Rate is substantially less. We also use the Reality Mining data to construct a set of test cases that model the behavior of a stolen phone, and show that similar algorithmic techniques give reasonable results in this setting as well.
暂无评论