We formulate the multiperiod, distribution-free perishable inventory problem as a problem of prediction with expert advice and apply an online learning method (the Weak aggregating algorithm) to solve it. We show that...
详细信息
We formulate the multiperiod, distribution-free perishable inventory problem as a problem of prediction with expert advice and apply an online learning method (the Weak aggregating algorithm) to solve it. We show that the asymptotic average performance of this method is as good as that of any time-dependent stocking rule in a given parametric class. (C) 2010 Elsevier B.V. All rights reserved.
In this paper, regularised regression for sequential data is investigated and a new ridge regression algo-rithm is proposed. It uses the aggregating algorithm (AA) to devise an iterative version of ridge regression (I...
详细信息
In this paper, regularised regression for sequential data is investigated and a new ridge regression algo-rithm is proposed. It uses the aggregating algorithm (AA) to devise an iterative version of ridge regression (IRR). This algorithm is called AAIRR. A competitive analysis is conducted to show that the guarantee on the performance of AAIRR is better than that of the known online ridge regression algorithms. Moreover, an empirical study is carried out on real-world datasets to demonstrate the superior performance over those state-of-the-art algorithms.(c) 2022 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license ( http://***/licenses/by/4.0/ )
Methods for generating predictions online and in the form of probability distributions of future outcomes are considered. The difference between the probabilistic forecast (probability distribution) and the numerical ...
详细信息
Methods for generating predictions online and in the form of probability distributions of future outcomes are considered. The difference between the probabilistic forecast (probability distribution) and the numerical outcome is measured using the loss function (scoring rule). In practical statistics, the continuous ranked probability score (CRPS) is often used to estimate the discrepancy between probabilistic forecasts and (quantitative) outcomes. The paper considers the case when several competing methods (experts) give their online predictions as distribution functions. An algorithm is proposed for online aggregation of these distribution functions. The performance bounds of the proposed algorithm are obtained in the form of a comparison of the cumulative loss of the algorithm and the loss of expert hypotheses. Unlike existing estimates, the proposed estimates do not depend on time. The results of numerical experiments illustrating the proposed methods are presented.
We propose online decision strategies for time-dependent sequences of linear programs which use no distributional and minimal geometric assumptions about the data. These strategies are obtained through Vovk's aggr...
详细信息
We propose online decision strategies for time-dependent sequences of linear programs which use no distributional and minimal geometric assumptions about the data. These strategies are obtained through Vovk's aggregating algorithm which combines recommendations from a given strategy pool. We establish an average-performance bound for the resulting solution sequence. (C) 2006 Elsevier B.V. All rights reserved.
The problem concerning the aggregating of the forecasts of specialized expert strategies is examined using the mathematical theory of machine learning. Expert strategies are understood as the algorithms capable of suc...
详细信息
The problem concerning the aggregating of the forecasts of specialized expert strategies is examined using the mathematical theory of machine learning. Expert strategies are understood as the algorithms capable of successively predicting the components of a time series in the online mode. The specialized strategies can refrain from predictions at certain time instants-they make forecasts in compliance with the application area of the specific model of an object region forming their basis. An optimal algorithm whereby the forecasts of such expert strategies are aggregated into the single forecast is proposed. The algorithmic optimality consists in that, on average, its total losses are asymptotically less than those of any active prediction strategies on a set of time instants. The uppermost estimated error of the given mixing of predictions, i.e., the regret of aggregating strategies, is determined. The errors are estimated in the worst situation where no assumptions are made about the mechanism underlying the initial data source. The proposed algorithm is tested using the real information on the commodity circulation of a trading network. The numerical results and estimates of the regret are presented.
The paper applies the method of defensive forecasting, based on the use of game-theoretic supermartingales, to prediction with expert advice. In the traditional setting of a countable number of experts and a finite nu...
详细信息
The paper applies the method of defensive forecasting, based on the use of game-theoretic supermartingales, to prediction with expert advice. In the traditional setting of a countable number of experts and a finite number of outcomes, the Defensive Forecasting algorithm is very close to the well-known aggregating algorithm. Not only the performance guarantees but also the predictions are the same for these two methods of fundamentally different nature. The paper also discusses a new setting where the experts can give advice conditional on the learner's future decision. Both the algorithms can be adapted to the new setting and give the same performance guarantees as in the traditional setting. Finally, an application of defensive forecasting to a setting with several loss functions is outlined. (C) 2010 Elsevier B.V. All rights reserved.
In this paper we apply methods of prediction with expert advice to real-world foreign exchange trading data with the aim of finding effective investment strategies. We start with the framework of the long-short game, ...
详细信息
In this paper we apply methods of prediction with expert advice to real-world foreign exchange trading data with the aim of finding effective investment strategies. We start with the framework of the long-short game, introduced by Vovk and Watkins (1998), and then propose modifications aimed at improving the performance with respect to standard portfolio performance indicators.
The increasing connectivity of data and cyber-physical systems has resulted in a growing number of cyber-attacks. Real-time detection of such attacks, through the identification of anomalous activity, is required so t...
详细信息
The increasing connectivity of data and cyber-physical systems has resulted in a growing number of cyber-attacks. Real-time detection of such attacks, through the identification of anomalous activity, is required so that mitigation and contingent actions can be effectively and rapidly deployed. We propose a new approach for aggregating unsupervised anomaly detection algorithms incorporating feedback when it becomes available. We apply this approach to open-source real datasets and show that both aggregating models, which we call experts, and incorporating feedback significantly improve performance. An important property of the proposed approaches is their theoretical guarantees that they perform close to the best superexpert, which can switch between the best performing experts, in terms of the cumulative average losses.
We propose herein a new portfolio selection method that switches between two distinct asset allocation strategies. An important component is a carefully designed adaptive switching rule, which is based on a machine le...
详细信息
We propose herein a new portfolio selection method that switches between two distinct asset allocation strategies. An important component is a carefully designed adaptive switching rule, which is based on a machine learning algorithm. It is shown that using this adaptive switching strategy, the combined wealth of the new approach is a weighted average of that of the successive constant rebalanced portfolio and that of the 1/N portfolio. In particular, it is asymptotically superior to the 1/N portfolio under mild conditions in the long run. Applications to real data show that both the returns and the Sharpe ratios of the proposed binary switch portfolio are the best among several popular competing methods over varying time horizons and stock pools.
Helmbold and Schapire gave an on-line prediction algorithm that, when given an unpruned decision tree, produces predictions not much worse than the predictions made by the best pruning of the given decision tree. In t...
详细信息
Helmbold and Schapire gave an on-line prediction algorithm that, when given an unpruned decision tree, produces predictions not much worse than the predictions made by the best pruning of the given decision tree. In this paper, we give two new on-line algorithms. The first algorithm is based on the observation that finding the best pruning can be efficiently solved by a dynamic programming in the "batch" setting where all the data to be predicted are given in advance. This algorithm works well for a wide class of Loss functions, whereas the one given by Helmbold and Schapire is only described for the absolute loss function. Moreover, the algorithm given in this paper is so simple and general that it could be applied to many other on-line optimization problems solved by dynamic programming. We also explore the second algorithm that is competitive not only with the best pruning but also with the best prediction values which are associated with nodes in the decision tree. In this setting, a greatly simplified algorithm is given for the absolute loss function. It can be easily generalized to the case where, instead of using decision trees, data are classified in some arbitrarily fixed manner. (C) 2001 Elsevier Science B.V. All rights reserved.
暂无评论