作者:
Lamond, BFUniv Laval
Fac Sci Adm Dept Operat & Syst Decis Quebec City PQ G1K 7P4 Canada
We propose a method for optimizing a single hydro-electric reservoir using a piecewise polynomial approximation of the future value functions. Unlike previous methods based on splines, we avoid discretizing the inflow...
详细信息
We propose a method for optimizing a single hydro-electric reservoir using a piecewise polynomial approximation of the future value functions. Unlike previous methods based on splines, we avoid discretizing the inflow distribution. Instead, we carry out the expectation step of dynamicprogramming using an exact, easy-to-evaluate formula for the integral of a piecewise polynomial function. We then apply our method to solving a model which assumes a piecewise linear reward function of the energy produced, and takes into account the turbine head effects.
We study a US OEM that outsources its production to two contract manufacturers, a local manufacturer (e.g. in the US or Mexico) and a foreign manufacturer (e.g. in China). The local manufacturer is relatively reliable...
详细信息
We study a US OEM that outsources its production to two contract manufacturers, a local manufacturer (e.g. in the US or Mexico) and a foreign manufacturer (e.g. in China). The local manufacturer is relatively reliable, but low margin. The foreign manufacturer offers high margin, but is subject to disruption risks. Both manufacturers experience some level of operational uncertainties, and the operational risks can be positively or negatively correlated. Disruption risks are modelled as a Poisson jump process at a random magnitude, and operational risks are modelled as correlated stochastic diffusion processes. We develop a stochastic dynamic programming formulation to characterise the OEM's optimal capital allocation decision to contract manufacturers, and provide the necessary and sufficient conditions for each optimal decision. The objective of our study is to investigate how dual sourcing balances the risks and opportunities, when the OEM bears disruption risks and correlated operational risks. We find that the two manufacturers can be substitutes or complements to each other. Risk of disruption renders the unreliable foreign manufacturer less attractive, but has a moderating effect on the allocation to the local manufacturer. We also provide managerial implications of our study.
This paper studies a single-product, dynamic, non-stationary, stochastic inventory problem with capacity commitment, in which a buyer purchases a fixed capacity from a supplier at the beginning of a planning horizon a...
详细信息
This paper studies a single-product, dynamic, non-stationary, stochastic inventory problem with capacity commitment, in which a buyer purchases a fixed capacity from a supplier at the beginning of a planning horizon and the buyer's total cumulative order quantity over the planning horizon is constrained with the capacity. The objective of the buyer is to choose the capacity at the beginning of the planning horizon and the order quantity in each period to minimize the expected total cost over the planning horizon. We characterize the structure of the minimum sum of the expected ordering, storage and shortage costs in a period and thereafter and the optimal ordering policy for a given capacity. Based on the structure, we identify conditions under which a myopic ordering policy is optimal and derive an equation for the optimal capacity commitment. We then use the optimal capacity and the myopic ordering policy to evaluate the effect of the various parameters on the minimum expected total cost over the planning horizon. (C) 2008 Elsevier B.V. All rights reserved.
This paper proposes a deep learning approach for solving optimal stopping problems and high-dimensional American-style options pricing problems. Through state-space partition, the method does not require recalculation...
详细信息
This paper proposes a deep learning approach for solving optimal stopping problems and high-dimensional American-style options pricing problems. Through state-space partition, the method does not require recalculation of the structure of networks when the price of the asset changes, which makes tracking valuation more efficient. This paper also offers theoretical proof for the existence of a deep learning network that can determine the optimal stopping time via state-space partition. We present convergence proofs for the estimators and also test the method on Bermuda max-call options as examples.
In standard stochastic dynamic programming, the transition probability distributions of the underlying Markov Chains are assumed to be known with certainty. We focus on the case where the transition probabilities or o...
详细信息
In standard stochastic dynamic programming, the transition probability distributions of the underlying Markov Chains are assumed to be known with certainty. We focus on the case where the transition probabilities or other input data are uncertain. Robust dynamicprogramming addresses this problem by defining a min-max game between Nature and the controller. Considering examples from inventory and queueing control, we examine the structure of the optimal policy in such robust dynamic programs when event probabilities are uncertain. We identify the cases where certain monotonicity results still hold and the form of the optimal policy is determined by a threshold. We also investigate the marginal value of time and the case of uncertain rewards.(c) 2017 Wiley Periodicals, Inc. Naval Research Logistics 65: 699-716, 2018
This paper investigates optimal decision-making for traffic management under demand and supply uncertainties by stochastic dynamic programming. Traffic flow dynamics under demand and supply uncertainties is described ...
详细信息
This paper investigates optimal decision-making for traffic management under demand and supply uncertainties by stochastic dynamic programming. Traffic flow dynamics under demand and supply uncertainties is described by a simplified version of the stochastic cell transmission model. Based on this model, the optimal traffic management problem is analysed wherein the existence of solution is guaranteed by verifying the well-posed condition. An analytical optimal control law is derived in terms of a set of coupled generalised recursive Riccati equations. As optimal control laws may be fragile with respect to model misspecification, a robust (optimal) decision-making law that aims to act robust with respect to the parameter misspecification in the traffic flow model (which can be originated from model calibration), and to attenuate the effect of disturbances in freeway networks (wherein demand uncertainty is usually regarded as a kind of disturbance) is proposed. Conventionally, network uncertainties have been considered to induce negative effects on traffic management in transportation literature. In contrast, the proposed methodology outlines an interesting issue that is to make benefit (or trade-off) from the inherent network uncertainties. Finally, some practical issues in traffic management that can be addressed by extending the current framework are briefly discussed.
stochastic dynamic programming (SDP) can improve the management of a multipurpose water reservoir by generating management policies which are efficient with respect to the management objectives (flood protection, wate...
详细信息
stochastic dynamic programming (SDP) can improve the management of a multipurpose water reservoir by generating management policies which are efficient with respect to the management objectives (flood protection, water supply for irrigation, hydropower generation, etc.). The improvement in efficiency is even more remarkable for networks of reservoirs. Unfortunately, SDP is affected by the well-known 'curse of dimensionality', i.e. computational time and computer memory occupation increase exponentially with the dimension of the problem (number of reservoirs), and the problem rapidly becomes intractable. Neuro-dynamicprogramming (NDP) can sensibly mitigate this limitation by approximating Bellman functions with artificial neural networks (ANNs). In this paper the application of NDP to the problem of the management of reservoir networks is introduced. Results obtained in a real-world case study are finally presented. (c) 2006 Elsevier Ltd. All rights reserved.
We consider a production planning problem for a dynamic jobshop producing a number of products and subject to breakdown and repair of machines. The machine capacities are assumed to be finite-state Markov chains. As t...
详细信息
We consider a production planning problem for a dynamic jobshop producing a number of products and subject to breakdown and repair of machines. The machine capacities are assumed to be finite-state Markov chains. As the rates of change of the machine states approach infinity, an asymptotic analysis of this stochastic manufacturing systems is given. The analysis results in a limiting problem in which the stochastic machine availability is replaced by its equilibrium mean availability. The long-run average cost for the original problem is shown to converge to the long-run average cost of the limiting problem. The convergence rate of the long-run average cost for the original problem to that of the limiting problem together with an error estimate for the constructed asymptotic optimal control is established.
Risk minimization in stochastic systems is a challenging problem and this paper compares results of three different techniques in reservoir management. Two-stage stochasticprogramming (TSP) for maximizing expected be...
详细信息
Risk minimization in stochastic systems is a challenging problem and this paper compares results of three different techniques in reservoir management. Two-stage stochasticprogramming (TSP) for maximizing expected benefits is a well-known method, Fletcher and Ponnambalam (FP) and Q-Learning are the two new methods in reservoir management, all of which can include risk minimization in the objective function. The water price uncertainties caused by deregulated markets are considered in addition to random inflows in optimization and simulation is used to compare the results and to develop a risk versus return trade-off curve. One of the contributions of this paper is to consider risk in the Q-Learning algorithm.
The optimal scheduling of hydrothermal systems requires the representation of uncertainties in future streamflows to devise a cost-effective operations policy. stochastic optimization has been widely used as a powerfu...
详细信息
The optimal scheduling of hydrothermal systems requires the representation of uncertainties in future streamflows to devise a cost-effective operations policy. stochastic optimization has been widely used as a powerful tool to solve this problem but results will necessarily depend on the stochastic model used to generate future scenarios for streamflows. Periodic autoregressive (PAR) models have been widely used in this task. However, its parameters are typically unknown and must be estimated from historical data, incorporating a natural estimation error. Furthermore, the model is just a linear approximation of the real stochastic process. The consequence is that the operator will be uncertain about the correct linear model that should be used at each period. The objective of this work is to assess the impacts of incorporating the uncertainty of the parameters of the PAR models into a stochastic hydrothermal scheduling model. The proposed methodology is tested with case studies based on data from the Brazilian hydroelectric system. It is shown that when the uncertainty of the parameters is ignored, the policies given by the stochastic optimization tend to be too optimistic.
暂无评论