In this paper, we develop a novel mechanism for reducing volatility of residential demand for electricity We construct a reward-based (rebate) mechanism that provides consumers with incentives to shift their demand to...
详细信息
In this paper we examine combined fully distributed payoff and strategy learning (CODIPAS) in a queue-aware access game over a graph. The classical strategic learning analysis relies on vanishing or small learning rat...
详细信息
In this paper we examine combined fully distributed payoff and strategy learning (CODIPAS) in a queue-aware access game over a graph. The classical strategic learning analysis relies on vanishing or small learning rate and uses stochastic approximation tool to derive steady states and invariant sets of the underlying learning process. Here, the stochastic approximation framework does not apply due to non-vanishing learning rate. We propose a direct proof of convergence of the process. Interestingly, the convergence time to one of the global optima is almost surely finite and we explicitly characterize the convergence time. We show that pursuit-based CODIPAS learning is much faster than the classical learning algorithms in games. We extend the methodology to coalitional learning and proves a very fast formation of coalitions for queue-aware access games where the action space is dynamically changing depending on the location of the user over a graph.
Mean-field games have been studied under the assumption of very large number of players. For such large systems, the basic idea consists to approximate large games by a stylized game model with a continuum of players....
详细信息
Mean-field games have been studied under the assumption of very large number of players. For such large systems, the basic idea consists to approximate large games by a stylized game model with a continuum of players. The approach has been shown to be useful in some applications. However, the stylized game model with continuum of decision-makers is rarely observed in practice and the approximation proposed in the asymptotic regime is meaningless for networked systems with few entities. In this paper we propose a mean-field framework that is suitable not only for large systems but also for a small world with few number of entities. The applicability of the proposed framework is illustrated through a dynamic auction with asymmetric valuation distributions.
In this paper we study mean-field type control problems with risk-sensitive performance functionals. We establish a stochastic maximum principle for optimal control of stochastic differential equations of mean-field t...
详细信息
ISBN:
(纸本)9781467360890
In this paper we study mean-field type control problems with risk-sensitive performance functionals. We establish a stochastic maximum principle for optimal control of stochastic differential equations of mean-field type, in which the drift and the diffusion coefficients as well as the performance functional depend not only on the state and the control but also on the mean of the distribution of the state. Our result extends to optimal control problems for non-Markovian dynamics which may be time-inconsistent in the sense that the Bellman optimality principle does not hold. For a general action space a Peng's type stochastic maximum principle is derived, specifying the necessary conditions for optimality. Two examples are carried out to illustrate the proposed risk-sensitive mean-field type under linear stochastic dynamics with exponential quadratic cost function. Explicit characterizations are given for both mean-field free and mean-field risk-sensitive models.
作者:
Hon-Yu MaComputer
Electrical and Mathematical Sciences and Engineering (CEMSE) Division KAUST SRI Center for Uncertainty Quantification in Computational Science and Engineering Thuwal Makkah Province Saudi Arabia
Although many scholars have proposed all kinds of key successful factors (KSFs) for ERP activity to smoothly enhance the implementation of ERP, the KSFs are based too much on concept and protocol cannot help an enterp...
详细信息
Although many scholars have proposed all kinds of key successful factors (KSFs) for ERP activity to smoothly enhance the implementation of ERP, the KSFs are based too much on concept and protocol cannot help an enterprise to achieve this objective. Therefore, this research through qualitative interview method, integrating the KSFs and dynamic capability to set up a model so as to exhibit the kind of dynamic capability needed for each factor. This result not only creates practical value for the KSFs and displays implementable effectiveness for the dynamic capability concept, but also brings a new direction to the academic research.
In this paper, we develop a novel mechanism for reducing volatility of residential demand for electricity. We construct a reward-based (rebate) mechanism that provides consumers with incentives to shift their demand t...
详细信息
In this paper, we develop a novel mechanism for reducing volatility of residential demand for electricity. We construct a reward-based (rebate) mechanism that provides consumers with incentives to shift their demand to off-peak time. In contrast to most other mechanisms proposed in the literature, the key feature of our mechanism is its modest requirements on user preferences, i.e., it does not require exact knowledge of user responsiveness to rewards for shifting their demand from the peak to the off-peak time. Specifically, our mechanism utilizes a probabilistic reward structure for users who shift their demand to the off-peak time, and is robust to incomplete information about user demand and/or risk preferences. We approach the problem from the public good perspective, and demonstrate that the mechanism can be implemented via lottery-like schemes. Our mechanism permits to reduce the distribution losses, and thus improve efficiency of electricity distribution. Finally, the mechanism can be readily incorporated into the emerging demand response schemes (e.g., the time-of-day pricing, and critical peak pricing schemes), and has security and privacy-preserving properties.
One of the fundamental challenges in distributed interactive systems is to design efficient, accurate, and fair solutions. In such systems, a satisfactory solution is an innovative approach that aims to provide all pl...
详细信息
ISBN:
(纸本)9781467357159
One of the fundamental challenges in distributed interactive systems is to design efficient, accurate, and fair solutions. In such systems, a satisfactory solution is an innovative approach that aims to provide all players with a satisfactory payoff anytime and anywhere. In this paper we study fully distributed learning schemes for satisfactory solutions in games with continuous action space. Considering games where the payoff function depends only on own-action and an aggregate term, we show that the complexity of learning systems can be significantly reduced, leading to the so-called mean-field learning. We provide sufficient conditions for convergence to a satisfactory solution and we give explicit convergence time bounds. Then, several acceleration techniques are used in order to improve the convergence rate. We illustrate numerically the proposed mean-field learning schemes for quality-of-service management in communication networks.
暂无评论