In this paper, a novel dual iterative q-learning algorithm is developed to solve the optimal battery management and control problems in smart residential environments. The main idea is to use adaptive dynamic programm...
详细信息
ISBN:
(纸本)9781479945511
In this paper, a novel dual iterative q-learning algorithm is developed to solve the optimal battery management and control problems in smart residential environments. The main idea is to use adaptive dynamic programming (ADP) technique to obtain the optimal battery management and control scheme iteratively for residential energy systems. In the developed dual iterative q-learning algorithm, two iterations, including external and internal iterations, are introduced, where internal iteration minimizes the total cost of power loads in each period and the external iteration makes the iterativeq function converge to the optimum. For the first time, the convergence property of iterativeq-learning method is proven to guarantee the convergence property of the iterativeq function. Finally, numerical results are given to illustrate the performance of the developed algorithm.
暂无评论