We present a new algorithm based on posterior sampling for learning in Constrained Markov Decision Processes (CMDP) in the infinite-horizon undiscounted setting. The algorithm achieves near-optimal regret bounds while...
详细信息
We present a new algorithm based on posterior sampling for learning in Constrained Markov Decision Processes (CMDP) in the infinite-horizon undiscounted setting. The algorithm achieves near-optimal regret bounds while being advantageous empirically compared to the existing algorithms. Our main theoretical result is a Bayesian regret bound for each cost component of Õ(DS√AT) for any communicating CMDP with S states, A actions, and diameter D. This regret bound matches the lower bound in order of time horizon T and is the best-known regret bound for communicating CMDPs achieved by a computationally tractable algorithm. Empirical results show that our posterior sampling algorithm outperforms the existing algorithms for constrained reinforcement learning. Copyright 2024 by the author(s)
Predicting reservoir water levels helps manage droughts and floods. Predicting reservoir water level is complex because it depends on factors such as climate parameters and human intervention. Therefore, predicting wa...
详细信息
Predicting reservoir water levels helps manage droughts and floods. Predicting reservoir water level is complex because it depends on factors such as climate parameters and human intervention. Therefore, predicting water level needs robust models. Our study introduces a new model for predicting reservoir water levels. An extreme learning machine, the multi-kernel least square support vector machine model (MKLSSVM), is developed to predict the water level of a reservoir in Malaysia. The study also introduces a novel optimization algorithm for selecting inputs. While the LSSVM model may not capture nonlinear components of the time series data, the extreme learning machine (ELM) model-MKLSSVM model can capture nonlinear and linear components of the time series data. A coati optimization algorithm is introduced to select input scenarios. The MKLSSVM model takes advantage of multiple kernel functions. The extreme learning machine model-multi-kernel least square support vector machine model also takes the benefit of both the ELM model and MKLSSVM model models to predict water levels. This paper's novelty includes introducing a new method for selecting inputs and developing a new model for predicting water levels. For water level prediction, lagged rainfall and water level are used. In this study, we used extreme learning machine (ELM)-multi-kernel least square support vector machine (ELM-MKLSSVM), extreme learning machine (ELM)-LSSVM-polynomial kernel function (PKF) (ELM-LSSVM-PKF), ELM-LSSVM-radial basis kernel function (RBF) (ELM-LSSVM-RBF), ELM-LSSVM-Linear Kernel function (LKF), ELM, and MKLSSVM models to predict water level. The testing means absolute of the same models was 0.710, 0.742, 0.832, 0.871, 0.912, and 0.919, respectively. The Nash-Sutcliff efficiency (NSE) testing of the same models was 0.97, 0.94, 0.90, 0.87, 0.83, and 0.18, respectively. The ELM-MKLSSVM model is a robust tool for predicting reservoir water levels.
Incomplete data poses challenges in accurately assessing structural health and detecting damage. It limits the ability to capture the complete behavior and response of the structure, which may hinder the identificatio...
详细信息
Incomplete data poses challenges in accurately assessing structural health and detecting damage. It limits the ability to capture the complete behavior and response of the structure, which may hinder the identification and localization of potential damage or anomalies. Addressing the issue of incomplete data requires developing strategies and algorithms that can effectively handle missing or limited measurements. Using incomplete and noisy measurements, we propose an optimization-based damage detection method for laminated composite plates with closely-spaced eigenvalues. The proposed method consists of two stages. In the first stage, the most probable defective elements are identified by utilizing condensed mode shapes as incomplete noisy inputs for modal residual vectors. This approach significantly reduces the computational effort for damage estimation. The second stage introduces an objective function based on incomplete and noisy Condensed Frequency Response Functions (CFRFs). To optimize the damage quantification, the Improved Particle Swarm optimization (IPSO) algorithm is employed to minimize errors in the proposed objective function based on CFRFs of damaged and intact laminates. The proposed method is demonstrated on laminated composite plates with different lamination schemes, ply orientations, and multiple damaged elements in different damage scenarios. By evaluating the method on numerical results and comparing it with previous studies, its superiority is demonstrated. Furthermore, the proposed method exhibits robustness to changes in mass distribution in the system investigated by retrofitting extra masses to the plate structures that lead to worsening closely-spaced eigenvalues.
Robust optimization (RO) is one of the key paradigms for solving optimization problems affected by uncertainty. Two principal approaches for RO, the robust counterpart method and the adversarial approach, potentially ...
详细信息
In this paper we propose new approaches to estimating large dimensional monotone index models. This class of models has been popular in the applied and theoretical econometrics literatures as it includes discrete choi...
详细信息
For evolutionary multi-objective optimization algorithms (EMOAs), an external archive can be utilized for saving good solutions found throughout the evolutionary process. Recent studies showed that a solution set sele...
详细信息
Detecting infeasible solutions is an important challenge in black-box Multiobjective Bilevel optimization (MOBO) due to a lower-level (LL) optimization problem used as a constraint (along with equality and inequality ...
详细信息
In response to the problems of low search efficiency and a large number of redundant points traversed by the traditional A∗ algorithm in the path planning process of AUV, this article proposes an improved A∗ algorithm...
详细信息
In this paper, a rational cooperative foraging based grey wolf optimizer (CFGWO) is proposed to enhance the performance of conventional grey wolf optimizer (GWO). It inherits the communicating behaviou...
详细信息
Multiobjective multitasking evolutionary algorithms have shown promising performance for tackling a set of multiobjective optimization tasks simultaneously, as the optimization experience gained within one task can be...
详细信息
暂无评论