Relative overgeneralization (RO) occurs in cooperative multi-agent learning tasks when agents converge towards a suboptimal joint policy due to overfitting to suboptimal behaviors of other agents. No methods have been proposed for addressing RO in multi-agent policy gradient (MAPG) methods although these methods produce state-of-the-art results. To address this gap, we propose a general, yet simple, framework to enable optimistic updates in MAPG methods that alleviate the RO problem. Our approach involves clipping the advantage to eliminate negative values, thereby facilitating optimistic updates in MAPG. The optimism prevents individual agents from quickly converging to a local optimum. Additionally, we provide a formal analysis to show that the proposed method retains optimality at a fixed point. In extensive evaluations on a diverse set of tasks including the Multi-agent MuJoCo and Overcooked benchmarks, our method outperforms strong baselines on 13 out of 19 tested tasks and matches the performance on the rest. Copyright 2024 by the author(s)
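The advantage clipping described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the plain-list inputs, and the simple log-probability surrogate are assumptions made for illustration, and only the idea of zeroing out negative advantages comes from the abstract.

```python
def optimistic_advantages(advantages):
    """Clip each advantage at zero so negative values are eliminated.

    Hypothetical helper: the abstract only states that the advantage is
    clipped to remove negative values; the name is illustrative.
    """
    return [max(a, 0.0) for a in advantages]


def policy_gradient_surrogate(log_probs, advantages):
    """Mean negative log-probability weighted by the clipped advantage.

    Assumption: a vanilla policy-gradient surrogate loss. Samples whose
    advantage was negative contribute nothing, so the update never pushes
    probability away from an action only because teammates explored badly.
    """
    clipped = optimistic_advantages(advantages)
    total = sum(lp * a for lp, a in zip(log_probs, clipped))
    return -total / len(log_probs)


if __name__ == "__main__":
    log_probs = [-0.5, -1.0]    # log pi(a|s) for two sampled actions
    advantages = [2.0, -3.0]    # the second sample has a negative advantage
    print(optimistic_advantages(advantages))
    print(policy_gradient_surrogate(log_probs, advantages))
```

In this sketch the second sample is ignored entirely, which is one simple way to read "optimistic update"; a practical MAPG implementation would apply the same clipping inside its own per-agent loss.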
This paper presents a novel data-driven control algorithm that coordinates a large aggregation of heterogeneous thermostatically controlled loads (TCLs) with unknown temperature dynamics and disturbance distributions ...
Federated Learning (FL) has emerged as a privacy-preserving machine learning approach, enabling collaborative model training across devices while maintaining the decentralization of raw data. This paper investigates t...
This study scrutinizes five years of Sarajevo's Air Quality Index (AQI) data using diverse machine learning models - Fourier autoregressive integrated moving average (Fourier ARIMA), Prophet, and Long short-term m...
In this paper, we analyze collaborative inference in a mobile edge computing (MEC) network aided by a reconfigurable intelligent surface (RIS). In particular, we consider multiple user equipments (UEs) with collaborat...
This paper presents a square coaxial transmission line that is partially filled with a low-cost 3-D printed insulator. The proposed coaxial line is composed of two metal conductors and a 3-D printed insulator. To reduce ...
Machine learning over graphs has recently attracted growing attention due to its ability to analyze and learn complex relations within critical interconnected systems. However, the disparate impact that is amplified b...
Modeling and control of wave energy conversion (WEC) systems for maximum power extraction is challenging due to complex multiphysics that include fluids, mechanics, and machine drives. To uncover an intuitive model th...
Human intelligence tasks (HITs), such as labeling images for machine learning, are widely utilized for crowdsourcing human *** crowdsourcing platforms face challenges of a single point of failure and a lack of service *** blockchain-based crowdsourcing approaches overlook the low scalability problem of permissionless blockchains or inconveniently rely on existing ground-truth data as the root of trust in evaluating the quality of workers' *** We propose a blockchain-based crowdsourcing scheme for ensuring dual fairness (i.e., preventing false reporting and free riding) and improving on-chain efficiency concerning on-chain storage and smart contract computation. The proposed scheme does not rely on trusted authorities but rather depends on a public blockchain to guarantee dual fairness. An efficient and publicly verifiable truth discovery scheme is designed based on majority voting and cryptographic *** The truth discovery scheme aims at inferring ground truth from workers' *** The ground truth is further utilized to estimate the quality of workers' ***, a novel blockchain-based protocol is designed to further reduce on-chain costs while ensuring *** The scheme has O(n) complexity for both on-chain storage and smart contract computation, regardless of the number of questions, where n denotes the number of *** security analysis is provided, and extensive experiments are conducted to evaluate its effectiveness and performance.
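The majority-voting step of the truth discovery idea in this abstract can be sketched off-chain as follows. This is only an illustration under stated assumptions: the function names, the dictionary input format, and the tie-breaking rule are hypothetical, and the cryptographic and blockchain parts of the scheme are not modeled at all.

```python
from collections import Counter


def majority_truth(answers):
    """Infer one answer per question by majority vote.

    answers maps a worker id to that worker's list of answers, one per
    question (an assumed input format). Ties go to the answer seen first,
    which is an arbitrary choice made for this sketch.
    """
    n_questions = len(next(iter(answers.values())))
    truth = []
    for q in range(n_questions):
        votes = Counter(worker_answers[q] for worker_answers in answers.values())
        truth.append(votes.most_common(1)[0][0])
    return truth


def worker_quality(answers, truth):
    """Estimate quality as the fraction of answers matching the inferred truth."""
    return {
        worker: sum(a == t for a, t in zip(worker_answers, truth)) / len(truth)
        for worker, worker_answers in answers.items()
    }


if __name__ == "__main__":
    answers = {
        "w1": ["cat", "dog", "cat"],
        "w2": ["cat", "dog", "dog"],
        "w3": ["dog", "dog", "cat"],
    }
    truth = majority_truth(answers)
    print(truth)
    print(worker_quality(answers, truth))
```

The inferred truth replaces pre-existing ground-truth data as the reference for scoring workers, which matches the abstract's stated goal; how the votes are committed and verified on-chain is left out because the abstract does not give those details.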
The development of software systems is preceded by an important first phase, requirements elicitation, wherein developers establish the intended functionality of a system to be developed in a series of interviews with...