An important challenge in robust machinelearning is when training data is provided by strategic sources who may intentionally report erroneous data for their own benefit (Caro and Gallien, 2010;Caro et al., 2010). A ...
详细信息
An important challenge in robust machinelearning is when training data is provided by strategic sources who may intentionally report erroneous data for their own benefit (Caro and Gallien, 2010;Caro et al., 2010). A line of work at the intersection of machinelearning and mechanism design aims to deter strategic agents from reporting erroneous training data by designing learning algorithms that are strategyproof (Dekel et al., 2010;Perote and Perote-Pena, 2004;Chen et al., 2018;Cummings et al., 2015;Meir and Rosenschein, 2011;Meir et al., 2008, 2010, 2012;Hardt et al., 2016;Ghalme et al., 2021;Dong et al., 2018;Ahmadi et al., 2021). Strategyproofness is a strong and desirable property, but it comes at a cost in the approximation ratio of even simple risk minimization problems. A recent line of work on mechanism design with advice has shown that side information can be leveraged to overcome worst-case bounds in mechanism design. Strategyproof mechanisms that achieve an improved approximation ratio when the advice is accurate (consistency) and an acceptable approximation ratio when the advice is inaccurate (robustness) have been designed for problems such as strategic facility location (Agrawal et al., 2022;Xu and Lu, 2022;Balkanski et al., 2024a), auction design (Lu et al., 2024;Caragiannis and Kalantzis, 2024;Balkanski et al., 2024b), strategic scheduling (Balkanski et al., 2023), strategic assignment (Colini-Baldeschi et al., 2024), and metric distortion (Berger et al., 2024). In this paper, we study strategyproof learning in two settings: regression and classification;we provide the first non-trivial consistency-robustness tradeoffs for both. In strategic learning, each agent i ∈ {1, ..., n} reports a set Si = {(xi,j, yi,j)}j of labeled data points to the learner. The points xi,j are public information, but the labels yi,j are private information that agent i can potentially misreport. The goal of the mechanism is to learn a function f from a function class F tha
Timely and long-term predictions of particle concentration changes in pipes are of great significance. The particle concentration outlier data across the pipe cross-section is obtained through numerical simulation and...
详细信息
Timely and long-term predictions of particle concentration changes in pipes are of great significance. The particle concentration outlier data across the pipe cross-section is obtained through numerical simulation and smooth interpolation method. The data is then decomposed using the Ensemble Empirical Mode Decomposition (EEMD), Variational Mode Decomposition (VMD), and VMD-EEMD methods. Finally, the Long Short-Term Memory (LSTM) model is employed to perform multi-step predictions on both the undifferentiated data and the data decomposed by the various models, with a step size of one second. The EEMD-LSTM, LSTM, and VMD-EEMD-LSTM models exhibited the highest accuracy in predicting the first, second, and third steps, respectively. The mean absolute errors (MAE) are 0.00014, 0.00040, and 0.00081;the root mean square errors (RMSE) are 0.00018, 0.00049, and 0.00103;the mean absolute percentage errors (MAPE) are 0.10, 0.27, and 0.56;and the coefficients of determination (R2) are 0.9978, 0.9844, and 0.9316. When selecting this outcome as the ensemble model, it demonstrates greater accuracy compared to other predictive models. This model is applicable to other cross-sections and exhibits values comparable to the MAE, RMSE, MAPE, and R2 of the original cross-section. Additional research is conducted on the formation mechanism of plug flow under various operating conditions using numerical simulations. The lubricating effect of small particles, along with the significant momentum exchange between numerous particles and fluids, can effectively mitigate the plug effect of the pipe flow.
The proceedings contain 39 papers. The topics discussed include: evaluation of machinelearning to early detection of highly cited papers;towards using deep reinforcement learning for better COVID-19 vaccine distribut...
ISBN:
(纸本)9781665410144
The proceedings contain 39 papers. The topics discussed include: evaluation of machinelearning to early detection of highly cited papers;towards using deep reinforcement learning for better COVID-19 vaccine distribution strategies;an investigation of forecasting Tadawul all share index (TASI) using machinelearning;intelligent deep detection method for malicious tampering of cancer imagery;an empirical analysis of health-related campaigns on twitter Arabic hashtags;the accuracy performance of semantic segmentation network with different backbones;a comprehensive evaluation of statistical, machinelearning and deep learning models for time series prediction;depression detection in Arabic using speech language recognition;a deep learning framework for temperature forecasting;improving relevance in a recommendation system to suggest charities without explicit user profiles using dual-autoencoders;and the impact of feature selection on different machinelearning models for breast cancer classification.
This paper proposed an approach to estimate disc cutter wear utilizing a combination of multiple operational parameters and vibration data collected during shield tunneling operations. The incorporation of vibration s...
详细信息
This paper proposed an approach to estimate disc cutter wear utilizing a combination of multiple operational parameters and vibration data collected during shield tunneling operations. The incorporation of vibration signals, notably those originating from acceleration sensors mounted on the back plate of the soil chamber, has markedly enhanced the accuracy of the model. Time-frequency domain features were extracted through analysis methods such as Fast Fourier Transform (FFT), Short-Time Fourier Transform (STFT), and Continuous Wavelet Transform (CWT). A predictive model utilizing vibration and shield operation parameters was developed using the XGBoost algorithm, and a deep GoogLeNet Convolutional Neural Network (CNN) was trained on time-frequency graphs from the CWT. In addition, this study also investigated the impact of signal duration on wavelet image information and model accuracy. In the Huang-Shang Intercity Railway Project, the approach effectively assessed disc cutter wear during tunneling operations and dynamically optimized the operational parameters of the shield tunnel machine through predictive analysis.
The maritime industry plays a crucial role in global trade and transport, necessitating effective collision avoidance, navigation security, and reduction in human errors and costs. This paper presents an automated dat...
详细信息
Histopathological analysis of biopsy sections is crucial for the detection of cancer and the distinction between different tumor subtypes. To this end, pathologists identify certain key regions of the biopsy from whic...
详细信息
ISBN:
(纸本)9783031777301;9783031777318
Histopathological analysis of biopsy sections is crucial for the detection of cancer and the distinction between different tumor subtypes. To this end, pathologists identify certain key regions of the biopsy from which a diagnosis is derived. The classification of whole slide images (WSIs) can be addressed as a multiple instance learning (MIL) problem where only slide-level labels are available. In order to model the relevance scores of the different WSI patches, attention mechanisms are implemented within the MIL framework. However, excessive flexibility in the attention mechanisms for computing attention scores to patches may lead to nearly uniform attention distributions, potentially deteriorating the model's performance. In this paper, we introduce an unsupervised auxiliary loss function to recalibrate the attention mechanism enhancing emphasis on crucial patches and downscaling the influence of less relevant ones. The proposed MIL framework has been evaluated on invasive breast carcinoma (TCGA-BRCA) and renal cell carcinoma (TCGA-RCC) subtyping. The results obtained show that attention regularization not only improves the predictive capacity of the model but also significantly increases its interpretability by identifying regions and patterns of high diagnostic value.
Crime prediction has been challenging, partly due to the difficulty in selecting predictive variables. Previous studies theoretically and empirically confirmed the relationship between many variables and crime. This s...
详细信息
Early identification of plant diseases plays an important role in reducing the financial losses for farmers and enhancing the productivity and quality of crops. The present work proposes five machinelearning (ML) alg...
详细信息
machinelearning and data Mining imply notable privacy vulnerabilities since they have the potential to expose confidential details about people or collectives that have contributed to the data. This paper proposes a ...
详细信息
In an increasingly globalized economy, the communication of corporate image plays a pivotal role in shaping perceptions and fostering relationships with international stakeholders. This study explores the design and i...
详细信息
暂无评论