The paper introduces an interpretable machinelearning technique SHAP (SHapley Additive exPlanation) to analyze the vehicle yielding behaviors during pedestrian-vehicle interactions at unsignalized intersections. The ...
详细信息
ISBN:
(纸本)9798350399462
The paper introduces an interpretable machinelearning technique SHAP (SHapley Additive exPlanation) to analyze the vehicle yielding behaviors during pedestrian-vehicle interactions at unsignalized intersections. The study first extracts trajectory data from drone videos and then exploits machinelearning methods to construct the yielding classification model. The results indicate that Random Forest (RF) outperforms Support Vector machine (SVM), Gradient Boosting machine (GBM), and eXtreme Gradient Boosting (XGBoost), achieving the best classification performance with an area under the ROC curve (AUC) of 0.934. Finally, the SHAP algorithm is fused with RF to improve the model interpretability. The analysis reveals that the distances between vehicles and pedestrians make the most significant impact on vehicle yielding behavior. Furthermore, it is found that traffic-related variables exhibit non-linear and threshold effects on vehicle yielding.
Most real-world datasets contain label noise, which can negatively affect downstream ML models trained on them. To deal with this problem, one can clean the mislabeled data before training, which is not only time-cons...
详细信息
ISBN:
(数字)9781665408837
ISBN:
(纸本)9781665408837
Most real-world datasets contain label noise, which can negatively affect downstream ML models trained on them. To deal with this problem, one can clean the mislabeled data before training, which is not only time-consuming and expensive but also requires domain expertise. Another approach is to use a noiserobust ML training algorithm. However, existing methods have some prerequisites that may not be practical in many applications (e.g., they are tied to specific downstream model architecture or they are applicable to specific noise distributions). In this paper, we propose a model-agnostic approach for learning with noisy labels of arbitrary distributions. In particular, our approach can work with any gradient descent optimization based machinelearning model and deal with any label noise distribution. We achieve them by proposing two theoretically grounded noise-robust loss functions (for different noise distributions), and we are able to automatically decide which loss function to use based on a novel noise setting detection module. We directly learn the required hyper-parameters in the loss functions via meta-learning technique to minimize the loss on a given small clean validation set, and propose several strategies to improve the efficiency of training. Experiments on multiple datasets with both real-world and injected label noise show that our method performs better than state-of-the-art approaches.
We present a new data generation method to facilitate an automatic machine interpretation of 2D engineering part drawings. While such drawings are a common medium for clients to encode design and manufacturing require...
详细信息
ISBN:
(纸本)9780791886229
We present a new data generation method to facilitate an automatic machine interpretation of 2D engineering part drawings. While such drawings are a common medium for clients to encode design and manufacturing requirements, a lack of computer support to automatically interpret these drawings necessitates part manufacturers to resort to laborious manual approaches for interpretation which, in turn, severely limits processing capacity. Although recent advances in trainable computer vision methods may enable automatic machine interpretation, it remains challenging to apply such methods to engineering drawings due to a lack of labeled training data. As one step toward this challenge, we propose a constrained data synthesis method to generate an arbitrarily large set of synthetic training drawings using only a handful of labeled examples. Our method is based on the randomization of the dimension sets subject to two major constraints to ensure the validity of the synthetic drawings. The effectiveness of our method is demonstrated in the context of a binary component segmentation task with a proposed list of descriptors. An evaluation of several image segmentation methods trained on our synthetic dataset shows that our approach to new data generation can boost the segmentation accuracy and the generalizability of the machinelearning models to unseen drawings.
The evolution of healthcare from its early beginnings to the recent healthcare 4.0 revolution has remarkably improved human life and living standards. The integration of emerging technologies has played a pivotal role...
详细信息
Timely diagnosis is crucial for the successful treatment of a serious medical condition like brain hemorrhage. Deep learning algorithms have shown great promise in applications for medical image analysis, like the ide...
详细信息
Vehicular Ad-hoc Networks (VANETs) are dynamic networks formed among the vehicles travelling on road. They are prone to change in network topologies very quickly due to limited range of randomly moving vehicles. A veh...
详细信息
Federated learning (FL) is a way for machines to learn from data that is kept locally, in order to protect the privacy of clients. This is typically done using local SGD, which helps to improve communication efficienc...
详细信息
Federated learning (FL) is a way for machines to learn from data that is kept locally, in order to protect the privacy of clients. This is typically done using local SGD, which helps to improve communication efficiency. However, such a scheme is currently constrained by slow and unstable convergence due to the variety of data on different clients' devices. In this work, we identify three under-explored phenomena of biased local learning that may explain these challenges caused by local updates in supervised FL. As a remedy, we propose FedBR, a novel unified algorithm that reduces the local learning bias on features and classifiers to tackle these challenges. FedBR has two components. The first component helps to reduce bias in local classifiers by balancing the output of the models. The second component helps to learn local features that are similar to global features, but different from those learned from other data sources. We conducted several experiments to test FedBR and found that it consistently outperforms other SOTA FL methods. Both of its components also individually show performance gains. Our code is available at https: //***/lins-lab/fedbr.
With the development of big data, artificial intelligence, and wearable technology, the data generated by learners in the learning process can be fully recorded and stored. Using these data to study the characteristic...
详细信息
This study proposes a pioneering integrated care model for elderly care service robots that integrates sentiment analysis and knowledge reasoning through a deep learning framework. The primary objective of this resear...
详细信息
This paper investigates the application of machinelearning for credit risk assessment in Multichain Decentralized Finance (DeFi). With DeFi expanding its scope, the need for effective credit risk evaluation becomes p...
详细信息
暂无评论