Multi-task learning (MTL) has succeeded in various industrial applications by utilizing common knowledge among joint training tasks to enhance the generalization of MTL models, resulting in improved performance across...
详细信息
In this paper, we propose a novel application of syntax-guided synthesis to find symbolic representations of a model’s decision-making process, designed for easy comprehension and validation by humans. Our approach t...
详细信息
Ocean and climate research benefits from global ocean observation initiatives such as Argo, GLOSS, and EMSO. The Argo network, dedicated to ocean profiling, generates a vast volume of observatory data. However, data q...
详细信息
In this paper, the impact of data pre-processing strategies on the performance of forecasting models for solar power generation is comprehensively analyzed. The integration of outlier detection, normalization, and mis...
详细信息
Time series anomaly detection is an important research topic in the field of intelligent operation and maintenance. When software systems are frequently updated with continuous integration and deployment, the distribu...
详细信息
ISBN:
(数字)9781665416931
ISBN:
(纸本)9781665416931
Time series anomaly detection is an important research topic in the field of intelligent operation and maintenance. When software systems are frequently updated with continuous integration and deployment, the distribution of KPI data will also change, and the accuracy of anomaly detection models will inevitably decrease. To tackle this problem, we propose an active anomaly detection framework named Active-MTSAD suitable for multi-dimensional time series, combining unsupervised anomaly detection and active learning. The active learning module introduces three feedback strategies, namely denominator penalty, negative penalty, and metric learning, to learn new anomalous patterns under new data distribution. In metric learning, we consider the difference between normal and abnormal samples in reconstruction error and latent space. We conduct extensive experiments on a large-scale public dataset and a real-world dataset coming from Tencent. The experimental results show that Active-MTSAD can still achieve excellent performance in real scenarios where the distribution changes with only 0.2(% of labels.
SARS-CoV-2, also referred to as the acute respiratory syndrome with coronavirus, first surfaced in late 2019. It was later determined to be a fatal illness that is wreaking havoc on the global economy. It was designat...
详细信息
Ticket classification is a process to define the category name of each ticket before assigning the resolution team to serve each ticket. It is an important process to support the customers inside and outside the compa...
详细信息
ISBN:
(数字)9781665485104
ISBN:
(纸本)9781665485104
Ticket classification is a process to define the category name of each ticket before assigning the resolution team to serve each ticket. It is an important process to support the customers inside and outside the company. It can make customer dissatisfaction if the processing time is high or delayed. Based on the recording data in 2019 - 2021 at the studying company, we found that the manual ticket classification got an error rate about 53 percent because the office workers misunderstand. To alleviate this problem, we propose the methodology for automatic Thai ticket classification by using Term Frequency-Inverse Document Frequency with Support Vector machine. The experimental result shows that the performance of the proposed methodology is higher than the manual classification by 2 times or 41 percent.
The hydraulic piston pump is an important component in the hydraulic system, and it has always been an important problem to accurately and effectively estimate the degradation state of the pump. Physical model-based a...
详细信息
machinelearning (ML) training and scoring fundamentally relies on linear algebra programs and more general tensor computations. Most ML systems utilize distributed parameter servers and similar distribution strategie...
详细信息
Self-supervised learning has achieved a great success in the representation learning of visual and textual data. However, the current methods are mainly validated on the well-curated datasets, which do not exhibit the...
Self-supervised learning has achieved a great success in the representation learning of visual and textual data. However, the current methods are mainly validated on the well-curated datasets, which do not exhibit the real-world long-tailed distribution. Recent attempts to consider self-supervised long-tailed learning are made by rebalancing in the loss perspective or the model perspective, resembling the paradigms in the supervised long-tailed learning. Nevertheless, without the aid of labels, these explorations have not shown the expected significant promise due to the limitation in tail sample discovery or the heuristic structure design. Different from previous works, we explore this direction from an alternative perspective, i.e., the data perspective, and propose a novel Boosted Contrastive learning (BCL) method. Specifically, BCL leverages the memorization effect of deep neural networks to automatically drive the information discrepancy of the sample views in contrastive learning, which is more efficient to enhance the long-tailed learning in the label-unaware context. Extensive experiments on a range of benchmark datasets demonstrate the effectiveness of BCL over several state-of-the-art methods. Our code is available at https://***/MediaBrain-SJTU/BCL.
暂无评论