Despite notable progress in enhancing the capability of machine learning against distribution shifts, training data quality remains a bottleneck for cross-distribution generalization. Recently, from a data-centric per...
Despite notable progress in enhancing the capability of machine learning against distribution shifts, training data quality remains a bottleneck for cross-distribution generalization. Recently, from a data-centric perspective, there have been considerable efforts to improve model performance through refining the preparation of training data. Inspired by realistic scenarios, this paper addresses a practical requirement of acquiring training samples from various domains on a limited budget to facilitate model generalization to target test domain with distribution shift. Our empirical evidence indicates that the advance in data acquisition can significantly benefit the model performance on shifted data. Additionally, by leveraging unlabeled test domain data, we introduce a Domain-wise Active Acquisition framework. This framework iteratively optimizes the data acquisition strategy as training samples are accumulated, theoretically ensuring the effective approximation of test distribution. Extensive real-world experiments demonstrate our proposal's advantages in machine learning applications. The code is available at https://***/dongbaili/DAA.
Missing link prediction is a method for network analysis, with applications in recommender systems, biology, social sciences, cybersecurity, information retrieval, and Artificial Intelligence (AI) reasoning in Knowled...
详细信息
Node classification on graphs is a significant task with a wide range of applications, including social analysis and anomaly detection. Even though graph neural networks (GNNs) have produced promising results on this ...
详细信息
Node classification on graphs is a significant task with a wide range of applications, including social analysis and anomaly detection. Even though graph neural networks (GNNs) have produced promising results on this task, current techniques often presume that label information of nodes is accurate, which may not be the case in real-world applications. To tackle this issue, we investigate the problem of learning on graphs with label noise and develop a novel approach dubbed Consistent Graph Neural Network (CGNN) to solve it. Specifically, we employ graph contrastive learning as a regularization term, which promotes two views of augmented nodes to have consistent representations. Since this regularization term cannot utilize label information, it can enhance the robustness of node representations to label noise. Moreover, to detect noisy labels on the graph, we present a sample selection technique based on the homophily assumption, which identifies noisy nodes by measuring the consistency between the labels with their neighbors. Finally, we purify these confident noisy labels to permit efficient semantic graph learning. Extensive experiments on three well-known benchmark datasets demonstrate the superiority of our CGNN over competing approaches.
The classification of medical images has greatly advanced due to improvements in imaging technologies and the application of deep learning. This study presents an automatic system for classifying peripheral blood cell...
详细信息
ISBN:
(数字)9798350386448
ISBN:
(纸本)9798350386455
The classification of medical images has greatly advanced due to improvements in imaging technologies and the application of deep learning. This study presents an automatic system for classifying peripheral blood cells, leveraging deep learning and transfer learning techniques to enhance performance and efficiency. We developed three designs combining convolutional architectures: VGG16, InceptionV3and ResNet50. The first design combines VGG16 and InceptionV3, the second concatenates InceptionV3and ResNet50, and the third associates VGG16 and ResNet50. Experiments were conducted on the Peripheral Blood Cell (PBC) dataset, containing 17,092 images across eight distinct classes. The results demonstrate the effectiveness of our approach, achieving a maximum accuracy of 99%.
The existence of deep learning’s "black box" makes it difficult to understand how the algorithms analyze patterns and make image-level predictions. A representation of the pixels contributing the most to th...
详细信息
Behind the scenes, local municipalities and environmental agencies conduct the water testing needed to ensure public safety. While this is a critical service, it can also be resource-intensive and timeconsuming, as ma...
详细信息
Behind the scenes, local municipalities and environmental agencies conduct the water testing needed to ensure public safety. While this is a critical service, it can also be resource-intensive and timeconsuming, as many programs rely on manual sampling and data collection. This can mean that some water bodies are sampled only once a year, or even less.
Objective and Impact *** tumors from normal tissues is vital in the intraoperative diagnosis and pathological *** this work,we propose to utilize Raman spectroscopy as a novel modality in surgery to detect colorectal ...
详细信息
Objective and Impact *** tumors from normal tissues is vital in the intraoperative diagnosis and pathological *** this work,we propose to utilize Raman spectroscopy as a novel modality in surgery to detect colorectal cancer *** spectra can reflect the substance components of the target ***,the feature peak is slight and hard to detect due to environmental *** a high-quality Raman spectroscopy dataset and developing effective deep learning detection methods are possibly viable ***,we collect a large Raman spectroscopy dataset from 26 colorectal cancer patients with the Raman shift ranging from 385 to 1545 cm^(−1).Second,a one-dimensional residual convolutional neural network(1D-ResNet)architecture is designed to classify the tumor tissues of colorectal ***,we visualize and interpret the fingerprint peaks found by our deep learning *** results show that our deep learning method achieves 98.5%accuracy in the detection of colorectal cancer and outperforms traditional ***,Raman spectra are a novel modality for clinical detection of colorectal *** proposed ensemble 1D-ResNet could effectively classify the Raman spectra obtained from colorectal tumor tissues or normal tissues.
Abuses of forgery techniques have created a considerable problem of misinformation on social media. Although scholars devote many efforts to face forgery detection (a.k.a DeepFake detection) and achieve some results, ...
Abuses of forgery techniques have created a considerable problem of misinformation on social media. Although scholars devote many efforts to face forgery detection (a.k.a DeepFake detection) and achieve some results, two issues still hinder the practical application. 1) Most detectors do not generalize well to unseen datasets. 2) In a supervised manner, most previous works require a considerable amount of manually labeled data. To address these problems, we propose a simple contrastive pertaining framework for DeepFake detection (DFCP), which works in a finetuning-after-pretraining manner, and requires only a few labels (5%). Specifically, we design a two-stream framework to simultaneously learn high-frequency texture features and high-level semantics information during pretraining. In addition, a video-based frame sampling strategy is proposed to mitigate potential noise data in the instance-discriminative contrastive learning to achieve better performance. Experimental results on several downstream datasets show the state-of-the-art performance of the proposed DFCP, which works at frame-level (w/o temporal reasoning) with high efficiency but outperforms video-level methods.
Fashion recommendation is an essential component of user shopping that it is capable of selecting and presenting fascinating items to customers. The fact that humans exhibit inconsistencies for fashion items in their ...
详细信息
Encoding constraints into neural networks is attractive. This paper studies how to introduce the popular positive linear satisfiability to neural networks. We propose the first differentiable satisfiability layer base...
详细信息
暂无评论