Natural disasters, including earthquakes, cyclones, floods, and wildfires, cause significant environmental damage and have emerged as a major global issue. These events can result in loss of life and disrupt communiti...
详细信息
Vision-language models (VLMs) have emerged as formidable tools, showing their strong capability in handling various open-vocabulary tasks in image recognition, text-driven visual content generation, and visual chatbot...
详细信息
Vision-language models (VLMs) have emerged as formidable tools, showing their strong capability in handling various open-vocabulary tasks in image recognition, text-driven visual content generation, and visual chatbots, to name a few. In recent years, considerable efforts and resources have been devoted to adaptation methods for improving the downstream performance of VLMs, particularly on parameter-efficient fine-tuning methods like prompt learning. However, a crucial aspect that has been largely overlooked is the confidence calibration problem in fine-tuned VLMs, which could greatly reduce reliability when deploying such models in the real world. This paper bridges the gap by systematically investigating the confidence calibration problem in the context of prompt learning and reveals that existing calibration methods are insufficient to address the problem, especially in the open-vocabulary setting. To solve the problem, we present a simple and effective approach called Distance-Aware Calibration (DAC), which is based on scaling the temperature using as guidance the distance between predicted text labels and base classes. The experiments with 7 distinct prompt learning methods applied across 11 diverse downstream datasets demonstrate the effectiveness of DAC, which achieves high efficacy without sacrificing the inference speed. Our code is available at https://***/mlstat-Sustech/CLIP Calibration. Copyright 2024 by the author(s)
Coreset selection is powerful in reducing computational costs and accelerating data processing for deep learning algorithms. It strives to identify a small subset from large-scale data, so that training only on the su...
Coreset selection is powerful in reducing computational costs and accelerating data processing for deep learning algorithms. It strives to identify a small subset from large-scale data, so that training only on the subset practically performs on par with full data. Practitioners regularly desire to identify the smallest possible coreset in realistic scenes while maintaining comparable model performance, to minimize costs and maximize acceleration. Motivated by this desideratum, for the first time, we pose the problem of refined coreset selection, in which the minimal coreset size under model performance constraints is explored. Moreover, to address this problem, we propose an innovative method, which maintains optimization priority order over the model performance and coreset size, and efficiently optimizes them in the coreset selection procedure. Theoretically, we provide the convergence guarantee of the proposed method. Empirically, extensive experiments confirm its superiority compared with previous strategies, often yielding better model performance with smaller coreset sizes. The implementation is available at https://***/xiaoboxia/LBCS. Copyright 2024 by the author(s)
The study investigates the increasing demand of online learning as a means of addressing education issues in the context of the COVID-19 epidemic. Online learning requires several adaptations for teaching methods, lea...
详细信息
This research offers a new perspective on predicting the activity of the HIV virus from the Drug Therapeutics Program (DTP) Antiviral Screen by using the molecular data represented in SMILES notation. The topic has si...
详细信息
As social media significantly shapes societal norms and ethical paradigms, understanding real-time public sentiment provides valuable insights for political parties to evaluate their candidates, key issues, and campai...
详细信息
This paper addresses the vital task of diagnosing breast cancer early and accurately. It acknowledges the difficulty in detecting minor variations in breast cancer patterns through visual examination alone, particular...
详细信息
This research delves to predict PT Vale Indonesia Tbk stock price as an experiment on Indonesian stock using three models: naïve, LSTM, and 1D-CNN. Our analysis emphasizes the importance of matching model archite...
详细信息
The successful development of an effective machine learning model for detecting malicious queries is a challenging task that requires domain expertise. Malicious queries are complex and constantly evolving, and their ...
详细信息
In the digital era and the evolution of social media platforms like TikTok, understanding the factors influencing content virality has become increasingly crucial. Therefore, this research aims to delve into the music...
详细信息
暂无评论