The concept of a minimax classifier is well-established in statistical decision theory, but its implementation via neural networks remains challenging, particularly in scenarios with imbalanced training data having a ...
详细信息
1 Introduction In recent years,foundation Vision-Language Models(VLMs),such as CLIP[1],which empower zero-shot transfer to a wide variety of domains without fine-tuning,have led to a significant shift in machine learn...
详细信息
1 Introduction In recent years,foundation Vision-Language Models(VLMs),such as CLIP[1],which empower zero-shot transfer to a wide variety of domains without fine-tuning,have led to a significant shift in machine learning *** the impressive capabilities,it is concerning that the VLMs are prone to inheriting biases from the uncurated datasets scraped from the Internet[2–5].We examine these biases from three perspectives.(1)Label bias,certain classes(words)appear more frequently in the pre-training data.(2)Spurious correlation,non-target features,e.g.,image background,that are correlated with labels,resulting in poor group robustness.(3)Social bias,which is a special form of spurious correlation,focuses on societal *** image-text pairs might contain human prejudice,e.g.,gender,ethnicity,and age,that are correlated with *** biases are subsequently propagated to downstream tasks,leading to biased predictions.
This paper presents a novel approach for generating intricate Batik motifs using a modified Diffusion-Generative Adversarial Network (Diffusion-GAN) augmented with StyleGAN2-Ada. Motivated by the rich cultural heritag...
详细信息
Recommendation systems (RS) have become prevalent across different domains including music, e-commerce, e-learning, entertainment, and social media to address the issue of information overload. While traditional RS ap...
详细信息
In this study, tests were done to see what would happen if hydrogen (H2) and lemon grass oil (LO) were used for a lone-cylinder compression ignition engine as a partial diesel replacement. After starting the trial wit...
详细信息
The detection of violence in videos has become an extremely valuable application in real-life situations, which aim to maintain and protect people’s safety. Despite the complexities inherent in videos and the abrupt ...
详细信息
Voice pathology detection (VPD) aims to accurately identify voice impairments by analyzing speech signals. This study proposes models based on deep learning (DL) for binary classification to distinguish between health...
详细信息
Emotions have a significant impact on how people make decisions. Due to its potential applications in various fields, emotion intensity detection has attracted a lot of attention recently. Several methods have been pr...
详细信息
Effective recommender systems play a crucial role in accurately capturing user and item attributes that mirror individual preferences. Some existing recommendation techniques have started to shift their focus towards ...
详细信息
Medical imaging, a cornerstone of disease diagnosis and treatment planning, faces the hurdles of subjective interpretation and reliance on specialized expertise. Deep learning algorithms show improvements in automatin...
详细信息
暂无评论