In this study, we present an innovative unsupervised hyperspectral image classification method using a dual-branch architecture that merges spatial and spectral feature extraction. Our unique approach employs masked a...
详细信息
1 *** Activity Recognition(GAR),which aims to identify activities performed collectively in videos,has gained significant attention *** conventional action recognition centered on single individuals,GAR explores the c...
详细信息
1 *** Activity Recognition(GAR),which aims to identify activities performed collectively in videos,has gained significant attention *** conventional action recognition centered on single individuals,GAR explores the complex interactions between multiple individuals.
In this study, we present an innovative unsupervised hyperspectral image classification method using a dual-branch architecture that merges spatial and spectral feature extraction. Our unique approach employs masked a...
In this study, we present an innovative unsupervised hyperspectral image classification method using a dual-branch architecture that merges spatial and spectral feature extraction. Our unique approach employs masked autoencoders, significantly outperforming traditional methods with an impressive overall accuracy of 97.1%. The paper details the model’s performance evaluation, offers visual insights into its classification capabilities, and compares it with existing techniques, demonstrating its effectiveness and potential for advancing remote sensing applications.
In the intelligent microscopic imaging system, the focusing evaluation function is one of the important core links in the automatic focusing system. In order to solve the problem that the focusing curve loses the char...
详细信息
Privacy-preserving image generation is particularly crucial in fields like healthcare, where data are both sensitive and limited. However, effective privacy preservation often compromises the visual quality and utilit...
详细信息
This paper introduces a novel deep learning framework for the discovery of breast cancer stages, which integrates GAN-generated synthetic images with multi-omics data. By employing StyleGAN3 for the generation of real...
详细信息
ISBN:
(数字)9798331529819
ISBN:
(纸本)9798331529826
This paper introduces a novel deep learning framework for the discovery of breast cancer stages, which integrates GAN-generated synthetic images with multi-omics data. By employing StyleGAN3 for the generation of realistic histopathological images and Swin Transformer for classification, the model draws upon both visual and biological data to enhance the accuracy of cancer staging predictions. The proposed methodology entails the generation of high-quality synthetic images using StyleGAN3, with a Fréchet Inception Distance (FID) score of 35, indicating a reasonable degree of similarity to real images. The images, in conjunction with RNA, miRNA, and clinical data, are integrated into a Swin Transformer-based classifier, resulting in an accuracy of 95.03 %, a precision of 95.00 %, and an F1 score of 95.00 %. A threshold-based softmax probability analysis was employed during the inference stage to explore the potential discovery of new cancer stages. The preliminary observation-based threshold of 30 % may be optimized through further experimentation. In the event that the model exhibited a confidence level for a given class below the specified threshold, the image was identified as a potential candidate for a previously unidentified stage. This study underscores the potential of multimodal data integration in enhancing breast cancer staging and offers insights into leveraging deep learning models for generating and classifying histopathological data, alongside identifying novel disease stages.
Today, the world economy is in the stage of rapid development, followed by the rising quantity, the decreasing moisture content, and the increasing heat value of Municipal Solid Waste (MSW). Since waste incineration h...
详细信息
Micro-expressions (MEs) are spontaneous facial movements that reveal an individual's genuine emotions and play a crucial role in various domains, including lie detection, criminal analysis, mental health treatment...
详细信息
One of the key problems in computer vision is adaptation: models are too rigid to follow the variability of the inputs. The canonical computation that explains adaptation in sensory neuroscience is divisive normalizat...
详细信息
One of the key problems in computer vision is adaptation: models are too rigid to follow the variability of the inputs. The canonical computation that explains adaptation in sensory neuroscience is divisive normalizat...
详细信息
暂无评论