检索结果-内蒙古大学图书馆

Ethical AI in facial expression analysis: racial bias

signal image AND VIDEO processing 2023年第2期17卷 399-406页

作者： Sham, Abdallah Hussein Aktas, Kadir Rizhinashvili, Davit Kuklianov, Danila Alisinanoglu, Fatih Ofodile, Ikechukwu Ozcinar, Cagri Anbarjafari, Gholamreza Tallinn Univ Baltic Film Media & Arts Sch Enact Virtual Lab Narva Mnt 25 EE-10120 Tallinn Estonia Univ Tartu iCV Lab Tartu Estonia iVCV EE-51011 Tartu Estonia Hasan Kalyoncu Univ Fac Engn Gaziantep Turkey PwC Advisory Helsinki Finland Yildiz Tech Univ Istanbul Turkey

Facial expression recognition using deep neural networks has become very popular due to their successful performances. However, the datasets used during the development and testing of these methods lack a balanced distribution of races among the sample images. This leaves a possibility of the methods being biased toward certain races. Therefore, a concern about fairness arises, and the lack of research aimed at investigating racial bias only increases the concern. On the other hand, such bias in the method would decrease the real-world performance due to the wrong generalization. For these reasons, in this study, we investigated the racial bias within popular state-of-the-art facial expression recognition methods such as Deep Emotion, Self-Cure Network, ResNet50, InceptionV3, and DenseNet121. We compiled an elaborated dataset with images of different races, cross-checked the bias for methods trained, and tested on images of people of other races. We observed that the methods are inclined towards the races included in the training data. Moreover, an increase in the performance increases the bias as well if the training dataset is imbalanced. Some methods can make up for the bias if enough variance is provided in the training set. However, this does not mitigate the bias completely. Our findings suggest that an unbiased performance can be obtained by adding the missing races into the training data equally.

关键词： Facial expression recognition (FER) Deep neural networks Reaction emotion LSTM

来源：评论

学校读者我要写书评

暂无评论

BCNN: Backpropagation CNN-Based fully unsupervised skull stripping for accurate brain segmentation

引用

BIOMEDICAL signal processing AND CONTROL 2024年 88卷

作者： Verma, Poonam Rani Bhandari, Ashish Kumar Natl Inst Technol Patna Dept Elect & Commun Engn Patna 800005 India

Brain skull stripping is an essential step before performing the segmentation. It leads to better performance and less computational load on the model due to the elimination of redundant features from the image. However, the preparation of the skull-stripped ground truth brain images by the experts is a very tedious task and may lead to human errors. In this article, a fully unsupervised approach to brain extraction has been proposed. The cascaded loss function is used and is tuned for better segmentation. The number of connected components is also tuned in each type of dataset. The cascaded loss function is a combination of focal loss (FL) and dice loss (DL). To address the class imbalance issue, the leaky ReLU activation function is used. Enhancement of the brain image has been performed before extraction which yielded better performance. In comparison with other methods, the proposed backpropagation-based convolutional neural network (BCNN) work gives better qualitative and quantitative outcomes on four out of seven parameters. The dice similarity coefficient (DSC) of the proposed model is 0.89 which is the highest as compared to the other models for brain extraction. The specificity of the model is 0.998 with 97.21 % accuracy. The undersegmentation error is reduced to 1.2002 using the cascaded loss function. The proposed work has been evaluated on four brain image datasets. It has been found that the proposed model extracts the brain from the skull and also the white matter, gray matter, and cerebrospinal fluid. Therefore, it has been noted that the proposed model will be beneficial for efficient brain skull stripping without the availability of ground truth data.

关键词： Brain extraction Brain tissues Focal loss Skull stripping Unsupervised image segmentation

来源：评论

学校读者我要写书评

暂无评论

Automated magnetocardiography classification using a deformable convolutional block attention module

引用

BIOMEDICAL signal processing AND CONTROL 2025年 105卷

作者： Wang, Ruizhe Pang, Jiaojiao Han, Xiaole Xiang, Min Ning, Xiaolin Beihang Univ Sch Instrumentat & Optoelect Engn Key Lab Ultraweak Magnet Field Measurement Technol Minist Educ Beijing 100191 Peoples R China Beihang Univ Hangzhou Innovat Inst Zhejiang Prov Key Lab Ultraweak Magnet Field Space Hangzhou 310051 Zhejiang Peoples R China Beihang Univ Hangzhou Inst Natl Extremely Weak Magnet Field Inf Hangzhou 310028 Zhejiang Peoples R China Shandong Univ Inst Magnet Field Free Med & Funct Imaging Shandong Key Lab Magnet Field Free Med & Funct Ima Jinan Peoples R China Shandong Univ Shandong Prov Clin Res Ctr Emergency & Crit Care M Dept Emergency Med Qilu Hosp Jinan Peoples R China Shandong Univ Natl Innovat Platform Ind Educ Intearat Med Engn I Jinan Peoples R China Hefei Natl Lab Hefei 230088 Anhui Peoples R China

Objective: This study developed a fast and accurate automated method for magnetocardiography (MCG) classification. Approach: We propose a deformable convolutional block attention module (DCBAM)-based method for classifying coronary artery disease (CAD) using MCG. After preprocessing, the raw MCG data were segmented into individual heartbeat segments and encoded into image representations using the Hilbert curve to convert the temporal features into spatial image features. We combined DCBAM with convolutional neural networks (CNNs) for MCG classification. DCBAM incorporated a deformable convolutional architecture along with temporal and spatial attention mechanisms to capture representative and correlative features of the image representation MCG along the temporal and spatial multichannel dimensions. We performed ablation experiments to evaluate the rationality and validity of the proposed model structure. Additionally, we performed an interpretability analysis to investigate the model's region of interest for CAD diagnosis. Results: The proposed method achieved an average accuracy of 93.57%, precision of 94.71%, sensitivity of 92.56%, specificity of 94.68%, and average F1-score of 93.60%. In contrast to existing methods, our proposed model achieved superior diagnostic classification results in MCG with fewer parameters. Significance: Integrating DCBAM with image-representation MCG establishes a novel feature extraction method that enhances the clinical utility of MCG and effectively addresses long-range dependencies and spatiotemporal inconsistencies in time-series signal analysis.

关键词： Magnetocardiography Coronary artery disease Convolutional neural network Attention mechanism Deformable convolutional block attention module

来源：评论

学校读者我要写书评

暂无评论

Multi-head attention with CNN and wavelet for classification of hyperspectral image

引用

neural COMPUTING & APPLICATIONS 2023年第10期35卷 7595-7609页

作者： Tulapurkar, Harshula Banerjee, Biplab Buddhiraju, Krishna Mohan Indian Inst Technol Ctr Studies Resources Engn Mumbai 400076 India

Hyperspectral image (HSI) is characterized by large number of bands with a high spectral resolution where continuous spectrum is measured for each pixel. This high volume therefore leads to challenges in processing the dataset. Objective of Dimensionality Reduction (DR) algorithms is to identify and eliminate statistical redundancies of hyperspectral data while keeping as much spectral information as possible. Combining spectral and spatial information offers a more comprehensive classification approach. Convolutional neural network (CNN) has the potential to extract complex spatial and spectral features embedded in Hyperspectral data. Wavelet transform belongs to the family of multi-scale transformation where the input signal is analyzed at different levels of granularity. Attention mechanism is a method in neural networks to guide the algorithm to focus on the important information in the data. In this paper, we use Multi-head Transformer-based Attention (Vaswani et al. in Attention is all you Need, 2017) technique for Channel attention which captures the long-range spectral dependencies. The experimental results show that the proposed algorithm MT-CW Band Selection-based multi-head transformer for dimensionality reduction and Wavelet CNN-based algorithm for feature extraction yields impressive results in terms of information conservation and class separability.

关键词： Transformer Band attention Convolutional neural network (CNN) Hyperspectral (HSI) image classification Wavelet Dimensionality reduction Multi-head channel attention

来源：评论

学校读者我要写书评

暂无评论

Multi-dimensional spatial pruning for remote sensing image scene classification

引用

DIGITAL signal processing 2025年 158卷

作者： Zhai, Dezhao Chen, Wei Miao, Baoming Liu, Fulong Han, Siqi Ding, Yinghao Yu, Ming Wu, Hang Tianjin Univ Technol Sch Mech Engn Tianjin Key Lab Adv Mechatron Syst Design & Intell Tianjin 300384 Peoples R China Tianjin Univ Technol Natl Demonstrat Ctr Expt Mech & Elect Engn Educ Tianjin Peoples R China Peoples Liberat Army Acad Mil Sci Syst Engn Inst Tianjin 300161 Peoples R China Nankai Univ Sch Artificial Intelligence Tianjin 300381 Peoples R China

In recent years, remote sensing image classification tasks have garnered widespread attention and have been extensively studied by researchers. Most current studies focus on improving classification accuracy, leading to overly large and complex networks with high computational costs that are challenging to deploy for real-time remote sensing tasks. To address this issue, neural network pruning has emerged as an effective solution. However, existing pruning methods typically prune along a single dimension, and as the pruning ratio increases, important weights in that dimension often suffer from over-pruning, resulting in significant accuracy loss. This paper proposes a novel pruning method for remote sensing scene classification-Multidimensional Space Pruning (MSP). MSP performs stereoscopic pruning of filters along both channel and depth dimensions, simultaneously removing redundant information across two different dimensions. This prevents excessive pruning of important weights in a single dimension, thereby significantly reducing model complexity while maintaining accuracy. As a novel pruning method, MSP achieves remarkable results. At a pruning ratio of 0.4, MSP-pruned VGG-16 and ResNet-34 models on the NWPU-RESISC45 dataset show accuracy drops of only 1.05 % and 0.71 %, respectively, while achieving compression ratios of 92.52 % and 93.19 %. Similarly, on the AID dataset, the accuracy drops are merely 0.26 % and 0.54 %, with compression ratios reaching 96.23 % and 88.56 %, respectively. Experimental results on two public remote sensing image datasets demonstrate that compared to existing methods, MSP achieves higher compression ratios while maintaining model accuracy, showcasing superior model compression performance.

关键词： Model pruning Model compression Multidimensional space pruning (MSP) Remote sensing image classification

来源：评论

学校读者我要写书评

暂无评论

Learning Degradation-Independent Representations for Camera ISP Pipelines

Learning Degradation-Independent Representations for Camera ...

引用

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

作者： Guo, Yanhui Luo, Fangzhou Wu, Xiaolin McMaster Univ Hamilton ON Canada

ISBN: (纸本)9798350353006

image signal processing (ISP) pipeline plays a fundamental role in digital cameras, which converts raw Bayer sensor data to RGB images. However, ISP-generated images usually suffer from imperfections due to the compounded degradations that stem from sensor noises, demosaicing noises, compression artifacts, and possibly adverse effects of erroneous ISP hyperparameter settings such as ISO and gamma values. In a general sense, these ISP imperfections can be considered as degradations. The highly complex mechanisms of ISP degradations, some of which are even unknown, pose great challenges to the generalization capability of deep neural networks (DNN) for image restoration and to their adaptability to downstream tasks. To tackle the issues, we propose a novel DNN approach to learn degradation-independent representations (DiR) through the refinement of a self-supervised learned baseline representation. The proposed DiR learning technique has remarkable domain generalization capability and consequently, it outperforms state-of-the-art methods across various downstream tasks, including blind image restoration, object detection, and instance segmentation, as verified in our experiments.

关键词： Deep neural Networks Degradation Independent image Restoration image signal processing

来源：评论

学校读者我要写书评

暂无评论

Investigation on the blurred image restoration based on brain-inspired model

引用

IET image processing 2023年第1期17卷 12-23页

作者： Xiangyan, Meng Chen, Zhao Li, Zhao Keding, Yan Yumiao, Ren Haixian, Pan Xian Technol Univ Elect Engn Inst Xian Shaanxi Peoples R China

Though relatively good effect has been achieved by the image de-blurring method based on deep learning, the existing methods still suffer from the problem of unclear restoration of the edges. Therefore, brain-inspired image restoration model based on human attention and "fine vision" is proposed to improve the blind restoration quality of the image in this paper according to the response mechanism of the different cerebral cortices for high and low spatial resolutions. The designed brain-inspired model consists of dual-channel network available to realize the function of feature merger for low and high resolutions, which is used to extract the image edges with detailed information filtered out. Confirmatory experiment is implemented based on the blurred image in the data set of GOPRO, LIVE and set14. As per the result, the model proposed is available for relatively good restoration of blurred image and super-resolution, as well as looking results by visual inspection.

关键词： blind restoration quality image edges low spatial resolutions relatively good restoration deep learning de-blurring method brain-inspired image restoration model neural nets Optical, image and video signal processing Computer vision and image processing techniques high resolutions image restoration low resolutions different cerebral cortices human attention deep learning (artificial intelligence) blurred image restoration designed brain-inspired model relatively good effect unclear restoration image reconstruction brain image resolution

来源：评论

学校读者我要写书评

暂无评论

Research on Brain Visual image signal Recognition Method Based on Deep neural Network 17

Research on Brain Visual Image Signal Recognition Method Bas...

引用

17th IEEE International Conference on signal processing, ICSP 2024

作者： Chen, Jingyuan Guo, Wenhui Liu, Mengxue Ma, Rui Wang, Yanjiang College of Control Science and Engineering Qingdao China

ISBN: (纸本)9798350387384

With the advancement of deep learning, the accuracy of image classification continues to rise. However, the generalization capability of deep learning methods, such as those applied in image classification, lags significantly behind that of the human brain. Therefore, in this study, we propose a novel approach driven by human brain activity for classifying visual image signals. Specifically, we utilize a non-invasive 64-channel EEG cap to capture the brain activity of six subjects as they view 40 images across five object categories. Subsequently, we employ two Long Short-Term Memory (LSTM) network structures to extract feature vectors from the EEG signals. Following this, we employ Convolutional neural Network (CNN) to extract feature vectors from the images. Lastly, we conduct a regression task on the extracted EEG and image feature vectors, enabling the machine to perform visual image classification based on human brain-inspired features. Our LSTM-based approach achieves a maximum accuracy of 66.13% in distinguishing object categories, surpassing existing methods for learning EEG-based visual object representations. Regarding image classification, our human brain-driven approach demonstrates commendable performance, even comparable to robust CNN models. The dataset can be found on https://***/Goustjager0/CJY-dataset. © 2024 IEEE.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

On penalty-based bilevel gradient descent method

引用

MATHEMATICAL PROGRAMMING 2025年 1-51页

作者： Shen, Han Xiao, Quan Chen, Tianyi Rensselaer Polytech Inst Dept Elect Comp & Syst Engn Troy NY 12180 USA

Bilevel optimization enjoys a wide range of applications in emerging machine learning and signal processing problems such as hyper-parameter optimization, image reconstruction, meta-learning, adversarial training, and reinforcement learning. However, bilevel optimization problems are traditionally known to be difficult to solve. Recent progress on bilevel algorithms mainly focuses on bilevel optimization problems through the lens of the implicit-gradient method, where the lower-level objective is either strongly convex or unconstrained. In this work, we tackle a challenging class of bilevel problems through the lens of the penalty method. We show that under certain conditions, the penalty reformulation recovers the (local) solutions of the original bilevel problem. Further, we propose the penalty-based bilevel gradient descent (PBGD) algorithm and establish its finite-time convergence for the constrained bilevel problem with lower-level constraints yet without lower-level strong convexity. Experiments on synthetic and real datasets showcase the efficiency of the proposed PBGD algorithm. The code for implementing this algorithm is publicly available on GitHub.

关键词： Bilevel optimization First-order methods stochastic optimization Convergence analysis

来源：评论

学校读者我要写书评

暂无评论

PERCEPTUAL LEARNED image COMPRESSION VIA END-TO-END JND-BASED OPTIMIZATION 31

PERCEPTUAL LEARNED IMAGE COMPRESSION VIA END-TO-END JND-BASE...

引用

2024 International Conference on image processing

作者： Pakdaman, Farhad Nami, Sanaz Gabbouj, Moncef Tampere Univ Fac Informat Technol & Commun Sci Tampere Finland

ISBN: (纸本)9798350349405;9798350349399

Emerging Learned image Compression (LC) achieves significant improvements in coding efficiency by end-to-end training of neural networks for compression. An important benefit of this approach over traditional codecs is that any optimization criteria can be directly applied to the encoder-decoder networks during training. Perceptual optimization of LC to comply with the Human Visual System (HVS) is among such criteria, which has not been fully explored yet. This paper addresses this gap by proposing a novel framework to integrate Just Noticeable Distortion (JND) principles into LC. Leveraging existing JND datasets, three perceptual optimization methods are proposed to integrate JND into the LC training process: (1) Pixel-Wise JND Loss (PWL) prioritizes pixel-by-pixel fidelity in reproducing JND characteristics, (2) image-Wise JND Loss (IWL) emphasizes on overall imperceptible degradation levels, and (3) Feature-Wise JND Loss (FWL) aligns the reconstructed image features with perceptually significant features. Experimental evaluations demonstrate the effectiveness of JND integration, highlighting improvements in rate-distortion performance and visual quality, compared to baseline methods. The proposed methods add no extra complexity after training.

关键词： Just Noticeable Distortion (JND) Human Visual System ( HVS) learned compression perceptual optimization

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：