A micro-expression is a fleeting, delicate and localized facial gesture. It can expose the true feelings that someone is trying to hide and is seen to be a crucial indicator for spotting lies. Because of its possible ...
详细信息
A micro-expression is a fleeting, delicate and localized facial gesture. It can expose the true feelings that someone is trying to hide and is seen to be a crucial indicator for spotting lies. Because of its possible applications in a variety of sectors, micro-expression research has garnered a lot of attention. The accuracy of micro-expression recognition still needs to be improved, though, because of the brief and weak motions that make up micro- expressions. In recent years, Deep convolution neuralmethods have depicted a higher degree of efficiency for complex challenge of face detection. Although several attempts were made for micro-expression recognition (MER), the problem is far from being resolved problem which is portrayed by the lowest accuracy rate depicted by the other models. In this study, present a Facial Micro-Expression Detection and Classification using Modified Multimodal Ensemble Learning (FMEDC-MMEL) approach. The major intention of the FMEDC-MMEL technique lies in the proficient identification of MEs that exist in the facial images. As a pre-processing phase, the FMEDCMMEL technique exploits histogram equalization (HE) approach to improve the contrast level of the image. In the FMEDC-MMEL technique, improved densely connected networks (DenseNet) model is used for learning feature patterns from the pre-processed images. To enhance the proficiency of the improved DenseNet model, stochastic gradient descent (SGD) approach is used for hyperparameter selection process. For facial ME detection, the FMEDC-MMEL technique follows an ensemble of three classifiers namely bi-directional gated recurrent unit (Bi-GRU), long short-term memory (LSTM) and extreme learning machine (ELM). A tailored ensemble learning approach is shown, which combines many machine learning models to improve classification performance and detection accuracy. Sophisticated feature extraction methods are utilized to extract the subtle aspects of micro-expressions, and precision is ma
To acquire color images, most commercial cameras rely on color filter arrays (CFAs), which are a pattern of color filters overlaid over the sensor's focal plane. Demosaicking describes the processing techniques to...
详细信息
ISBN:
(纸本)9789464593617;9798331519773
To acquire color images, most commercial cameras rely on color filter arrays (CFAs), which are a pattern of color filters overlaid over the sensor's focal plane. Demosaicking describes the processing techniques to reconstruct a full color image for all pixels on the focal plane array. Most demosaicking methods are tailored for a specific CFA, and tend to work poorly for others. In this work we present an algorithm for demosaicking a wide variety of CFAs. The proposed method allows to blend the knowledge of the CFA with information coming from data, employing a novel transformation and pattern-invariant loss function. The method is based on the unrolling of an algorithm based on a neural network learned on available examples. Preliminary experiments over RGB and RGBW CFAs show that the method performs well over a range of CFAs and is competitive for CFAs for which competing methods were tailored to work well on.
DNA exhibits remarkable potential as a data storage solution due to its impressive storage density and long-term stability, stemming from its inherent biomolecular structure. However, developing this novel medium come...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
DNA exhibits remarkable potential as a data storage solution due to its impressive storage density and long-term stability, stemming from its inherent biomolecular structure. However, developing this novel medium comes with its own set of challenges, particularly in addressing errors arising from storage and biological manipulations. These challenges are further conditioned by the structural constraints of DNA sequences and cost considerations. In response to these limitations, we have pioneered a novel compression scheme and a cutting-edge Multiple Description Coding (MDC) technique utilizing neural networks for DNA data storage. Our MDC method introduces an innovative approach to encoding data into DNA, specifically designed to withstand errors effectively. Notably, our new compression scheme overperforms classic image compression methods for DNA-data storage. Furthermore, our approach exhibits superiority over conventional MDC methods reliant on auto-encoders. Its distinctive strengths lie in its ability to bypass the need for extensive model training and its enhanced adaptability for fine-tuning redundancy levels. Experimental results demonstrate that our solution competes favorably with the latest DNA data storage methods in the field, offering superior compression rates and robust noise resilience.
Liver cancer is a significant global health concern, with its prevalence steadily rising over the years. The accurate detection and classification of liver cancer are pivotal for timely treatment and improved patient ...
详细信息
Liver cancer is a significant global health concern, with its prevalence steadily rising over the years. The accurate detection and classification of liver cancer are pivotal for timely treatment and improved patient outcomes. The most challenging tasks identified from the previous research studies are computational complexity, sensitive parameter setting, misdetection and misclassification. So, a deep learning-based optimization algorithm is proposed to detect and classify liver cancer. The image data are collected from the LiTS17 dataset, 3D-IRCADb dataset and Liver tumor CT dataset to preprocess the medical image data and the preprocessing provides consistency, quality and privacy. The Differential Convolutional neural Network (Differential CNN) model extracts the relevant features for improving the ability of a model to differentiate healthy and cancerous tissues. The features are classified into benign and malignant by using the classification model namely Kernel Extreme Learning Machine (KELM) model. The Differential Biogeography-Based Optimization Algorithm (DBBOA) algorithm fine-tunes the parameters to find near-optimal solutions. This tuning process is conducted during training the deep learning-based classification model. The experimental validation is conducted in terms of using significant performance evaluation measures and the comprehensive analysis provided a better classification accuracy of 98.72%, F1-score of 98.25%, specificity of 97.93%, sensitivity of 98.52%, AUCROC of 0.9872, precision of 98.89% and a computational time of 1.5 s for the proposed liver cancer detection and classification model. The comparative analysis showed that the proposed model achieved a superior outcome rather than other existing methods.
The efficient segmentation of medical image is of great significance for clinical diagnosis. Recently, TransUNet has achieved great success in medical image segmentation by effectively fusing Convolutional neural Netw...
详细信息
The efficient segmentation of medical image is of great significance for clinical diagnosis. Recently, TransUNet has achieved great success in medical image segmentation by effectively fusing Convolutional neural Networks (CNN) and Vision Transformer (ViT) to accomplish the extraction of local and global information. However, since TransUNet is designed as a stitching of CNN and ViT framework level, it has the following problems to be solved: 1) only local and relatively global spatial features of images are extracted;2) the direct introduction of ViT brings the disadvantages of not easy training and high computational overhead. Therefore, in this work, we propose Mixblock, a hybrid encoder that effectively fuses the superiority of CNN and ViT and is capable of extracting multidimensional high-level semantic information of images instead of being limited to local and global spatial features. Based on this, we design a UNet-like method MixUNet for medical image segmentation, which is a concise and efficient baseline network. Specifically, MixUNet is able to converge after less training without any pre-training, and its number of parameters and computation are only 3.17% and 4.99% of those of TransUNet. In addition, we creatively introduce frequency domain information on skip connection to eliminate the semantic ambiguity between the encoder and decoder, which provides a new perspective for medical image segmentation. Finally, we perform extensive experiments on three publicly available medical image datasets. Experimental results show that MixUNet has significant superiority in segmentation performance, model complexity, and robustness compared to state-of-the-art baseline methods.
Recently, implicit function-based approaches have advanced 3D human reconstruction from a single-view image. However, previous methods suffer from issues such as noisy artifacts, loss of geometric details, and broken ...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
Recently, implicit function-based approaches have advanced 3D human reconstruction from a single-view image. However, previous methods suffer from issues such as noisy artifacts, loss of geometric details, and broken limbs under the scenarios of challenging poses. To address these problems, a novel end-to-end deep neural network named ReGIR is proposed, which is a multi-level architecture combining the parametric model with implicit function. The architecture consists of a coarse level and a fine level, and for each level, normal maps and the signed distance function (SDF) are introduced to encode query points. Furthermore, the network is trained in a coarse-to-fine manner to enable robust human body reconstruction with geometric details. Our extensive qualitative and quantitative experiments demonstrate that ReGIR achieves competitive reconstruction results.
This paper presents a novel early-stage Alzheimer's dementia (AD) disease detection based on convolutional neural networks (CNNs). As it is widely used in detection and classification of AD disease, a time-frequen...
详细信息
ISBN:
(纸本)9789464593617;9798331519773
This paper presents a novel early-stage Alzheimer's dementia (AD) disease detection based on convolutional neural networks (CNNs). As it is widely used in detection and classification of AD disease, a time-frequency (TF) method has been proposed for AD detection. It has been described to address the problem of detecting early-stage AD by combining TF and CNN methods. The method is developed by utilizing the well-known structural similarity index measure (SSIM) to obtain discriminative features in each TF image. Experimental results demonstrate that the proposed method outperforms the early-stage AD detection using advanced signal decomposition algorithm that is intrinsic time-scale decomposition (ITD), and it achieves a notable improvement in terms of the detection success rates compared to AD detection from TF images of raw EEG signals.
In practical engineering applications, the working load of the rotor system is changing constantly, and the noise pollution of its working environment is serious, which leads to the performance degradation of traditio...
详细信息
In practical engineering applications, the working load of the rotor system is changing constantly, and the noise pollution of its working environment is serious, which leads to the performance degradation of traditional fault diagnosis methods. To solve the above problems, we present a novel rotor system fault diagnosis model based on parallel convolutional neural network architecture with attention mechanism (AMPCNN). The model uses convolution kernels of different sizes in parallel channels to process raw data, and based on late feature fusion, a more comprehensive feature map is obtained. Furthermore, the information sharing between the two channels is realized through the attention mechanism so that the effective features of one channel can be reflected in another channel. The performance of the model under variable working conditions is verified by the Machinery Fault Database (MAFAULDA), and the average accuracy is 99.58%. By dividing Gaussian white noise from -9 dB to 2 dB into 11 intervals and adding it to the public data of Wuhan University, the noise resistance performance is verified, and the proposed method can obtain 100% diagnosis accuracy even in the high noise condition. The above experiments show that in terms of load adaptability and noise immunity, the method has higher accuracy than traditional deep learning classification methods.
In recent times,an image enhancement approach,which learns the global transformation function using deep neural networks,has gained ***,many existing methods based on this approach have a limitation:their transformati...
详细信息
In recent times,an image enhancement approach,which learns the global transformation function using deep neural networks,has gained ***,many existing methods based on this approach have a limitation:their transformation functions are too simple to imitate complex colour transformations between low-quality images and manually retouched high-quality *** order to address this limitation,a simple yet effective approach for image enhancement is *** proposed algorithm based on the channel-wise intensity transformation is ***,this transformation is applied to the learnt embedding space instead of specific colour spaces and then return enhanced features to *** this end,the authors define the continuous intensity transformation(CIT)to describe the mapping between input and output intensities on the embedding ***,the enhancement network is developed,which produces multi-scale feature maps from input images,derives the set of transformation functions,and performs the CIT to obtain enhanced *** experiments on the MIT-Adobe 5K dataset demonstrate that the authors’approach improves the performance of conventional intensity transforms on colour space ***,the authors achieved a 3.8%improvement in peak signal-to-noise ratio,a 1.8%improvement in structual similarity index measure,and a 27.5%improvement in learned perceptual image patch ***,the authors’algorithm outperforms state-of-the-art alternatives on three image enhancement datasets:MIT-Adobe 5K,Low-Light,and Google HDRþ.
The space of symmetric positive definite (SPD) matrices, denoted as $P_{m}$ , plays a crucial role in various domains, including computer vision, medical imaging, and signalprocessing. Its significance lies in its ca...
详细信息
The space of symmetric positive definite (SPD) matrices, denoted as $P_{m}$ , plays a crucial role in various domains, including computer vision, medical imaging, and signalprocessing. Its significance lies in its capacity to represent the underlying structure in nonlinear data using its Riemannian geometry. Nevertheless, a notable gap exists in the absence of statistical distributions capable of characterizing the statistical properties of data within this space. This paper proposes a new Riemannian Generalized Gaussian distribution (RGGD) on that space. The major contributions of this paper are, first of all, providing the exact expression of the probability density function (PDF) of the RGGD model, as well as an exact expression of the normalizing factor. Furthermore, an estimation of parameters is given using the maximum likelihood of this distribution. The second contribution involves exploiting the second-order statistics of feature maps derived from the first layers of deep convolutional neural networks (DCNNs) through the RGGD stochastic model in an image classification framework. Experiments were carried out on four well-known datasets, and the results demonstrate the efficiency and competitiveness of the proposed model.
暂无评论