The skin acts as an important barrier between the body and the external environment, playing a vital role as an organ. The application of deep learning in the medical field to solve various health problems has generat...
详细信息
In this paper, the computation of graph Fourier transform centrality (GFTC) of complex network using graph filter is presented. For conventional computation method, it needs to use the non-sparse transform matrix of g...
详细信息
Existing deep learning-based point cloud denoising methods are generally trained in a supervised manner that requires clean data as ground-truth ***,in practice,it is not always feasible to obtain clean point *** this...
详细信息
Existing deep learning-based point cloud denoising methods are generally trained in a supervised manner that requires clean data as ground-truth ***,in practice,it is not always feasible to obtain clean point *** this paper,we introduce a novel unsupervised point cloud denoising method that eliminates the need to use clean point clouds as groundtruth labels during *** demonstrate that it is feasible for neural networks to only take noisy point clouds as input,and learn to approximate and restore their clean *** particular,we generate two noise levels for the original point clouds,requiring the second noise level to be twice the amount of the first noise *** this,we can deduce the relationship between the displacement information that recovers the clean surfaces across the two levels of noise,and thus learn the displacement of each noisy point in order to recover the corresponding clean *** experiments demonstrate that our method achieves outstanding denoising results across various datasets with synthetic and real-world noise,obtaining better performance than previous unsupervised methods and competitive performance to current supervised methods.
With the increasing popularity of smart portable electronic gadgets, voice-based online person verification systems have become prevalent. However, these systems are susceptible to attacks where illegitimate individua...
详细信息
With the increasing popularity of smart portable electronic gadgets, voice-based online person verification systems have become prevalent. However, these systems are susceptible to attacks where illegitimate individuals exploit the recorded voices of legitimate users, leading to false confirmations—spoofing attacks. To overcome this limitation, this article presents an innovative solution by combining speech and online handwritten signatures to mitigate the risks associated with spoofing attacks in voice-based authentication systems because a person has to be present in front of the system to produce an online handwritten signature. To accomplish this objective, this work proposes a novel bidirectional Legendre memory unit (BLMU), a type of recurrent neural network (RNN), for person authentication (verification) and recognition. The Legendre memory unit (LMU) is an innovative memory cell for RNNs that efficiently retains temporal/non-temporal sequential information over a long period with minimal resources. It achieves information orthogonalization by solving coupled ordinary differential equations (ODEs) and leveraging Legendre polynomials, ensuring effective data representation. The proposed framework for person authentication and recognition comprises seven convolution layers, four BLMU layers, two dense layers, and one output layer. The performance of the proposed BLMU-based deep learning framework has been evaluated on a self-generated/private dataset of combined feature matrix of voice signals and online handwritten signatures in the Devanagari script. To assess performance, experiments have also been conducted using various RNN architectures, such as LSTM, BLSTM, and ordinary differential equation recurrent neural network (ODE-RNN), to have a performance comparison with the proposed BLMU-based deep learning (DL) framework. The results demonstrate the superiority of the proposed BLMU-based DL framework in enhancing the accuracy of person verification systems,
Emotion detection from social media data plays a crucial role in studying societal emotions concerning different events, aiding in predicting the reactions of specific social groups. However, it is complex to automati...
详细信息
Voice, motion, and mimicry are naturalistic control modalities that have replaced text or display-driven control in human-computer communication (HCC). Specifically, the vocals contain a lot of knowledge, revealing de...
详细信息
Voice, motion, and mimicry are naturalistic control modalities that have replaced text or display-driven control in human-computer communication (HCC). Specifically, the vocals contain a lot of knowledge, revealing details about the speaker’s goals and desires, as well as their internal condition. Certain vocal characteristics reveal the speaker’s mood, intention, and motivation, while word study assists the speaker’s demand to be understood. Voice emotion recognition has become an essential component of modern HCC networks. Integrating findings from the various disciplines involved in identifying vocal emotions is also challenging. Many sound analysis techniques were developed in the past. Learning about the development of artificial intelligence (AI), and especially Deep Learning (DL) technology, research incorporating real data is becoming increasingly common these days. Thus, this research presents a novel selfish herd optimization-tuned long/short-term memory (SHO-LSTM) strategy to identify vocal emotions in human communication. The RAVDESS public dataset is used to train the suggested SHO-LSTM technique. Mel-frequency cepstral coefficient (MFCC) and wiener filter (WF) techniques are used, respectively, to remove noise and extract features from the data. LSTM and SHO are applied to the extracted data to optimize the LSTM network’s parameters for effective emotion recognition. Python Software was used to execute our proposed framework. In the finding assessment phase, Numerous metrics are used to evaluate the proposed model’s detection capability, Such as F1-score (95%), precision (95%), recall (96%), and accuracy (97%). The suggested approach is tested on a Python platform, and the SHO-LSTM’s outcomes are contrasted with those of other previously conducted research. Based on comparative assessments, our suggested approach outperforms the current approaches in vocal emotion recognition.
Background: The synthesis of reversible logic has gained prominence as a crucial research area, particularly in the context of post-CMOS computing devices, notably quantum computing. Objective: To implement the bitoni...
详细信息
This work proposes a novel and improved Butterfly Optimization Algorithm (BOA), known as LQBOA, to solve BOA’s inherent limitations. The LQBOA uses Lagrange interpolation and simple quadratic interpolation techniques...
详细信息
Plant diseases present a considerable threat to the farming industry, causing significant economic losses by reducing crop yields. The emergence of deep neural network models in the realm of computer vision has brough...
详细信息
Alzheimer’s Disease(AD)is a progressive neurological *** diagnosis of this illness using conventional methods is very *** Learning(DL)is one of the finest solutions for improving diagnostic procedures’performance an...
详细信息
Alzheimer’s Disease(AD)is a progressive neurological *** diagnosis of this illness using conventional methods is very *** Learning(DL)is one of the finest solutions for improving diagnostic procedures’performance and forecast *** disease’s widespread distribution and elevated mortality rate demonstrate its significance in the older-onset and younger-onset age *** light of research investigations,it is vital to consider age as one of the key criteria when choosing the *** younger subjects are more susceptible to the perishable side than the older *** proposed investigation concentrated on the younger *** research used deep learning models and neuroimages to diagnose and categorize the disease at its early stages *** proposed work is executed in three *** 3D input images must first undergo image pre-processing using Weiner filtering and Contrast Limited Adaptive Histogram Equalization(CLAHE)*** Transfer Learning(TL)models extract features,which are subsequently compressed using cascaded Auto Encoders(AE).The final phase entails using a Deep Neural Network(DNN)to classify the phases of *** model was trained and tested to classify the five stages of *** ensemble ResNet-18 and sparse autoencoder with DNN model achieved an accuracy of 98.54%.The method is compared to state-of-the-art approaches to validate its efficacy and performance.
暂无评论