As a substitute for recurrent neuralnetworks (RNNs), Deep Learning (DLs) has shown promise in text processing in the last several years. In this study, we looked at the recently announced capsule networks (CapsNets),...
详细信息
ISBN:
(数字)9798350361155
ISBN:
(纸本)9798350361162
As a substitute for recurrent neuralnetworks (RNNs), Deep Learning (DLs) has shown promise in text processing in the last several years. In this study, we looked at the recently announced capsule networks (CapsNets), which are gaining a lot of interest since they outperform Deep Learning (DLs) in image analysis, and we found that they function well for language categorization and sentiment analysis in some instances. Our experimental findings demonstrate that the suggested well-tuned CapsNets model may serve as a suitable (and, in some cases, superior and more cost-effective) alternative to sentence categorization models based on RNNs and DLs. We ran a battery of tests with randomly shuffled test data to see whether CapsNets could learn the sequence of words sequentially. Overall, our CapsNets model outperforms DL and RNN models in terms of classification performance and resilience to adversarial assaults.
Human Action Recognition (HAR) is crucial for monitoring elderly individuals living alone, ensuring timely assistance during distress. This paper presents a HAR framework leveraging imageprocessing and artificial int...
详细信息
ISBN:
(数字)9798331521691
ISBN:
(纸本)9798331521707
Human Action Recognition (HAR) is crucial for monitoring elderly individuals living alone, ensuring timely assistance during distress. This paper presents a HAR framework leveraging imageprocessing and artificial intelligence to enhance performance across key benchmarks, including accuracy, computation speed, memory efficiency, and practical usability. By utilizing skeletal data, recurrent neuralnetworks, and gait classification techniques, the approach achieves improved results on RGB video inputs. The proposed method reduces computation time significantly using Divide and Conquer, Sliding Window, and spatial aspects of Long Short-Term Memory (LSTM) architecture while maintaining low resource requirements for native device compatibility. Testing on the Fall Detection and NTU-RGB datasets demonstrates its effectiveness in handling real-time detection with reduced processing times compared to existing methods.
Fish is a very nutritious dish that is consumed worldwide as a complete meal. This causes a rise in fish production and storage. Freshness of fish that is stored in ice boxes and deep freezers can degrade very quickly...
详细信息
Feature correspondence is an important topic in many computer vision or robot vision tasks. Different from traditional optimization based matching method, in the last two years, researchers are finally able to solve t...
详细信息
ISBN:
(数字)9781665490627
ISBN:
(纸本)9781665490627
Feature correspondence is an important topic in many computer vision or robot vision tasks. Different from traditional optimization based matching method, in the last two years, researchers are finally able to solve the matching process in a learning manner. As a representative method, SuperGlue achieves superior performance in many real-world tasks, but it still has problems in dealing with outlier features. Targeting at the outlier problem, this paper improves SuperGlue by introducing a deep learning based feature correspondence method, which consists of the pruned attentional graph neural network and the improved matching layer for the outlier problem. Experiments on real world images validate the effectiveness of the proposed method.
This paper proposes an image enhancement and target detection model based on AI GC (artificial intelligence generated content). The model innovatively combines GAN and self-attention mechanism, and effectively improve...
详细信息
ISBN:
(数字)9798331533694
ISBN:
(纸本)9798331533700
This paper proposes an image enhancement and target detection model based on AI GC (artificial intelligence generated content). The model innovatively combines GAN and self-attention mechanism, and effectively improves the quality of aerial images through processing steps such as super-resolution, denoising and contrast enhancement. The generator performs image enhancement through a deep convolutional network, and the self-attention mechanism focuses on the key areas in the image to restore more details and textures. Experimental results show that compared with traditional image enhancement methods, the AIGC method shows superior performance in different scenarios. Specifically, after AIGC enhancement processing, the mean average precision (mAP) of target detection is improved by 18.5%, the recall rate is improved by 15.2 % , and the F1-score reaches 90.4 % . Furthermore, the results show that the proposed model is effective in complicated background and low light environment, and the PSNR and structure similarity (SSIM) are improved by 12 dB and 0.18. It is proved that the AIGC technique can greatly improve the performance of the VA V, and it is an effective way to solve the problem of the complicated scene.
image classification has undergone a revolution in recent years due to the high performance of new deep learning models. However, severe security issues may impact the performance of these systems. In particular, adve...
详细信息
ISBN:
(纸本)9783031065279;9783031065262
image classification has undergone a revolution in recent years due to the high performance of new deep learning models. However, severe security issues may impact the performance of these systems. In particular, adversarial attacks are based on modifying input images in a way that is imperceptible for human vision, so that deep learning image classifiers are deceived. This work proposes a new deep neural network model composed of an encoder and a Generative Adversarial Network (GAN). The former encodes a possibly malformed input image into a latent vector, while the latter generates a reconstructed image from the latent vector. Then the reconstructed image can be reliably classified because our model removes the deleterious effects of the attack. The experiments carried out were designed to test the proposed approach against the Fast Gradient Signed Method attack. The obtained results demonstrate the suitability of our approach in terms of an excellent balance between classification accuracy and computational cost.
The proceedings contain 57 papers. The special focus in this conference is on Emerging applications of Information Technology. The topics include: A Method of Genome Sequence Comparison Based on a New Form of Fuzzy Po...
ISBN:
(纸本)9789811951909
The proceedings contain 57 papers. The special focus in this conference is on Emerging applications of Information Technology. The topics include: A Method of Genome Sequence Comparison Based on a New Form of Fuzzy Polynucleotide Space;identification of Humans by Using Machine Learning Models on Gait Features;efficient Heart Disease Prediction Using Modified Hybrid Classifier;an Unstructured Mammogram Analysis for Feasible Classification and Detection of Breast Cancer Using a Convolutional Approach;Emotion Recognition from EEG Data Using Hybrid Deep Learning Approach;colon Cancer Prediction with Transfer Learning and K-Means Clustering;lightweight Authentication Protocol for E-Healthcare Systems Using Fuzzy Commitment Scheme;performance Analysis of Machine Learning Algorithms for Prediction of Cerebral Attack (Stroke);breast Cancer Detection Using Transfer Learning Techniques in Convolutional neuralnetworks;improving Mental Health Through Multimodal Emotion Detection from Speech and Text Data Using Long-Short Term Memory;a Machine Learning-Based Prediction Model for Fetal Health Assessment;A Smart System for Assessment of Mental Health Using Explainable AI Approach;binary Classification of Thyroid Using Comprehensive Set of Machine Learning Algorithms;Interpretability Approaches of Explainable AI in Analyzing Features for Lung Cancer Detection;impregnable Healthcare Ecosystem Using Blockchain and artificial Intelligence Approaches;RIWT Generative Feedback Residual Network for Secure Clinical Data Communication in Healthcare Unit;blockchain-Based Smart Integrated Healthcare System;Designing a Secure Robust Medical image Authentication Based on Watermarking Using the ED-DWT and Encryption;IoT-Based Secure Blockchain Framework for Patient Record Management Using MPRESENT Lightweight Block Cipher;hybridized Support Vector Machine and Adaboost Technique for Malaria Diagnosis;feather-Light Vessel Segregation Model.
Spiking neuralnetworks (SNNs) represent a promising approach to developing artificialneuralnetworks that are both energy-efficient and biologically plausible. However, applying SNNs to sequential tasks, such as tex...
ISBN:
(纸本)9798331314385
Spiking neuralnetworks (SNNs) represent a promising approach to developing artificialneuralnetworks that are both energy-efficient and biologically plausible. However, applying SNNs to sequential tasks, such as text classification and time-series forecasting, has been hindered by the challenge of creating an effective and hardware-friendly spike-form positional encoding (PE) strategy. Drawing inspiration from the central pattern generators (CPGs) in the human brain, which produce rhythmic patterned outputs without requiring rhythmic inputs, we propose a novel PE technique for SNNs, termed CPG-PE. We demonstrate that the commonly used sinusoidal PE is mathematically a specific solution to the membrane potential dynamics of a particular CPG. Moreover, extensive experiments across various domains, including time-series forecasting, natural language processing, and image classification, show that SNNs with CPG-PE outperform their conventional counterparts. Additionally, we perform analysis experiments to elucidate the mechanism through which SNNs encode positional information and to explore the function of CPGs in the human brain. This investigation may offer valuable insights into the fundamental principles of neural computation. Our code is available at https://***/microsoft/SeqSNN.
Facial Emotion Recognition (FER) is a current area of research in computer vision and machine learning. This study looks into how well Convolutional neuralnetworks (CNNs) can identify face expressions of human emotio...
详细信息
Nutrient deficiency has a noteworthy impact on agriculture which results in reduced plant quality in turn reduced crop yield. A plant can have Multiple deficiencies at the same time so there is need of suggesting an a...
Nutrient deficiency has a noteworthy impact on agriculture which results in reduced plant quality in turn reduced crop yield. A plant can have Multiple deficiencies at the same time so there is need of suggesting an appropriate Fertilizer considering all nutrient deficiencies in it. In this paper we suggested suitable fertilizer using random forest for detected nutrient deficiency from leaf images using different neuralnetworks. We trained our dataset using four different Convolution neuralnetworks and selected the network with highest accuracy and then the output of this network is sent to the trained model of Random Forest which suggests the suitable fertilizer. Since Potassium(K), Phosphorous(P) and Nitrogen(N) are three key nutrients required for Rice plant we considered the image dataset having leaves with N, P, K deficiencies.
暂无评论