Spiking neuralnetworks (SNNs) exhibit superior energy efficiency and fault tolerance compared to artificialneuralnetworks (ANNs). Due to their spike-based nature, input signals must be encoded into spike trains. Ho...
详细信息
ISBN:
(数字)9798331530723
ISBN:
(纸本)9798331530730
Spiking neuralnetworks (SNNs) exhibit superior energy efficiency and fault tolerance compared to artificialneuralnetworks (ANNs). Due to their spike-based nature, input signals must be encoded into spike trains. However, traditional spiking coding schemes often encounter a trade-off between energy efficiency and fault tolerance. To overcome this limitation, we propose a novel coding scheme, Spiking Sparse Coding (SSC), which achieves both high sparsity and enhanced fault tolerance. SSC not only improves network performance but also reduces the number of spikes by 93.69%, resulting in a 76.47% increase in energy efficiency. Additionally, SSC enhances network fault tolerance, with accuracy improvements of up to 12.06% under identical noise interference conditions. We deployed the network on the ZCU102 FPGA platform, operating at a frequency of 200 MHz with a power consumption of 409 mW. On the MNIST dataset, we achieved a classification accuracy of 95.84%. The training speed and energy consumption were 12656 frames per second and 0.032 mJ per image, the inference speed and energy consumption were 19279 frames per second and 0.021 mJ per image.
Facial analysis has emerged as a prominent area of research with diverse applications, including cosmetic surgery programs, the beauty industry, photography, and entertainment. Manipulating patient images often necess...
Facial analysis has emerged as a prominent area of research with diverse applications, including cosmetic surgery programs, the beauty industry, photography, and entertainment. Manipulating patient images often necessitates professional imageprocessing software. This study contributes by providing a model that facilitates the detection of blemishes and skin lesions on facial images through a convolutional neural network and machine learning approach. The proposed method offers advantages such as simple architecture, speed and suitability for imageprocessing while avoiding the complexities associated with traditional methods. The model comprises four main steps: area selection, scanning the chosen region, lesion diagnosis, and marking the identified lesion. Raw data for this research were collected from a reputable clinic in Tehran specializing in skincare and beauty services. The dataset includes administrative information, clinical data, and facial and profile images. A total of 2300 patient images were extracted from this raw data. A software tool was developed to crop and label lesions, with input from two treatment experts. In the lesion preparation phase, the selected area was standardized to $50\times 50$ pixels. Subsequently, a convolutional neural network model was employed for lesion labeling. The classification model demonstrated high accuracy, with a measure of 0.98 for healthy skin and 0.97 for lesioned skin specificity. Internal validation involved performance indicators and cross-validation, while external validation compared the model’s performance indicators with those of the transfer learning method using the Vgg16 deep network model. Compared to existing studies, the results of this research showcase the efficacy and desirability of the proposed model and methodology.
The proceedings contain 37 papers. The special focus in this conference is on artificial Intelligence and its applications. The topics include: The Hybrid Cardiac Risk Assessment and Prediction Model Using Convolution...
ISBN:
(纸本)9783031843969
The proceedings contain 37 papers. The special focus in this conference is on artificial Intelligence and its applications. The topics include: The Hybrid Cardiac Risk Assessment and Prediction Model Using Convolutional neuralnetworks;deep Learning applications for Malaria Detection and Diagnosis: A Review;Environmental Considerations in the Ethics of AI Adoption in Healthcare: Striving for Sustainable and Responsible Practices;harnessing the Power of Cognitive Computing: Assessing Point-of-Care Decision Support Tools in Oncology Practice;Critical Analysis in Use of AI in Health Care Management;classification and Prediction of Spinal Tuberculosis Disease Using Optimization of Convolution neural Network Using Spatial and Temporal Constraints;artificial Intelligence for Remote Healthcare in Underserved Areas: Enhancing Access and Quality of Healthcare Delivery;machine Learning Based Skin Cancer Detection and Recognitions Techniques in IoT Environment;validation of a Chronic Kidney Disease Prediction System Using Machine Learning Techniques;real-Time Feedback Detection Using Emotion Detection and Facial Recognition;revolutionizing Vision Tasks: Unlocking Potential Through Patch-Based Approaches;automated Knee Implant Identification from 2D Templates Using imageprocessing and artificial Intelligence – An Experimental Approach;crop Analysis and Classification Based on Phenotype Using Ensemble Learning;Classification and Identification with Health Benefit Assessment and Nutrient Profile of Brewed Tea Utilizing Computer Vision with ML and DL and Sensory Approaches;enhancing Industrial Automation Flexibility Through neural Network-Empowered Machine Vision applications.
To quickly evaluate soybean quality, we proposed a deep learning-based method for online classification of soybean seeds. Firstly, images of soybean seeds with uneven illumination were segmented based on the multiscal...
详细信息
To quickly evaluate soybean quality, we proposed a deep learning-based method for online classification of soybean seeds. Firstly, images of soybean seeds with uneven illumination were segmented based on the multiscale Retinex with color restoration (MSRCR). Then, a convolutional neural network (CNN) was constructed to achieve soybean seed four-classification with appropriate parameters. The F-score of the normal, damaged, abnormal, and non-classifiable soybeans reached about 95.97%, 97.41%, 97.25%, and 96.14%, respectively. Finally, the method was successfully applied in NVIDIA Jetson TX2 with an accuracy of 95.63% and an average classification time of 4.92 ms for a soybean seed, which can meet the requirement of online soybean quality assessment.
This paper focuses on the hardware acceleration of an AI-ISP (image signal processing) neural network, namely CSANet, which computes images from raw data to RGB format. To design a flexible, high-performance, and low-...
详细信息
ISBN:
(数字)9798350360349
ISBN:
(纸本)9798350360356
This paper focuses on the hardware acceleration of an AI-ISP (image signal processing) neural network, namely CSANet, which computes images from raw data to RGB format. To design a flexible, high-performance, and low-power AI-ISP accelerator on FPGA, we apply the RISC-VISA extension with the CFU-Playground framework specified for tiny-ML. The proposed RISC-V custom instructions support normal convolution, transposed convolution, depth-wise convolution, and partial element-wise operations, which can accelerate the computation hotspots of the CSANet. Applying with the FPGA implementation, on the Digilent Nexys Video development board (Artix-7 XC7A200T), the proposed CSANet accelerator achieves a
$79.7\times$
speedup and demonstrates a
$27.8\times$
improvement in energy efficiency on average compared to RISC-V CPU-only.
Convolutional neuralnetworks (CNNs) have received widespread attention due to their powerful modeling capabilities and have been successfully applied in natural language processing, image recognition, and other field...
详细信息
Convolutional neuralnetworks (CNNs) have received widespread attention due to their powerful modeling capabilities and have been successfully applied in natural language processing, image recognition, and other fields. On the other hand, traditional CNN can only deal with Euclidean spatial data. In contrast, many real-life scenarios, such as transportation networks, social networks, reference networks, and so on, exist in graph data. The creation of graph convolution operators and graph pooling is at the heart of migrating CNN to graph data analysis and processing. With the advancement of the Internet and technology, graph convolution network (GCN), as an innovative technology in artificial intelligence (AI), has received more and more attention. GCN has been widely used in different fields such as imageprocessing, intelligent recommender system, knowledge-based graph, and other areas due to their excellent characteristics in processing non-European spatial data. At the same time, communication networks have also embraced AI technology in recent years, and AI serves as the brain of the future network and realizes the comprehensive intelligence of the future grid. Many complex communication network problems can be abstracted as graph-based optimization problems and solved by GCN, thus overcoming the limitations of traditional methods. This survey briefly describes the definition of graph-based machine learning, introduces different types of graph networks, summarizes the application of GCN in various research fields, analyzes the research status, and gives the future research direction.
artificial intelligence (AI) research for medical applications has expanded quickly. Advancements in computer processing now allow for the development of complex neural network architectures (eg, convolutional neural ...
详细信息
Recently, brain-inspired spiking neuralnetworks (SNNs) have demonstrated tremendous improvement in energy efficiency (EE) and low power by exploiting highly sparse spikes and event-driven design [1-2]. (top of Fig. 3...
The proceedings contain 61 papers. The special focus in this conference is on Soft Computing Models in Industrial and Environmental applications. The topics include: Missing Values Imputation for Visualizing the&...
ISBN:
(纸本)9783031425288
The proceedings contain 61 papers. The special focus in this conference is on Soft Computing Models in Industrial and Environmental applications. The topics include: Missing Values Imputation for Visualizing the Air Quality Evolution During the COVID-19 Pandemic in Madrid;Comparative Study of Wastewater Treatment Plant Feature Selection for COD Prediction;effectiveness of Quantum Computing in imageprocessing for Burr Detection;hyperspectral Technology for Oil Spills Detection by Using artificialneural Network Classifier;Model-Based Design of the IMO-NMPC Strategy: Real-Time Implementation;first Approach of an Intelligent Automatic System for Aircraft Flap/Slat Positioning;multi-scale neural Model for Tool-Narayanaswamy-Moynihan Model Parameter Extraction;comparative Study of Open Source Database Management Systems to Enable Predictive Maintenance of Autonomous Guided Vehicles;deep Learning and Metaheuristic for Multivariate Time-Series Forecasting;forecasting Greenhouse Temperature Using Machine Learning Models: Optimizing Crop Production in Andalucia;extended Rank-Based Ant Colony Optimization Algorithm for Traveling Salesman Problem;Managing Pandemics Through Agent-Based Simulation: A Case Study Based on COVID-19;leveraging Smart Meter Data for Adaptive Consumer Profiling;Categorization of CoAP DoS Attack Based on One-Class Boundary Methods;text Classification for Automatic Distribution of Review Notes in Movie Production;fault Detection in Biological Methanation Process Using Machine Learning: A Comparative Study of Different Algorithms;machine Learning Approaches for Predicting Tree Growth Trends Based on Basal Area Increment;integrated Forecast and Optimization for Retailer Allocation in a Two-Echelon Inventory System;application of Fuzzy Logic to the Risk Assessment of Production Machines Failures;Neuron Characterization in Complex Cultures Using a Combined YOLO and U-Net Segmentation Approach;neuroevolutionary Transfer Learning for Time Series Forecasting;f
This This study proposes a new recommendation system model that integrates multimodal data, large language models (LLM), and neural matrix factorization techniques. By using Vision Transformer (ViT) and the BERT model...
详细信息
ISBN:
(数字)9798350386943
ISBN:
(纸本)9798350386950
This This study proposes a new recommendation system model that integrates multimodal data, large language models (LLM), and neural matrix factorization techniques. By using Vision Transformer (ViT) and the BERT model, we deeply mine the image and text information of users and products, and combine unstructured data with neural network technology for evaluation prediction and recommendation, showing excellent prediction performance. This study also verified the performance advantages of this method compared with traditional recommendation system methods through various ablation experiments. The results highlight the importance of text data in capturing user preferences and improving recommendation accuracy. In addition, the study found that in practical applications, more detailed segmentation processing of visual data is needed to accurately evaluate the specific contribution of visual data to multi-modal recommendation systems. These findings provide important theoretical and practical guidance for further optimizing recommendation systems.
暂无评论