Feature extractionplays an important role before pattern recognition takes place. The existing artificialneuralnetworks (ANNs), however, ignoreto learn and represent temporal information, instead of only utilizing s...
详细信息
Feature extractionplays an important role before pattern recognition takes place. The existing artificialneuralnetworks (ANNs), however, ignoreto learn and represent temporal information, instead of only utilizing spatial information for recognition. Moreover, the substantial computational and energy costs resulted from the conventional ANN-based classifiers, limit their uses in mobile and embedded applications. In this work, we develop a sparse temporal encoding method which exploits both spatial and temporal information. On the basis of spike-timing-dependent plasticity and multi-scale structure, the resulting temporal feature representation integrates with a temporal spiking neural network (SNN) classifier to achieve high efficiency of parallel computing for feature extraction. Experimental evaluation on four benchmark datasets from image classification and speech recognition tasks show the proposed SNN model yielding state-of-the-art accuracy.
In recent years, video streaming services have become increasingly popular. In general, the search function in a video sharing service site evaluates the relevance of a search query to the title, tags, description, an...
详细信息
ISBN:
(纸本)9781665462198
In recent years, video streaming services have become increasingly popular. In general, the search function in a video sharing service site evaluates the relevance of a search query to the title, tags, description, and so on given by the creator of the video. Then, the search results with the highest relevance are displayed. Therefore, if a title is given to a video that does not match its content, there is a possibility that a video with low relevance will be found. In this research, ( 1) we built a new system that retrieves animal videos that are relevant to its content using image recognition. (2) By describing the relationships between the concepts of animal families and species and incorporating them into the retrieval system, it is possible to retrieve animal videos by their family names. Adding retrieval by animal family name enabled us to find species that have not been learned. In this research, (3) we confirmed the usefulness of our video retrieval system using trained neuralnetworks, GoogLeNet and ResNet50, as animal species classifiers.
In smart city applications such as medical image analysis, autonomous driving, and security monitoring, image recognition faces challenges like complex backgrounds, low-quality images, and diverse targets, affecting a...
详细信息
ISBN:
(数字)9798331529246
ISBN:
(纸本)9798331529253
In smart city applications such as medical image analysis, autonomous driving, and security monitoring, image recognition faces challenges like complex backgrounds, low-quality images, and diverse targets, affecting accuracy and robustness. This study explores machine learning algorithms to enhance image recognition performance by addressing noise interference, intricate backgrounds, and feature extraction difficulties. It combines Convolutional neuralnetworks (CNN) with transfer learning, starting with data preprocessing to reduce noise, using pre-trained CNN models to extract high-level features, and fine-tuning with a ResNet transfer learning strategy for specific tasks. Additionally, ensemble learning methods are employed to further improve model robustness and accuracy. Experimental results show that the ensemble model maintains around 85% accuracy even with high background complexity, and transfer learning achieves 90% accuracy when the sample size reaches 1000. These findings demonstrate that transfer and ensemble learning effectively enhance image recognition accuracy and resilience in complex environments.
Melanoma is one of the most severe types of skin cancer, and early diagnosis is crucial to improving the chances of successful treatment. Deep learning models, in particular, have proven to be a highly promising appli...
详细信息
ISBN:
(数字)9798350369106
ISBN:
(纸本)9798350369113
Melanoma is one of the most severe types of skin cancer, and early diagnosis is crucial to improving the chances of successful treatment. Deep learning models, in particular, have proven to be a highly promising application of artificial intelligence in helping dermatologists diagnose melanoma early. By using these models, dermatological images can be analyzed with greater precision, making it easier to identify suspicious lesions and differentiate between benign and malignant ones. This study shows that more accurate segmentation and classification of skin lesions can be achieved by combining models like U-Net with preprocessing methods such as Autoencoder. This can lead to better melanoma detection and treatment. Additionally, we employed a hybrid CNN-quantum neural network model for classification, which achieved an accuracy of 99.67%, a precision of 99.35%, and a recall of 99.67%.
artificial intelligence has involved as main part to construct smart cities and applications for autonomous system. Traffic Sign recognition is one of the important applications that can be used by autonomous cars. Tr...
详细信息
ISBN:
(数字)9798350353624
ISBN:
(纸本)9798350353631
artificial intelligence has involved as main part to construct smart cities and applications for autonomous system. Traffic Sign recognition is one of the important applications that can be used by autonomous cars. Traffic Sign recognition provides the valuable information about road status to drive safety. Although traffic sign cannot be recognized accurately under low-illumination conditions and time of the day. This system proposes enhancement method based on the brightness value of the image to enhance the low-illumination images. The HSV thresholding method is used to identify traffic sing location from the input image and convolution neural network is proposed to recognized the traffic signs. The system produces good results under night time, cloudy, drizzle rain weather. Myanmar Traffic Sign of red, yellow and blue color traffic signs are used in this system.
Recent convolutional neural network (CNN) development continues to advance the state-of-the-art model accuracy for various applications. However, the enhanced accuracy comes at the cost of substantial memory bandwidth...
详细信息
ISBN:
(数字)9781665409964
ISBN:
(纸本)9781665409964
Recent convolutional neural network (CNN) development continues to advance the state-of-the-art model accuracy for various applications. However, the enhanced accuracy comes at the cost of substantial memory bandwidth and storage requirements and demanding computational resources. Although in the past the quantization methods have effectively reduced the deployment cost for edge devices, it suffers from significant information loss when processing the biased activations of contemporary CNNs. In this paper, we hence introduce an adaptive high-performance quantization method to resolve the issue of biased activation by dynamically adjusting the scaling and shifting factors based on the task loss. Our proposed method has been extensively evaluated on image classification models (ResNet-18/34/50, MobileNet-V2, EfficientNet-B0) with imageNet dataset and object detection model (YOLO-V4) with COCO dataset. The results show that our 4-bit integer (INT4) quantization models achieve better accuracy than the state-of-the-art 4-bit models, and in some cases, even surpass the golden full-precision models. The final designs have been successfully deployed onto extremely resource-constrained edge devices for many practical applications.
Global food security and agricultural sustainability are seriously threatened by the paddy leaf diseases that is rapidly spread. Four of the most common widespread disease that affect rice leaf are Hispa, brown spot, ...
详细信息
The proceedings contain 21 papers. The topics discussed include: exploring the generation of online content by users and its predictive influence on the relational quality. application to the Andalusian hotel sector;L...
The proceedings contain 21 papers. The topics discussed include: exploring the generation of online content by users and its predictive influence on the relational quality. application to the Andalusian hotel sector;LIVING-LANG: living digital entities by human language technologies;ESAN: automating medical scribing in Spanish;big hug: artificial intelligence for the protection of digital societies;an exploration of the semantic knowledge in vector models: polysemy, synonymy and idiomaticity;ALIADA: artificial intelligence-based language applications for the detection of aggressiveness in social networks;exploring gender bias in Spanish deep learning models;and a neural machine translation system for Galician from transliterated Portuguese text.
Spiking neuralnetworks (SNN) hold the promise of being a more biologically plausible, low-energy alternative to conventional artificialneuralnetworks. Their time-variant nature makes them particularly suitable for ...
详细信息
ISBN:
(数字)9798350390582
ISBN:
(纸本)9798350390599
Spiking neuralnetworks (SNN) hold the promise of being a more biologically plausible, low-energy alternative to conventional artificialneuralnetworks. Their time-variant nature makes them particularly suitable for processing time-resolved, sparse binary data. In this paper, we investigate the potential of leveraging SNNs for the detection of photon coincidences in positron emission tomography (PET) data. PET is a medical imaging technique based on injecting a patient with a radioactive tracer and detecting the emitted photons. One central post-processing task for inferring an image of the tracer distribution is the filtering of invalid hits occurring due to e.g. absorption or scattering processes. Our approach, coined PETNet, interprets the detector hits as a binary-valued spike train and learns to identify photon coincidence pairs in a supervised manner. We introduce a dedicated multi-objective loss function and demonstrate the effects of explicitly modeling the detector geometry on simulation data for two use-cases. Our results show that PETNet can outperform the state-of-the-art classical algorithm with a maximal coincidence detection F
1
of 95.2%. At the same time, PETNet is able to predict photon coincidences up to 36 times faster than the classical approach, highlighting the great potential of SNNs in particle physics applications.
Based on a research in 2002 (Ozkaynak & Ova, 2006), acrylamide substance is formed when excessive heat treatment (e.g. frying, grilling, baking) is applied to starch-containing products. This substance contains ca...
详细信息
ISBN:
(纸本)9789897585838
Based on a research in 2002 (Ozkaynak & Ova, 2006), acrylamide substance is formed when excessive heat treatment (e.g. frying, grilling, baking) is applied to starch-containing products. This substance contains carcinogenic and neurotoxicological risks for human health. The acrylamide levels are controlled by random laboratory sampling. This control processes which are executed by humans, cause a prolonged and error prone process. In this study, we offer a Convolutional neural Network (CNN) model, which provides acceptable precision and recall rates for detecting acrylamide in biscuit manufacturing process.
暂无评论