This paper proposes SroX, a novel approach for accelerating Hidden Weight creation in neuralnetworks, which is a critical component of neural network training that can significantly impact the overall efficiency of t...
详细信息
ISBN:
(数字)9798331519056
ISBN:
(纸本)9798331519063
This paper proposes SroX, a novel approach for accelerating Hidden Weight creation in neuralnetworks, which is a critical component of neural network training that can significantly impact the overall efficiency of the training process. SroX leverages spectral relaxation and X-Iteration to reduce the computational time required for Hidden Weight creation, enabling faster training times without sacrificing accuracy. Our approach addresses the long-standing challenge of efficient Hidden Weight creation, which has been a major bottleneck in neural network training. We demonstrate the efficacy of SroX through extensive experiments on benchmark datasets, including MNIST, CIFAR-10, and imageNet, where SroX outperforms existing methods, including Random Projection and Gradient-Based Optimization, in terms of training time and accuracy. The proposed approach provides a promising solution for large-scale neural network applications, where efficient training is crucial, and has the potential to revolutionize the field of neural network research and applications, driving innovation and breakthroughs in areas such as computer vision, natural language processing, and beyond. By providing a faster and more efficient way to create Hidden Weights, SroX can enable researchers and practitioners to tackle complex problems with unprecedented speed and efficiency, leading to significant advancements in the field of artificial intelligence.
The Multiply and Accumulate (MAC) unit is a fundamental component in Convolutional neural Network (CNN) accelerators, designed to enhance the efficiency and speed of deep learning applications such as image recognitio...
详细信息
ISBN:
(数字)9798350375442
ISBN:
(纸本)9798350375459
The Multiply and Accumulate (MAC) unit is a fundamental component in Convolutional neural Network (CNN) accelerators, designed to enhance the efficiency and speed of deep learning applications such as image recognition, natural language processing, and autonomous driving. These applications demand high computational power and low latency, which are critical for real-time processing and decision-making. Existing solutions often struggle with power consumption, area efficiency, and processing speed, hindering their performance in resource-constrained environments. The proposed CNN-MAC accelerator addresses these challenges by integrating a Modified Booth Multiplier (MBM) and an Error Correctable Carry Look Ahead Adder (EC-CLA). The MBM enhances multiplication efficiency by reducing the number of partial products, while the EC-CLA minimizes propagation delay and corrects errors during addition, ensuring high-speed and reliable computation. This combination significantly improves the overall performance and energy efficiency of CNN accelerators, making them suitable for deployment in various high-demand scenarios.
Automatic segmentation of medical images plays an important role in diagnosing various diseases using these images. image segmentation can be applied separately through many medical samples to make different analyses ...
详细信息
ISBN:
(纸本)9781665436496
Automatic segmentation of medical images plays an important role in diagnosing various diseases using these images. image segmentation can be applied separately through many medical samples to make different analyses and diagnoses. For example, by auto-segmentation cells in a cell culture, the amount, vitality, diameter or shape of the cells in the cell culture can be analysed separately. In addition, by automatic segmentation of blood vessels in a tissue, analyses such as the length and density of the vessels, and the radius of each vessel can be made separately. In this way, inferences such as early diagnosis of a disease and the type of the disease can be made from the examined image. In this study, using two different data sets consisting of various cell images and retinal images, the images in this data set were segmented separately. For automatic segmentation, a convolutional neural network model U-Net is used. The existing ground truth images and the images segmented using the U-Net network were compared. Percentage accuracy values obtained for both data sets, respectively;95.2 for retina data set and 97.25 for cell data set.
In this study, we propose a convolutional neural network-based method for unsupervised hyperspectral anomaly detection. We obtain within- and between-cluster pixel pairs from hyperspectral images using the cluster map...
详细信息
ISBN:
(纸本)9781665436496
In this study, we propose a convolutional neural network-based method for unsupervised hyperspectral anomaly detection. We obtain within- and between-cluster pixel pairs from hyperspectral images using the cluster maps obtained by an automatic clustering algorithm and constitute training data set by taking the differences of the pixel pairs. We design a multilayer convolutional neural network and train it using difference vectors. In prediction step, we apply a dual local window to hyperspectral image. For each pixel, we calculate the difference vectors between the center pixel and the surrounding pixels. By feeding the trained network with the difference vectors, we obtain prediction scores. In the last step, a weighted RX detector was obtained using prediction scores and used for anomaly detection. It has been observed that the experimental results conducted on four different real hyperspectral data show better results than the current CNN-based anomaly detector.
Spiking neural network (SNN) is promising but the development has fallen far behind conventional deep neuralnetworks (DNNs) because of difficult training. To resolve the training problem, we analyze the closed-form i...
详细信息
ISBN:
(纸本)9781577358664
Spiking neural network (SNN) is promising but the development has fallen far behind conventional deep neuralnetworks (DNNs) because of difficult training. To resolve the training problem, we analyze the closed-form input-output response of spiking neurons and use the response expression to build abstract SNN models for training. This avoids calculating membrane potential during training and makes the direct training of SNN as efficient as DNN. We show that the non-leaky integrate-and-fire neuron with single-spike temporalcoding is the best choice for direct-train deep SNNs. We develop an energy-efficient phase-domain signal processing circuit for the neuron and propose a direct-train deep SNN framework. Thanks to easy training, we train deep SNNs under weight quantizations to study their robustness over low-cost neuromorphic hardware. Experiments show that our direct-train deep SNNs have the highest CIFAR-10 classification accuracy among SNNs, achieve imageNet classification accuracy within 1% of the DNN of equivalent architecture, and are robust to weight quantization and noise perturbation.
This research work presents a novel face recognition system with customized face dataset. After completion of the dataset, this dataset will be fed to neural network (or) model and find how much efficient it is in det...
This research work presents a novel face recognition system with customized face dataset. After completion of the dataset, this dataset will be fed to neural network (or) model and find how much efficient it is in detecting the student. There are different types of neuralnetworks like ANN and its variants like CNN and RNN. For example, a person is affected by fever or cold etc. In that case, through biometric application disease the can be spread from one person to the other persons. To combat this situation, imageprocessing authentication technique is proposed. This research study develops a new html script for accessing the camera to capture images. The input image is then fed to neural network. Here, a customized CNN model is developed. After recognition, the embedding or features of the image will be compared with the dataset. The proposed model will act like a black box, where input must be provided and it will process the image data and provide the recognized output.
The Memory-based Reconfigurable Processor (MRP) is a memory-centric reconfigurable device that offers advantages in processing speed and cost compared to other reconfigurable devices like FPGAs, making it a promising ...
详细信息
ISBN:
(数字)9798350379051
ISBN:
(纸本)9798350379068
The Memory-based Reconfigurable Processor (MRP) is a memory-centric reconfigurable device that offers advantages in processing speed and cost compared to other reconfigurable devices like FPGAs, making it a promising alternative device for artificial Intelligence applications such as neuralnetworks (NNs). However, due to the unique structure of MRP, the method for implementing NNs on it has not yet been established. In our previous work [8], we have proposed a sparse neural network model named MNN (MRP neuralnetworks), specifically designed to align with the unique connection structure of MRP for neural network integration. This paper extends our research by proposing a binarization method for MNN referred to as the BMNN using XNOR-Net to facilitate practical NNs application (handwriting digits recognition) on MRP. To enhance the performance of BMNN, a novel implementation strategy Deep-BMNN is also proposed aimed at increasing both the layer depth and the neuron count of BMNN by folding the logic routing on the MRP device. Experimental results using MNIST dataset for handwritten digits recognition demonstrate that our proposed binary NNs for MRP can achieve comparable accuracy to those of fully connected NNs employing real numbers.
The growing need for efficient neural network inference in embedded systems has spurred the development of specialized hardware accelerators. This paper introduces the design and implementation of an FPGA-based neural...
详细信息
ISBN:
(数字)9798350377255
ISBN:
(纸本)9798350377262
The growing need for efficient neural network inference in embedded systems has spurred the development of specialized hardware accelerators. This paper introduces the design and implementation of an FPGA-based neuralprocessing Unit (NPU), optimized for high-performance neural network tasks. The NPU architecture integrates advanced features, including SIMD (Single Instruction, Multiple Data) support, an out-of-order execution system, and a six-stage pipeline, ensuring efficient processing. The NPU's RAM system manages both instruction and data storage, while 16-bit floating-point (FP16) computations balance precision with resource efficiency. On the software side, the NPU utilizes a custom instruction set, supported by a C language framework and a Python-based compiler. Detailed analyses of key modules, such as scalar and matrix operations, neural network inference, and RAM storage, are provided. Simulation tests, including handwritten digit recognition using the MNIST dataset, validate the NPU's functionality and performance, demonstrating its robust computational capabilities and efficiency. This study offers a novel and practical approach to achieving efficient neural network inference on FPGA, underscoring the NPU's superior performance in handling complex tasks.
Text matching is one of the research hotspots in Natural Language processing (NLP). The study of text matching is of great practical importance for applications such as text de-duplication, web retrieval, and question...
详细信息
Agriculture is vital to human civilization, providing food and economic benefits. However, crops and plant leaves are vulnerable to various illnesses that hinder growth. Early and accurate identification of pathogens ...
详细信息
ISBN:
(数字)9798331533311
ISBN:
(纸本)9798331533328
Agriculture is vital to human civilization, providing food and economic benefits. However, crops and plant leaves are vulnerable to various illnesses that hinder growth. Early and accurate identification of pathogens can prevent further damage, but current manual methods by farmers could be more efficient and precise, leading to potential crop destruction and reduced yields. Utilizing computerized imageprocessing techniques can minimize losses and boost productivity. images of affected leaves or crops can be used to identify and classify plant diseases using a variety of techniques. Significant advancements have been made, particularly with deep learning (DL) and machine learning (ML) techniques, including convolutional neuralnetworks (CNN), which are favored for their ability to process image information and understand spatial hierarchies. Selecting between traditional machine learning (ML) and deep learning (DL) approaches depends on the nature of the problem, the availability of data, and computational resources. For intricate tasks like image detection and classification, DL—particularly convolutional neuralnetworks (CNNs)—is preferred when sufficient data and processing capabilities are accessible.
暂无评论