The convolutional neural network is a subfield of artificialneuralnetworks and has made great achievements in various domains over the past decade. The technique has been widely applied including computer vision, na...
详细信息
Transferable adversarial examples highlight the vulnerability of deep neuralnetworks (DNNs) to imperceptible perturbations across various real-world applications. While there have been notable advancements in untarge...
详细信息
The strength of long short-term memory neuralnetworks (LSTMs) that have been applied is more located in handling sequences of variable length than in handling geometric variability of the image patterns. In this pape...
详细信息
The strength of long short-term memory neuralnetworks (LSTMs) that have been applied is more located in handling sequences of variable length than in handling geometric variability of the image patterns. In this paper, an end-to-end convolutional LSTM neural network is used to handle both geometric variation and sequence variability. The best results for LSTMs are often based on large-scale training of an ensemble of network instances. We show that high performances can be reached on a common benchmark set by using proper data augmentation for just five such networks using a proper coding scheme and a proper voting scheme. The networks have similar architectures (convolutional neural network (CNN): five layers, bidirectional LSTM (BiLSTM): three layers followed by a connectionist temporal classification (CTC) processing step). The approach assumes differently scaled input images and different feature map sizes. Three datasets are used: the standard benchmark RIMES dataset (French);a historical handwritten dataset KdK (Dutch);the standard benchmark George Washington (GW) dataset (English). Final performance obtained for the word-recognition test of RIMES was 96.6%, a clear improvement over other state-of-the-art approaches which did not use a pre-trained network. On the KdK and GW datasets, our approach also shows good results. The proposed approach is deployed in the Monk search engine for historical-handwriting collections.
With the rise of big data and artificial intelligence(AI),Convolutional neuralnetworks(CNNs) have become instrumental in numerous applications,from image and speech recognition to natural language ***,these networks&...
详细信息
With the rise of big data and artificial intelligence(AI),Convolutional neuralnetworks(CNNs) have become instrumental in numerous applications,from image and speech recognition to natural language ***,these networks' computational demands often exceed the capabilities of traditional processing units,leading to a search for more effective computing *** research aims to evaluate the potential of Field-Programmable Gate Array(FPGA) technology in accelerating CNN computations,considering FPGA's unique attributes such as reprogrammability,energy efficiency,and custom logic *** primary aim of this research is to compare the efficiency and performance of FPGA acceleration of CNNs with conventional processing units like CPUs and GPUs and to explore its potential for future AI *** research employs a mixed-methods approach,including an integrated literature review and comparative *** paper reviews state-of-the-art research on FPGA-accelerated CNNs,benchmark performance metrics of FPGA,CPU,and GPU platforms across various CNN models,and compare FPGA-based AI applications with other real-world AI *** findings suggest a significant potential for FPGA-accelerated CNNs,particularly in scenarios requiring real-time computation or power-limited ***,challenges persist in the areas of development complexity and limited on-chip *** work must focus on surmounting these barriers to unlock the full potential of FPGAaccelerated CNNs.
In this article, we investigate the performance bottleneck of existing deep learning (DL) systems and propose DLBooster to improve the running efficiency of deploying DL applications on GPU clusters. At its core, DLBo...
详细信息
In this article, we investigate the performance bottleneck of existing deep learning (DL) systems and propose DLBooster to improve the running efficiency of deploying DL applications on GPU clusters. At its core, DLBooster leverages two-level optimizations to boost the end-to-end DL workflow. On the one hand, DLBooster selectively offloads some key decoding workloads to FPGAs to provide high-performance online data preprocessing services to the computing engine. On the other hand, DLBooster reorganizes the computational workloads of training neuralnetworks with the backpropagation algorithm and schedules them according to their dependencies to improve the utilization of GPUs at runtime. Based on our experiments, we demonstrate that compared with baselines, DLBooster can improve the imageprocessing throughput by 1.4x - 2.5x and reduce the processing latency by 1/3 in several real-world DL applications and datasets. Moreover, DLBooster consumes less than 1 CPU core to manage FPGA devices at runtime, which is at least 90 percent less than the baselines in some cases. DLBooster shows its potential to accelerate DL workflows in the cloud.
Deep neuralnetworks have made significant advancements in accurately estimating scene flow using point clouds, which is vital for many applications like video analysis, action recognition, and navigation. The robustn...
详细信息
ISBN:
(数字)9798350372250
ISBN:
(纸本)9798350372267
Deep neuralnetworks have made significant advancements in accurately estimating scene flow using point clouds, which is vital for many applications like video analysis, action recognition, and navigation. The robustness of these techniques, however, remains a concern, particularly in the face of adversarial attacks that have been proven to deceive state-of-the-art deep neuralnetworks in many domains. Surprisingly, the robustness of scene flow networks against such attacks has not been thoroughly investigated. To address this problem, the proposed approach aims to bridge this gap by introducing adversarial white-box attacks specifically tailored for scene flow networks. Experimental results show that the generated adversarial examples obtain up to 33.7 relative degradation in average end-point error on the KITTI and FlyingThings3D datasets. The study also reveals the significant impact that attacks targeting point clouds in only one dimension or color channel have on average end-point error. Analyzing the success and failure of these attacks on the scene flow networks and their 2D optical flow network variants shows a higher vulnerability for the optical flow networks.
Imaging sensors with inbuilt processing capability are expected to form the backbone of low-latency and highly energy efficient artificial vision systems. A range of emerging atomically thin materials provide opportun...
详细信息
Imaging sensors with inbuilt processing capability are expected to form the backbone of low-latency and highly energy efficient artificial vision systems. A range of emerging atomically thin materials provide opportunities to exploit their electrical and optical properties for human vision and brain inspired functions. This work reports atomically thin nanosheets of beta-In2S3 which exhibit inherent persistent photoconductivity (PPC) under ultraviolet and visible wavelengths. This PPC effect enables beta-In2S3-based optoelectronic devices to optically mimic the dynamics of biological synapses. Based on the material characterizations, the PPC effect is attributed to the intrinsic defects in the synthesized beta-In2S3 nanosheet. Furthermore, the feasibility of adopting these atomically thin synaptic devices for optoelectronic neuromorphic hardware is demonstrated by implementing a convolutional neural network for image classification. As such, the demonstrated atomically thin nanosheets and optoelectronic synaptic devices provide a platform for scaling up complex vision-sensory neuralnetworks, which can find many promising applications for multispectral imaging and neuromorphic computation.
This paper exploits the origins of artificialneuralnetworks (ANN s) based on approximation theory, to refocus efforts on data transformation, and away from the exclusive attention on the development of more and more...
详细信息
ISBN:
(数字)9798350369250
ISBN:
(纸本)9798350369267
This paper exploits the origins of artificialneuralnetworks (ANN s) based on approximation theory, to refocus efforts on data transformation, and away from the exclusive attention on the development of more and more complex architectures. Machine learning (ML) networks efficiently accept raw input data and determine their own features for the training process. Features are chosen to optimize accuracy, but typically are not designed to be robust, leaving the network susceptible to noise. Thus, a case is made for the reintroduction of data transformation techniques, through an intelligent algorithm, to enhance the robustness of the ML networks. In a sense, the science and mathematics of AI algorithms must provide a path for the accurate, reliable, and rapid deployment of intelligent algorithms without an over-reliance of technological prowess such as inexpensive bandwidth and computing power. We developed an algorithm for image classification based on intelligent geometric transformations, which forms the input data for any neural network architecture of choice. Our results show that adopting intelligent data transformations for pre-processing network input leads, in comparison to a conventional raw image input convolutional neural network, to a more robust model to both random and adversarial noise, offering network designers enhanced control over the trade-off between accuracy and robustness.
This paper presents a new lossy image compression technique using Logic-based Weightless neuralnetworks, which underwrite two novel network architectures. The system endorses three processing phases, image optimizati...
详细信息
ISBN:
(纸本)9781665434430
This paper presents a new lossy image compression technique using Logic-based Weightless neuralnetworks, which underwrite two novel network architectures. The system endorses three processing phases, image optimization, inflation, and skimming. This research demonstrates an untraditional approach of auto-compression network guided by horizontal and vertical pixel intensity wavering trend. The performance of this new approach aligns with human's perception of singularities in a certain pattern. The potential of trend analysis in image compression incorporates with information storage techniques and knowledge accumulation. The weightless network models generate images underlying enough distinct features that preserve the originality of a particular pattern but give superior levels of compression.
The proceedings contain 72 papers. The special focus in this conference is on Multi-disciplinary Trends in artificial Intelligence. The topics include: Hybrid Model Using Interacted-ARIMA and ANN Models for ...
ISBN:
(纸本)9783031364013
The proceedings contain 72 papers. The special focus in this conference is on Multi-disciplinary Trends in artificial Intelligence. The topics include: Hybrid Model Using Interacted-ARIMA and ANN Models for Efficient Forecasting;demand and Price Forecasting Using Deep Learning Algorithms;conversational artificial Intelligence in Digital Healthcare: A Bibliometric Analysis;Statistical Analysis of the Monthly Costs of OPEC Crude Oil Using Machine Learning Models;Redefining the World of Medical imageprocessing with AI – Automatic Clinical Report Generation to Support Doctors;sign Language Interpretation Using Deep Learning;interpretable Chronic Kidney Disease Risk Prediction from Clinical Data Using Machine Learning;traffic Prediction in Indian Cities from Twitter Data Using Deep Learning and Word Embedding Models;iAOI: An Eye Movement Based Deep Learning Model to Identify Areas of Interest;an Ensemble Technique to Detect Stress in Young Professional;automatic Differentiation Using Dual Numbers - Use Case;a Hybrid Intelligent Cryptography Algorithm for Distributed Big Data Storage in Cloud Computing Security;Low Light image Illumination Adjustment Using Fusion of MIRNet and Deep Illumination Curves;Multi-dimensional STAQR Indexing Algorithm for Drone applications;eduKrishnaa: A Career Guidance Web Application Based on Multi-intelligence Using Multiclass Classification Algorithm;iSTIMULI: Prescriptive Stimulus Design for Eye Movement Analysis of Patients with Parkinson’s Disease;AI Based Employee Attrition Prediction Tool;rescheduling Exams Within the Announced Tenure Using Reinforcement Learning;a Yolo-Based Deep Learning Approach for Vehicle Class Classification;pixel Value Prediction Task: Performance Comparison of Multi-Layer Perceptron and Radial Basis Function neural Network;sentiment Analysis of Twitter Data on ‘The Agnipath Yojana’;how Good are Transformers in Reordering?;LPCD: Incremental Approach for Dynamic networks.
暂无评论