Deep Learning as a Service (DLaaS) stands as a promising solution for cloud-based inference applications. In this setting, the cloud has a pre-learned model whereas the user has samples on which she wants to run the m...
详细信息
Deep Learning as a Service (DLaaS) stands as a promising solution for cloud-based inference applications. In this setting, the cloud has a pre-learned model whereas the user has samples on which she wants to run the model. The biggest concern with DLaaS is the user privacy if the input samples are sensitive data. We provide here an efficient privacy-preserving system by employing high-end technologies such as Fully Homomorphic Encryption (FHE), Convolutional neuralnetworks (CNNs) and Graphics processing Units (GPUs). FHE, with its widely-known feature of computing on encrypted data, empowers a wide range of privacy-concerned applications. This comes at high cost as it requires enormous computing power. In this article, we show how to accelerate the performance of running CNNs on encrypted data with GPUs. We evaluated two CNNs to classify homomorphically the MNIST and CIFAR-10 datasets. Our solution achieved sufficient security level (> 80 bit) and reasonable classification accuracy (99) and (77.55 percent) for MNIST and CIFAR-10, respectively. In terms of latency, we could classify an image in 5.16 seconds and 304.43 seconds for MNIST and CIFAR-10, respectively. Our system can also classify a batch of images (> 8,000) without extra overhead.
As a prevailing research in artificial intelligence, the application of computer vision is widely used in many fields which are closely related to people's livelihood, such as industrial automation, new retail ind...
详细信息
As a prevailing research in artificial intelligence, the application of computer vision is widely used in many fields which are closely related to people's livelihood, such as industrial automation, new retail industry, smart transportation and security monitoring. And the proposed face recognition method is a branch in the field of computer vision, it integrates neuralnetworks, biology, image signal processing, machine learning and other fields, which promote research and cross-development among different disciplines. Hence, this paper focuses on face recognition method by using convolutional neural network(CNN), and CNN has the property of "weight sharing", which has been widely popularized in image recognition, it can greatly simplify the work of large-scale network training. The experiments demonstrate that the proposed face recognition method is successful, and the accuracy of the proposed method can be as high as 98%.
The widespread availability of forged image software necessitates the integrity verification of digital images in industrial and medical applications. Because of image manipulation, detecting small tampering and dupli...
详细信息
The widespread availability of forged image software necessitates the integrity verification of digital images in industrial and medical applications. Because of image manipulation, detecting small tampering and duplicated forgery from digital radiography (gamma and x-ray) images has become a research challenge, Two essential approaches are proposed for forgery detection from digital radiography images. A precise forgery detection approach with pretrained deep convolution neuralnetworks (CNN) is conducted. Alexnet, Resnet-18 and VGG-19 are three pretrained networks for features extraction. artificialneural network (ANN) and multiclass support vector machine (MSVM) classifiers are applied for classifying the extracted features into authentic or forged. The second suggested approach depends on Haralick and Zoning extractors. These extracted features are trained and tested using the K-nearest neighbors (KNN) classifier. The suggested approaches are investigated using several manipulated industrial (gamma welding images) and medical (spine images) datasets images. Besides, these approaches are tested with several color benchmark dataset images. The results are verified using a variety of evaluation metrics. The approaches are validated through comparison with published work and high agreements are demonstrated. For digital radiography images, Alexnet pretrained network with MSVM, Resnet-18 pretrained network with ANN and Haralick extractor with KNN achieve the highest accuracy and assessment metrics. It is observed that the performance of pretrained CNN outperforms that of conventional classification algorithms in respect of accuracy with computational time. The developed approaches allow for the precise detection of forgery regions in x-ray and gamma radiographic images as well as digital images.& COPY;2023 Elsevier B.V. All rights reserved.
The main objective of this paper was to effectively interface object detection based on Convolution neuralnetworks (CNNs) with selective lossy image compression techniques to improve the efficiency of subsequent imag...
详细信息
ISBN:
(纸本)9781665416696
The main objective of this paper was to effectively interface object detection based on Convolution neuralnetworks (CNNs) with selective lossy image compression techniques to improve the efficiency of subsequent image operations and reduce the memory requirement for storing the images in autonomous applications of self-driving vehicles. Object detection and localization was performed using 2 state-of-the-art CNN based models from the Tensorflow 2.0 Object Detection API - Faster R-CNN ResNet152 V1 1024x1024 and CenterNet HourGlass104 1024x1024. Lossy image Compression centred around the most prominent detected object (which is preserved) is done through 3 techniques - K-Means Clustering (KM), Genetic Algorithm (GA), Discrete Cosine Transform (DCT). The compressed and preserved parts were recombined to produce the final image. Analysis of the results obtained from different models and compression techniques was carried out. It was found that DCT produced the best results on both the models.
Adversarial robustness of neuralnetworks is an increasingly important area of research, combining studies on computer vision models, large language models (LLMs), and others. With the release of JPEG AI — the first ...
详细信息
The rise of artificial intelligence (AI)-based image analysis has led to novel application possibilities in the field of solvent analytics. Using convolutional neuralnetworks (CNNs), better and more automated analysi...
详细信息
The rise of artificial intelligence (AI)-based image analysis has led to novel application possibilities in the field of solvent analytics. Using convolutional neuralnetworks (CNNs), better and more automated analysis of optically visible phenomena becomes feasible, broadening the spectrum of non-invasive measurements. These so-called smart sensors have attracted increasing attention in pharmaceutical and chemical process engineering;their additional sensor data enables more precise process control as additional process parameters can be monitored. This contribution presents an approach to analyzing single rising droplets to determine their physical properties;for example, geometrical parameters such as diameter, projection area and volume. Additionally, the rising velocity is determined, as well as the density and interfacial tension of the rising liquid droplet, determined from the force balance. Thus, a method was developed for analyzing liquid-liquid properties suitable for real-time applications. Here, the size range of the investigated droplet diameters lies between 0.68 mm and 7 mm with an accuracy for AI detecting droplets of +/- 4 mu m. The obtained densities lie between 0.822 kg.m (-3) for rising n-butanol droplets and 0.894 kg.m (-3) for toluene droplets. For the derived parameters, such as the interfacial tension estimation, all of the data points lie in a range from 12.75 mN.m (-1) to 15.25 mN.m (-1). The trueness of the investigated system thus is in a range from -1 to +0.4 mN.m (-1), with a precision of +/- 0.3 to +/- 0.6 mN.m (-1). For density estimation using our system, a standard deviation of 1.4 kg m 3 from the literature was determined. Using camera images in conjunction with image analysis improved by artificial intelligence algorithms, combined with using empirical mathematical formulas, this article contributes to the development of easily accessible, cheap sensors.
Multi-label text classification is a fundamental task in the field of natural language processing. Currently, there are issues in the Chinese multi-label text classification tasks, such as insufficient extraction of t...
详细信息
ISBN:
(数字)9798350360240
ISBN:
(纸本)9798350384161
Multi-label text classification is a fundamental task in the field of natural language processing. Currently, there are issues in the Chinese multi-label text classification tasks, such as insufficient extraction of text label features and a lack of learning about the correlations between labels. To address these challenges, we proposed a Multi-label Text Classification Model based on DistilBERT and Word-Label Probabilities (DBWL). The model consists of a metric space module that integrates word-label probabilities and a deep neural network module based on DistilBERT. Firstly, the trained Labeled Latent Dirichlet Allocation (Labeled-LDA) model is utilized to obtain the probabilities of words associated with labels, which are then used to calculate the text mapping vectors and map them into the same metric space as the labels. Similarity measurement is employed to extract features between text-label pairs and label-label pairs. Then, in the deep neural network module, DistilBERT is employed to jointly embed texts and labels, capturing rich semantic information about text labels, followed by feature extraction using TextCNN. Finally, the features extracted from both modules are weighted and fused for label prediction. Experimental results on two benchmark datasets demonstrate that the proposed model outperforms current mainstream multi-label classification models in terms of key performance metrics.
Trajectories can be regarded as time-series of coordinates, typically arising from motile objects. Methods for trajectory classification are particularly important to detect different movement patterns, while methods ...
详细信息
Deep neuralnetworks are being widely used in applications like object detection, image classification, etc. Having billions of parameters in the networks poses challenges in terms of storage requirements, and computa...
详细信息
ISBN:
(数字)9798331533229
ISBN:
(纸本)9798331533236
Deep neuralnetworks are being widely used in applications like object detection, image classification, etc. Having billions of parameters in the networks poses challenges in terms of storage requirements, and computational complexity. Exploiting sparsity in weights and input activations results in reduced memory and accelerated computations. This paper evaluates the performance of a sparse neural network inference on GPU that utilizes a clustering technique to convert the intermediate results into sparser representation during inference time with a sparse matrix storage format. The methodology utilizes Delta-Compressed Sparse Row sparse weight storage format for reduced memory requirements. The proposed approach has been implemented on NVIDIA Tesla T4 GPU, using medium-scale sparse DNN trained with MNIST dataset. Our implementation has achieved an average reduction of 4% in storage requirement and a maximum speedup of 1.92 × compared to baseline SNICIT implementation.
In approximate circuits, the functionality of the circuit is modified to improve non-functional parameters such as area, power, and delay. These circuits are targeted for error-resilient applications like image proces...
详细信息
暂无评论