The project addresses the challenge of accurately identifying blurred faces in computer vision and facial recognition. It introduces a novel framework that integrates deblurring techniques, utilizing point spread func...
详细信息
Deep learning models for computer visionapplications specifically and for machine learning generally are now the state of the art. The growth of size and complexity of neural networks has made them more and more reli...
详细信息
Deep learning models for computer visionapplications specifically and for machine learning generally are now the state of the art. The growth of size and complexity of neural networks has made them more and more reliable, yet in greater need of computational power and memory as is evident from the heavy reliance on graphical processing units and cloud computing for training them. As the complexity of deep neural networks increases, the need for fast processing neural networks in real-time embedded applications at the edge also increases and accelerating them using reconfigurable hardware suggests a solution. In this work, a convolutional neural network based on the inception net architecture is first optimized in software and then accelerated by taking advantage of field programmable gate array (FPGA) parallelism. Genetic algorithm augmented training is proposed and used on the neural network to produce an optimum model from the first training run without re-training iterations. Quantization of the network parameters is performed according to the weights of the network. The resulting neural network is then transformed into hardware by writing the register transfer level (RTL) code for FPGAs with exploitation of layer parallelism and a simple trial-and-error allocation of resources with the help of the roofline model. The approach is simple and easy to use as compared to many complex existing methods in literature and relies on trial and error to customize the FPGA design to the model needed to work on any computer vision or multimedia application deep learning model. Simulation and synthesis are performed. The results prove that the genetic algorithm reduces the number of back-propagation epochs in software and brings the network closer to the global optimum in terms of performance. Quantization to 16 bits also shows a reduction in network size by almost half with no performance drop. The synthesis of our design also shows that the Inception-based classifier is cap
Deep learning is a very powerful analytic tool to recognize the patterns in data to make appropriate predictions. It has tremendous potential in data analyses, particularly for cell biology domain, caused by the growi...
详细信息
Deep learning is a very powerful analytic tool to recognize the patterns in data to make appropriate predictions. It has tremendous potential in data analyses, particularly for cell biology domain, caused by the growing scale and inherent complexity of biological data. The core purpose of this research work is to design, implement, and calibrate an efficient deep convolutional neural network (DCNN) architecture in the context of binary-class classification problem. This diversified network is developed to precisely identify human induced pluripotent stem cell-derived endothelial cells (hiPSC-derived EC) based on photomicrograph. The proposed architecture is cerebrally developed with numerous convolutional modules, multiple kernel sizes, various pooling layers, activation functions and strides, nevertheless fewer trainable parameters to strengthen the network and enhance its performance. The proposed feature fusion framework is compared with the classifier fusion approach in terms of Matthews's correlation coefficient (MCC), training time, inference time, number of layers, number of parameters, graphics processing unit (GPU) memory utilization, and floating-point operations (FLOPS). Specifically, it achieves 94.6% sensitivity, 94.5% specificity, and 94.7% precision. Induced pluripotent stem cell (iPS) dataset is also introduced in this research work that has 16278 images which are labelled by three independent and experienced human experts of cell biology domain to facilitate future research. Experimental results show that the proposed framework offers an innovative and attainable algorithm for accelerating and systematizing the classification task along with saving time and effort.
The 'Smart Exercise Counter using Computer vision' is a groundbreaking system that blends cutting-edge computer vision technology with exercise monitoring. In an age where fitness and health are paramount, thi...
详细信息
The goal of functional error correction is to preserve neural network performance when stored network weights are corrupted by noise. To achieve this goal, a selective protection (SP) scheme was proposed to optimally ...
详细信息
Defect detection is a crucial quality control process in the manufacturing industry, aimed at identifying and classifying imperfections or anomalies in products before they reach customers. Traditional manual inspecti...
详细信息
In this demonstration paper, we present "e2evideo" a versatile Python package composed of domain-independent modules. These modules can be seamlessly customised to suit specialised tasks by modifying specifi...
详细信息
Sparse representation based on dictionary learning has been widely used in many applications over the past decade. In this article, a new method is proposed for removing noise from video images using sparse representa...
详细信息
The article proposes an approach to the formation of the trajectory of the spatial movement of a controlled object in a confined space using stationary vision systems. For its implementation, the following main steps ...
详细信息
ISBN:
(纸本)9781510667877;9781510667884
The article proposes an approach to the formation of the trajectory of the spatial movement of a controlled object in a confined space using stationary vision systems. For its implementation, the following main steps are used in the work: 1. Preprocessing of data generated by the machinevision system. The task includes multicriteria imageprocessing in order to minimize the noise component and determine the boundaries of objects. 2. An automated method for adaptive non-local separation of objects on borders, background and objects. 3. Execution of the task of adaptive nonlocal binarization. 4. Building a mask of stationary and current moving objects. 5. Formation of an equidistant displacement trajectory. 6. Checking the trajectory by moving in adjacent frames. 7. Prediction and remeasurement of the position of objects in the frame based on displacement vectors and correction of the object's movement trajectory. 7. Formation of a control team to move an object in a confined space using stationary vision systems. To test the effectiveness, studies were conducted on a set of test sequences. The studies were carried out on a group of cameras in the visible spectrum (1920x1080, RGB, 8 bits) covering the entire field of view. The adaptability of the application of the proposed approach in solving complex problems is showed.
We propose a computer vision architecture based on Hyperbolic networks, contrastive learning and knowledge distillation to detect unsafe behavior in energy production and oil & gas plants. Data scarcity poses a si...
详细信息
ISBN:
(纸本)9798350370058;9798350370164
We propose a computer vision architecture based on Hyperbolic networks, contrastive learning and knowledge distillation to detect unsafe behavior in energy production and oil & gas plants. Data scarcity poses a significant challenge to develop machine learning applications in industry. Indeed, the data may be incomplete, inconsistent, or biased, making it difficult to develop accurate and reliable models. Insufficient data during training phase has direct impact on the models' representation learning capabilities;with the aid of vision Transformers (ViTs), we are able to solve data crunch situations by learning efficient representations of the existing data. We harnessed the power of ViTs, as it incorporates more global information, leading to quantitatively stronger intermediate feature representations. Further, we approached the task with contrastive learning and obtained pairs of samples which are similar, to tackle the limited data availability in our industrial use case. The proposed approach by applying hyperbolic embeddings helps in extracting complex relationships in the data. Furthermore, the size of the model makes it suitable for devices with low computational capabilities such as unmanned robots.
暂无评论