CNN is inspired from Primary Visual (V1) neurons. It is a typical deep learning technique and can help teach machine how to see and identify objects. In the most recent decade, deep learning develops rapidly and has b...
详细信息
CNN is inspired from Primary Visual (V1) neurons. It is a typical deep learning technique and can help teach machine how to see and identify objects. In the most recent decade, deep learning develops rapidly and has been well used in various fields of expertise such as computer vision and natural language processing. As the representative algorithm of deep learning, Convolution Neural Network (CNN) has been regarded as a breakthrough of historic significance in imageprocessing and visual recognition tasks since the astonishing results achieved on imageNet Large Scale Visual Recognition Competition (ILSVRC) Unlike methods based on handcrafted features, CNN models can build high-level features from low-level ones in a data-driven fashion and have displayed great potential in medical image analysis among the aspects of segmentation of histological images identification, lesion detection, tissue classification, etc. This paper provides a review on CNN from the perspectives of its basic mechanism introduction, structure, typical architecture and main application in medical image analysis through analyzing over 100 references from Google Scholar, PubMed, Web of Science and various sources published from 1958 to 2020.
This paper presents a comprehensive examination of innovative strategies aimed at enhancing machinevision technology, particularly in the context of energy efficiency and processing speed, critical factors for applic...
详细信息
ISBN:
(数字)9798350376258
ISBN:
(纸本)9798350376265
This paper presents a comprehensive examination of innovative strategies aimed at enhancing machinevision technology, particularly in the context of energy efficiency and processing speed, critical factors for applications like facial recognition. The study focuses on three distinct approaches: an optimized two-dimensional convolution algorithm, a novel Field-Programmable Gate Array (FPGA) implementation, and advancements in multichannel meta-imagers. Firstly, the paper discusses an optimized algorithm for two-dimensional convolutions, a fundamental operation in machinevision. This advanced algorithm significantly reduces computational complexity. For instance, in executing a two-dimensional 3×3 cyclic convolution, the proposed method reduces the number of necessary multiplications from 81 to merely 13, offering a substantial improvement in efficiency. Secondly, the paper explores an innovative FPGA implementation of the two-dimensional convolution algorithm. This implementation is designed to minimize the use of shift registers, multipliers, and adders. As a result, it utilizes fewer Look-Up Tables (LUTs), leading to energy and time savings in executing the convolution process. The paper details the architecture of this FPGA-based approach and its implications for energy consumption and processing speed in machinevisionapplications. Finally, the paper introduces a novel technique called the Avg-Topk method, addressing a critical challenge in the pooling layer of convolutional neural networks. This method combines the benefits of average pooling with the advantages of max pooling, aiming to enhance the accuracy of the pooling layer without compromising on efficiency. The Avg-Topk method represents a significant step forward in optimizing the pooling process within machinevision systems. In summary, this paper delves into groundbreaking methods to improve the speed and energy efficiency of machinevision systems, offering valuable insights and potential solution
In high-energy physics, the capability to accurately and efficiently track charged particles is essential for effective data analysis. This article introduces an innovative density-based clustering pipeline intended f...
详细信息
ISBN:
(数字)9798350355413
ISBN:
(纸本)9798350355420
In high-energy physics, the capability to accurately and efficiently track charged particles is essential for effective data analysis. This article introduces an innovative density-based clustering pipeline intended for the track reconstruction task, incorporating Density-Based Spatial Clustering of applications with Noise (DBSCAN) algorithm and Ordering Points To Identify the Clustering Structure (OPTICS) algorithm. Results on simulated data suggest that the proposed method offers improvements in both effectiveness and robustness compared to traditional techniques, with performance on par with state-of-the-art neural network-based approaches. Furthermore, this pipeline demonstrates significant potential for real-time applications in high-energy physics experiments, offering a scalable and robust solution.
Automatic target tracking in emerging remote sensing video-generating tools based on microwave imaging technology and radars has been investigated in this paper. A moving target tracking system is proposed to be low c...
详细信息
Automatic target tracking in emerging remote sensing video-generating tools based on microwave imaging technology and radars has been investigated in this paper. A moving target tracking system is proposed to be low complexity and fast for implementation through edge nodes in a mini-satellite or drone network enabling machine intelligence into large-scale vision systems, in particular, for marine transportation systems. The system uses a group of imageprocessing tools for video pre-processing, and Kalman filtering to do the main task. For testing the system performance, two measures of accuracy and false alarms probability are computed for real vision data. Two types of scenes are analyzed including the scene with single target, and the scene with multiple targets that is more complicated for automatic target detection and tracking systems. The proposed system has achieved a high performance in our tests.
In this work, we propose Asynchronous Perception machine (APM), a computationally-efficient architecture for test-time-training (TTT). APM can process patches of an image one at a time in any order asymmetrically, and...
In the machinevision-based online monitoring of the flotation process, froth images acquired in real-time are subject to color distortion and excessive bright spots caused by inconsistent illumination, which hinders ...
详细信息
In the machinevision-based online monitoring of the flotation process, froth images acquired in real-time are subject to color distortion and excessive bright spots caused by inconsistent illumination, which hinders the effectiveness of image analysis and further online measurement for operating performance indicators. Current imageprocessing methods struggle to correct color distortion and remove excess bright spots in froth images simultaneously. Therefore, in this article, an illumination domain signal-guided unsupervised generative adversarial network (IDS-GUGAN) is proposed for illumination consistency processing of flotation froth images. First, considering the varying effects of inconsistent illumination on froth images, the illumination domain signal-guided image generation (IDS-GIG) mechanism based on the theory of unsupervised disentangled representation learning is designed to achieve adaptive correction of froth images with varying degrees of distortion. Moreover, a novel lightweight double-closed-loop network architecture is introduced to support unsupervised learning utilizing unpaired froth images and improve computational efficiency, which makes the proposed approach highly suitable for industrial applications. Comprehensive experiments on a real tungsten cleaner flotation process dataset and two public benchmark datasets related to image illumination processing tasks consistently endorse the superiority of IDS-GUGAN.
The main objective of this paper is to improve the automatic Myanmar captions by learning the contents of images using NASNetLarge and Bi-LSTM model. Describing the contents of an image is a complex task for machine w...
详细信息
In agriculture field, yield loss is a major problem due to attack of various insects in field crops. Traditional insect identification and classification methods are time-consuming and require entomologist experts. Ea...
详细信息
In agriculture field, yield loss is a major problem due to attack of various insects in field crops. Traditional insect identification and classification methods are time-consuming and require entomologist experts. Early information about the attack of insects helps farmers to control the crop damage to improve the productivity and reduce the use of pesticides. This research work focuses on the classification of crop insects by applying machinevision and knowledge-based techniques with imageprocessing by using different feature descriptors including texture, color, shape, histogram of oriented gradients (HOG) and global image descriptor (GIST). A combination of all these features was used in the classification of insects. In this research, several machine learning algorithms including both base classifiers and ensemble classifiers were applied for three different insect datasets and the performances of classification results were evaluated by majority voting. Naive bayes (NB), support vector machine (SVM), K-nearest-neighbor (KNN) and multi-layer perceptron (MLP) were used as base classifiers. Ensemble classifiers include random forest (RF), bagging and XGBoost were utilized;10-fold cross-validation test was conducted to achieve a better classification and identification of insects. The experimental results showed that the classification accuracy is improved by majority voting with ensemble classifiers in the combination of texture, color, shape, HOG and GIST features.
Precision agriculture (PA) represents the use of new technologies, specially computer vision, to increase agricultural productivity, where image segmentation plays a crucial role in several PA applications. This paper...
详细信息
ISBN:
(纸本)9781665422093
Precision agriculture (PA) represents the use of new technologies, specially computer vision, to increase agricultural productivity, where image segmentation plays a crucial role in several PA applications. This paper presents a machine learning algorithm for Avocado image segmentation based on quantum enhancement and Random forest. In order to show the performance of the proposed method in term of segmentation, which represents one of the most sensible computer vision technics to noise and low illumination images, a set of experimentations based on synthetic and real images devoted to agricultural applications (avocado fruit detection and localization) are done. The Segmentation accuracy (SA) and the mean intersection over Union (MIoU) metrics are adopted to evaluate its performance against other algorithms presented in the literature. The proposed method shows good results in terms of segmentation quality, sensibility to noise and low illumination conditions, outperforming the existing and widely used binarization methods.
This work is focused on investigating the impact of out-of-plane stitches on enhancing mode-ii interlaminar fracture toughness (or energy) and characterizing damage progression and crack arrestment in stitched resin-i...
详细信息
ISBN:
(数字)9781624107115
ISBN:
(纸本)9781624107115
This work is focused on investigating the impact of out-of-plane stitches on enhancing mode-ii interlaminar fracture toughness (or energy) and characterizing damage progression and crack arrestment in stitched resin-infused composites. For the experimental work, End-Notched Flexure (ENF) quasi-isotropic specimens were manufactured using +/- 45 non-crimp carbon-fiber fabrics through a resin-infusion process. Both stitched and unstitched specimen sets were designed for comparison. For a size effect study, the ENF specimens were geometrically scaled with three scaling levels. Based on the load-displacement data (i.e., global analysis), the fracture energy of the specimen material was analyzed using the compliance calibration method and a size effect theory. The fracture energy values were compared between the stitched and unstitched cases to characterize the enhanced fracture toughness of stitched composites. For local analysis, two types of digital image correlation (DIC) systems were employed: microscopic and macroscopic (i.e., coupon-scale) DIC systems. By analyzing in-plane displacement through the thickness, separation development was characterized along predicted fracture process zones. The impact of out-of-plane stitches on separation propagation along fracture process zones was discussed based on the DIC analysis. This work will contribute to developing a high-fidelity damage model for stitched resin-infused composites in the form of a traction-separation for high-speed aircraft applications.
暂无评论