The advancement in deep learning is increasing day-by-day from image classification to language understanding tasks. In particular, the convolution neural networks are revived and shown their performance in multiple f...
详细信息
The advancement in deep learning is increasing day-by-day from image classification to language understanding tasks. In particular, the convolution neural networks are revived and shown their performance in multiple fields such as natural language understanding, signal processing, and computer vision. The property of translational invariance for convolutions has made a huge advantage in the field of computer vision to extract feature invariances appropriately. When these convolutions trained using back-propagation tend to prove their results ability to outperform existing machinevision techniques by overcoming the various hand-engineered machinevision models. Hence, a clear understanding of current deep learning methods is crucial. These convolution neural networks have proven to show their performance by attaining state-of-the-art performance in computer vision over years when applied on humongous data. Hence in this survey, we detail a set of state-of-the-art models in image classification evolved from the birth of convolutions to present ongoing research. Each state-of-the-art model evolved in the successive year is illustrated with architecture schema, implementation details, parametric tuning and their performance. It is observed that the neural architecture construction i.e. a supervised approach for an image classification problem is evolved as data construction with cautious augmentations i.e., a self-supervised approach. A detailed evolution from neural architecture construction to augmentation construction is illustrated by provided appropriate suggestions to improve the performance. Additionally, the implementation details and the appropriate source for the execution and reproducibility of results are tabulated.
Intelligent manufacturing raises higher requirements for tool condition monitoring (TCM) in terms of accuracy, robustness, and adaptability. At present, direct methods based on imageprocessing and deep learning have ...
详细信息
Intelligent manufacturing raises higher requirements for tool condition monitoring (TCM) in terms of accuracy, robustness, and adaptability. At present, direct methods based on imageprocessing and deep learning have made breakthroughs in TCM. However, some issues, such as image quality, model parameters, and dataset scale in the abovementioned methods, restrict industrial applications of TCM. Regarding the abovementioned issue, the purpose of this article is to propose a lightweight network model based on multiple activation functions to promote the intelligent industrial application of TCM. First, the image quality mechanism caused by complex working conditions is analyzed in industrial environments. Correspondingly, data augmentation is adopted to solve the problem of data scale under the premise of ensuring data quality and richness. Then, the adaptive activation function and the hard version of swish are introduced at the front and second half of the network to avoid information loss and reduce the activation function cost. Finally, a lightweight network based on cloud-edge collaboration for TCM is constructed. The model is iteratively optimized in the cloud and inferenced on the edge embedded device. The accuracy and adaptability of the proposed network are verified by accelerating milling cutter life under multiple working conditions.
Lung diseases pose a significant threat to public health worldwide,resulting in a substantial number of *** such as chronic obstructive pulmonary disease and lung cancer constitute two of the three deadliest diseases ...
详细信息
Lung diseases pose a significant threat to public health worldwide,resulting in a substantial number of *** such as chronic obstructive pulmonary disease and lung cancer constitute two of the three deadliest diseases worldwide,contributing to over 3 million deaths *** study offered a comparative analysis of different diagnostic techniques used for lung pathologies from an engineering *** review concentrated on intelligent detection methods,including electronic nose,computer vision(CV),or imageprocessing,and biosensors such as graphene-field effect transistor(FET).The E-nose-based detection technique uses electronic sensors to recognize volatile organic compounds(VOCs)in the exhaled *** VOCs can aid in the diagnosis of lung pathologies such as *** CV processing method involves the application of advanced imaging techniques and machine learning algorithms to scrutinize and diagnose lung pathologies and ventilatorassociated pneumonia(VAP).Lastly,biosensors employ the exceptional properties of these materials to identify specific biomarkers in biological *** information can be used to diagnose lung pathologies and *** study examined the current state-of-the-art methods and offers a comprehensive analysis of their advantages and disadvantages from an engineering *** study underscored the potential of these techniques to enhance the diagnosis of lung pathologies and VAP and presents the advances in the field of smart biomedical ***,it emphasized the necessity for further research to optimize their performance and clinical usefulness.
This paper presents the use of vision-based methods for cutting tool motion registration and modal analysis. Motion of three illustrative tools were recorded using low- and high-speed cameras with sufficiently high re...
详细信息
This paper presents the use of vision-based methods for cutting tool motion registration and modal analysis. Motion of three illustrative tools were recorded using low- and high-speed cameras with sufficiently high resolutions. The tool's own features are used to register motion. Pixels within images from recordings of the vibrating tools are treated as non-contact motion sensors. Comparative analysis of three different methods of motion registration are presented to evaluate their suitability for the application of interest. These include variants of expanded edge detection and tracking schemes, expanded optical flow-based schemes, and established digital image correlation methods. Performance of different methods was observed to be governed by the tool's own features, illumination conditions, noise, and the image acquisition parameters. Extracted motion was benchmarked against twice integrated measured tool point accelerations, and motion was generally observed to compare well. Modal parameters extracted from vision-based measurements were also observed to agree with those extracted using more traditional experimental modal analysis procedures using a contact type accelerometer as the transducer. Since methods presented are generalized, they can suitably be adapted for other applications of interest. (C) 2020 CIRP.
The human brain serves as the principal controller of the humanoid system. Brain tumors are the result of abnormal cell division and proliferation, and the development of these tumors can result in brain cancer. The u...
详细信息
We report on a novel structure of a receptive field (RF)-based multi-layer all-optical neural network using a micropillar laser with a saturable absorber (SA) for imageprocessing. From the perspective of biological v...
详细信息
We report on a novel structure of a receptive field (RF)-based multi-layer all-optical neural network using a micropillar laser with a saturable absorber (SA) for imageprocessing. From the perspective of biological vision, the realization of imageprocessing based on the RF provides the biological rationality for the machinevision implemented by the spiking neural network (SNN). By exploiting the fast physical mechanisms of gain and absorption in the SA laser, the photonic spike-timing-dependent plasticity (STDP) curves are achieved to train the weights. Here, the source image pixels are mapped into the temporal information of spike trains injected into the neural network through the temporal coding method called time-to-first-spike encoding. Different source images are processed and tested by the proposed photonic SNN. Simulation results show that our proposed system can process not only simple binary images but also complex color images under the adjustment of STDP rules. When considering the robustness, we demonstrate the tolerance of the image segmentation to the time jitter. These results indicate that our proposed photonic SNN can achieve high-resolution processing of complex source images. Additionally, the time-multiplexing technique can be further adopted to simplify the RF structure, which is expected to reduce the complexity of the whole system, thus facilitating physical applications. Our work offers the prospect for a high-speed photonic spiking platform for imageprocessing.
This article describes chemical and physical parameters, including their role in the storage, trade, and processing of potatoes, as well as their nutritional properties and health benefits resulting from their consump...
详细信息
This article describes chemical and physical parameters, including their role in the storage, trade, and processing of potatoes, as well as their nutritional properties and health benefits resulting from their consumption. An analysis of the share of losses occurring during the production process is presented. The methods and applications used in recent years to estimate the physical and chemical parameters of potatoes during their storage and processing, which determine the quality of potatoes, are presented. The potential of the technologies used to classify the quality of potatoes, mechanical and ultrasonic, and imageprocessing and analysis using vision systems, as well as their use in applications with artificial intelligence, are discussed.
Currently, deep learning neural networks have shown remarkable performance in the field of image classification. However, existing classification networks are typically applied within closed environments, where all co...
详细信息
ISBN:
(数字)9798350355413
ISBN:
(纸本)9798350355420
Currently, deep learning neural networks have shown remarkable performance in the field of image classification. However, existing classification networks are typically applied within closed environments, where all collected samples are batch-trained to produce classification results. In practical applications, the categories of samples are not static but dynamically increase over time. With the continual addition of new classes, the current classification methods exhibit limited continual learning capabilities, making them unsuitable for applications with stringent real-time requirements. To address this issue, this paper investigates an incremental learning algorithm based on ensemble transfer learning to handle classification training for newly added categories. This algorithm leverages the strengths of ensemble transfer learning to enhance the network's ability to continually learn new categories, enabling fast and accurate classification of new samples. In this paper, the proposed algorithm is applied to the CUB-200-2011 dataset for class-incremental experiments and compared with other deep learning models. Experimental results demonstrate that the incremental learning classification algorithm based on integrated transfer outperforms other models.
Digital Signal processing (DSP) and Digital imageprocessing (DIP) with machine Learning (ML) and Deep Learning (DL) are popular research areas in Computer vision and related fields. We highlight transformative applic...
详细信息
In this paper, an online monitoring system of welding quality based on machinevision and machine learning was proposed. A high-speed CCD camera was used to monitor the tail end of the molten pool, and the remove smal...
In this paper, an online monitoring system of welding quality based on machinevision and machine learning was proposed. A high-speed CCD camera was used to monitor the tail end of the molten pool, and the remove small objects algorithm and contour compensation based on convex hull algorithm were utilized to achieve high-precision collection of features such as the width and length of the tail of the molten pool. This effectively solved the technical challenges caused by welding splashes and plasma arc, which could interfere with visual acquisition. Combined with neural network algorithms, a welding quality model was established and validated to accurately identify defects such as welding undercut, welding deviations, and unstable welding processes, with a defect recognition rate of $\geq 94\%$ .
暂无评论