Computer vision (CV) and imageprocessing techniques aim at the fast development of medical images diagnoses field. As the specialist takes a long time to diagnose one MRI images, CV techniques and machine learning al...
详细信息
As we all know, the large number of counts is a challenging and time consuming task subject because of oversized number and complex conditions. However, the development of deep learning makes deep learning models very...
详细信息
ISBN:
(数字)9781728161365
ISBN:
(纸本)9781728161372
As we all know, the large number of counts is a challenging and time consuming task subject because of oversized number and complex conditions. However, the development of deep learning makes deep learning models very competitive in image segmentation. In this paper, we take cigarette filter rods as the research object. we first evaluate the standard Unet for the filter rod target recognition to separate target and background. Secondly, we use the focal loss function instead of the traditional cross-entropy function to solve the problem of imbalance between target and background area. Thirdly, we add a self-attention module in the traditional Unet convolutional layer to enhance the convolution effect. Fourth, we propose structural element detection criteria and round tangency matching strategy based on HMM (Hidden Markov Model) for the geometric relationship of filter rod position, which further improves the accuracy of the algorithm. We used Qu's [1], Mask-R-CNN [2], FCN [3], Deep-lab-V1 [4] and this paper's algorithm to test the performance of 30000 images from the industrial site. The performance of this paper's algorithm is completely better than the performance of the above algorithm.
This paper presents a new combined local and global transform domain-based feedback image enhancement algorithm for medical diagnosis, treatment, and clinical research. The basic idea in using local alfa-rooting metho...
详细信息
The trajectory of a moving object may be extracted from video using imageprocessingalgorithms. However, the quality of the extracted information largely depends on the frame rate and exposure time of the camera, thu...
详细信息
ISBN:
(纸本)9781728145433
The trajectory of a moving object may be extracted from video using imageprocessingalgorithms. However, the quality of the extracted information largely depends on the frame rate and exposure time of the camera, thus it is difficult to capture fast movement using slower and less expensive cameras. To this end, we propose to use an active modulated light source for object tracking, interacting with exposure times and sub-sampling existing frames. A prototype of a multi-functional active visual marker is presented in this paper. The system is based on the ESP-WROOM-32 microcontroller, which is configured to use various communication protocols, namely WiFi 802.11, RF 2.4 GHz GFSK and RS485. The microcontroller controls the RGB LED, which is used as light source. In addition, the system can be synchronized with the external real-time clock. The experimental results have illustrated the advantages and disadvantages of the designed active markers and pointed out the directions for future work and development.
Fractional Motion Estimation (FME) is a very significant part of video coding technology, but its high computational complexity often limits the real-time performance and makes the area cost very high in hardware desi...
详细信息
ISBN:
(数字)9781728168968
ISBN:
(纸本)9781728168975
Fractional Motion Estimation (FME) is a very significant part of video coding technology, but its high computational complexity often limits the real-time performance and makes the area cost very high in hardware design. In this brief, this paper puts forward some novel joint optimizations with taking both algorithm and architecture into account to increase the throughput and reduce the area cost as much as possible, including SATD-based inter coding unit (CU) early split method and pipeline the 25 points parallel search and interpolation. In addition, to further improve throughput, for CUs with size greater than 8×8, we adopt the method of splicing to further reduce redundant operations and improve the ability of architecture reuse. These ideas help us save nearly 75% of the computational effort and complexity in FME of inter frame prediction. Finally, verified by the Xilinx vivado HLS, the experiment results show that the algorithms and architecture proposed in this paper are able to process videos of 1920×1080@60 fps under the 300 MHz clock frequency with 3% bitrates increase in average.
The application of multi-rotor UAV in vision has been developed rapidly in recent years, and multi-rotor unmanned aerial vehicles (UAV) can't be widely used without the aid of a visual system. The application of v...
详细信息
This paper introduces an intelligent and efficient path guidance robot to assist the visually impaired people in their movements. This is a novel device for the replacement of strenuous guide dogs. The robot has the c...
详细信息
This work addresses segmentation of volumetric images of woven carbon fiber textiles from micro-tomography data. We propose a semi-supervised algorithm to classify carbon fibers that requires sparse input as opposed t...
详细信息
This work addresses segmentation of volumetric images of woven carbon fiber textiles from micro-tomography data. We propose a semi-supervised algorithm to classify carbon fibers that requires sparse input as opposed to completely labeled images. The main contributions are: (a) design of effective discriminative classifiers, for three-dimensional textile samples, trained on wavelet features for segmentation;(b) coupling of previous step with nonlocal means as simple, efficient alternative to the Potts model;and (c) demonstration of reuse of classifier to diverse samples containing similar content. We evaluate our work by curating test sets of voxels in the absence of a complete ground truth mask. The algorithm obtains an average 0.95 F1 score on test sets and average F1 score of 0.93 on new samples. We conclude with discussion of failure cases and propose future directions toward analysis of spatiotemporal high-resolution micro-tomography images.
Thresholding is the simplest but most effecttive segmentation technique for image analysis. However, the computational complexity increases exponentially with the increase of threshold number in order to seek the most...
详细信息
ISBN:
(纸本)9781728140681
Thresholding is the simplest but most effecttive segmentation technique for image analysis. However, the computational complexity increases exponentially with the increase of threshold number in order to seek the most appropriate threshold values. Therefore, stochastic optimization algorithm are often used to overcome excessive computational problems, but the single optimization algorithm often falls into the local optimum. In general, hybrid algorithm is able to produce better performance. As a result, a parallel coupled mode(DE_GA in brief) of differential evolution algorithm (DE) and genetic algorithm (GA) is proposed for solving multi-threshold problem and The maximum variance is used as the fitness function. The experimental result displays that compared with a single algorithm, the results of the hybrid algorithm are relatively stable, which means that the parallel coupled DE_GA algorithm combined with Otsu might be an effect and practical image segmentation method.
The paper deals with the approach to crystal lattice identification based on analysis of isosurface configurations. This method allows to estimate the relative positioning of nodes inside a unit cell, which, theoretic...
详细信息
ISBN:
(数字)9781728170411
ISBN:
(纸本)9781728170428
The paper deals with the approach to crystal lattice identification based on analysis of isosurface configurations. This method allows to estimate the relative positioning of nodes inside a unit cell, which, theoretically, should provide favorable conditions for the creation of identification algorithms resistant to structure distortions. The major problem of high computational complexity is solved by modifying the method using the three-dimensional periodicity of crystal lattice structures. The study of the method performed on a large base of reference lattices has confirmed its high resistance to structure distortions. In addition, the joint analysis of isosurface configurations and parameters of Bravais unit cells allowed to increase the accuracy of structural identification by an average of 13%.
暂无评论