Irregular-shape food processing by robotic arms like shrimp picking is a common problem in industrial automation, which can be summarized as localization of particular points on an image, emphasizing on both good accu...
详细信息
ISBN:
(纸本)9781450365307
Irregular-shape food processing by robotic arms like shrimp picking is a common problem in industrial automation, which can be summarized as localization of particular points on an image, emphasizing on both good accuracy and high speed with relatively very limited hardware resources. In most cases, the points do not have a distinct visual characteristic in color or size. In this paper, first we outline the suspicious search range resorting to intelligently learning the coarse mapping function between shrimp shape and target points, based on the proposed contour model of shrimp body, which significantly simplifies numerical representation of the original image. Next, priori knowledge of the shrimp body is used for more accurate fine localization of the target points. More specifically, in this step, the shrimp body pose is normalized for edge extraction after proper rotation and projection. The extracted edge curve on the back of the shrimp is then analyzed to accurately pick out the target corner point. During validation, in the search region detection step, the method is able to efficiently avoid wrong search in neighboring joints of shrimp body. After finer localization of the target points, the final detection rate turns out to be 93%.
The 3D structure of a virtual or augmented scene is affected by distortions that can affect the natural fruition of the experience. Here, we consider the distortions of the spatial layout of virtual scenes, specifical...
详细信息
ISBN:
(纸本)9781728102474
The 3D structure of a virtual or augmented scene is affected by distortions that can affect the natural fruition of the experience. Here, we consider the distortions of the spatial layout of virtual scenes, specifically the underestimation of egocentric distances, as experienced by users. The aim of our work is to develop a computervision based modeling that is able to describe such an underestimation. In particular, we propose that the spatial misperception has origin from the fact that the world is seen by a user through an uncalibrated camera, thus producing a virtual view of a distorted space for him/her. The proposed geometric model is able to explain the experimental evidence of the egocentric distance underestimation that is reported in the literature.
With the increasing availability of medical images coming from different modalities (X-Ray, CT, PET, MRI, ultrasound, etc.), and the huge advances in the development of incredibly fast, accurate and enhanced computing...
详细信息
ISBN:
(纸本)9781450365628
With the increasing availability of medical images coming from different modalities (X-Ray, CT, PET, MRI, ultrasound, etc.), and the huge advances in the development of incredibly fast, accurate and enhanced computing power with the current graphics processing units. The task of automatic caption generation from medical images became a new way to improve healthcare and the key method for getting better results at lower costs. In this paper, we give a comprehensive overview of the task of image captioning in the medical domain, covering: existing models, the benchmark medical image caption datasets, and evaluation metrics that have been used to measure the quality of the generated captions.
images could be understood and analyzed more easily through enhance processing. Generally, image enhancement can be divided into spatial domain enhancement and frequency domain enhancement according to different proce...
详细信息
ISBN:
(纸本)9781538695944
images could be understood and analyzed more easily through enhance processing. Generally, image enhancement can be divided into spatial domain enhancement and frequency domain enhancement according to different processing domains. Yet compared with frequency domain processing, spatial domain enhancement is more easily to realize real-time enhancement of images, namely, spatial domain processing is easier to satisfy the real-time requirements of applications in practical engineering. While, real-time is critical to robots for performing operations. Hence, this paper devotes to exploring performance of four typical spatial domain enhancement algorithms for a mobile robot operating in low contrast environment. More precisely, fundamental theory of contrast stretch, histogram equalization, histogram matching, and local histogram are overviewed. Furthermore, a mobile robot system is briefly introduced. Finally, in condition of low contrast, performance of above four algorithms are compared in mobile robot via experiments. This work will provide a basic reference for further research on image enhancement in robot vision.
The core of natural language processing is the science of how computers understand and respond to the influence of human language. This is also the main research direction in the field of machine intelligence developm...
详细信息
Classification in natural stone industry have a great importance for enterprises. There are reinstatement cases arising from the fact that ordered granite parties are not the same as the agreed sample parties at the b...
详细信息
ISBN:
(纸本)9781538678930
Classification in natural stone industry have a great importance for enterprises. There are reinstatement cases arising from the fact that ordered granite parties are not the same as the agreed sample parties at the beginning, which causes significant economic losses for the companies. There is a greater need to classify tiles using computer-aided imageprocessing methods for the development of quality control processes that have become increasingly important due to the rapidly increasing competition and globalization in the natural stone industry. In this type of automatic systems, the attributes that give information about color and surface are extracted from the images of natural stone tiles with imageprocessing techniques and then the data set obtained by using these attributes are classified by various artificial intelligence and data mining techniques. In this study, a classification was made on a dataset consisting of 996 pictures of natural stone tiles from six categories obtained from a natural stone producer (Beta Mermer I. C.) operating in Sivas. Gray level co-occurrence matrix (GLCM) and local binary pattern (LPB) are used to obtain pattern information of granite tiles. Several statistics related to each color channel were used to obtain color information of granites. Various datasets are created using only pattern information and combination of pattern and color information of tiles. Subsequently, classification performance of these datasets are compared using several algorithms such as, artificial neural networks, support vector machines, and naive bayes.
Nowadays, more and more object recognition tasks are being solved with Convolutional Neural Networks (CNN). Due to its high recognition rate and fast execution, the convolutional neural networks have enhanced most of ...
详细信息
Nowadays, more and more object recognition tasks are being solved with Convolutional Neural Networks (CNN). Due to its high recognition rate and fast execution, the convolutional neural networks have enhanced most of computervision tasks, both existing and new ones. In this article, we propose an implementation of traffic signs recognition algorithm using a convolution neural network. The paper also shows several CNN architectures, which are compared to each other. Training of the neural network is implemented using the TensorFlow library and massively parallel architecture for multithreaded programming CUDA. The entire procedure for traffic sign detection and recognition is executed in real time on a mobile GPU. The experimental results confirmed high efficiency of the developed computervision system. (C) 2017 The Authors. Published by Elsevier Ltd. Peer-review under responsibility of the scientific committee of the 3rdinternationalconference "Information Technology and Nanotechnology".
Object tracking is one of the most important tasks in computervision. It is aimed to track and identify the targets in every frame automatically, thus making it possible extracting the targets in the frames for the p...
详细信息
ISBN:
(纸本)9781509063529
Object tracking is one of the most important tasks in computervision. It is aimed to track and identify the targets in every frame automatically, thus making it possible extracting the targets in the frames for the post-processing. The described method encompasses two important issues in image tracking system that have received increased attention in the few years: moving object tracking and object recognition. It is based on kernel algorithm and due to the continuously changing background and moving targets, object tracking has been a challenging task We review strategies under kernel algorithm and successfully track and restore the image.
The rainfall monitoring from Radar images in any areas of Thailand is the one task for the flood monitoring system. Optical Character Recognition or OCR technique was applied to recognize these text images and convert...
详细信息
ISBN:
(纸本)9781538663509
The rainfall monitoring from Radar images in any areas of Thailand is the one task for the flood monitoring system. Optical Character Recognition or OCR technique was applied to recognize these text images and convert them into editable text. The propose of this work is to develop a text image recognition program which can identify the date and time in images for collecting and arranging the Radar images in archive. Because of the editable date and time texts are able to replace in the original image filename.
With the development of the Internet of Things and 5G technology, stream processing systems are more and more widely used, especially the open source Storm platform. A comprehensive understanding of the research statu...
详细信息
暂无评论