In computed tomography and several related scientific domains, the Fourier slice theorem is a powerful mathematical tool to solve the problem of image reconstruction. Although this theorem is well understood in the co...
详细信息
ISBN:
(纸本)9798400708275
In computed tomography and several related scientific domains, the Fourier slice theorem is a powerful mathematical tool to solve the problem of image reconstruction. Although this theorem is well understood in the continuous case, a detailed quantitative analysis of artifacts caused by discretization is rarely found in the computed tomographic literature. Assuming a practical Fourier Domain Reconstruction (FDR) algorithm, which performs resampling by interpolation or approximation in the frequency domain, artifacts have two main sources. One of these is a combination of truncation and aliasing, introduced by Discrete Fourier Transform (DFT), while the other is the numerical error of the function estimation algorithm that performs resampling. Here, we provide an algebraic method to quantitatively isolate distinct sources of error and construct a set of novel metrics that can be used in the numerical analysis of reconstruction methods.
This paper introduces a vision-based gesture mouse system, which is roughly independent from the lighting conditions, because it only uses the depth data for hand sign recognition. A Kinect sensor was used to develop ...
详细信息
This paper introduces a vision-based gesture mouse system, which is roughly independent from the lighting conditions, because it only uses the depth data for hand sign recognition. A Kinect sensor was used to develop the system, but other depth sensing cameras are adequate as well, if their resolutions are similar or better than the resolution of Kinect sensor. Our aim was to find a comfortable, user-friendly solution, which can be used for a long time without getting tired. The implementation of the system was developed in C++, and two types of test were performed too. We investigated how fast the user can position with the cursor and click on objects and we also examined which controls of the graphical user interfaces (GUI) are easy to use and which ones are difficult to use with our gesture mouse. Our system is precise enough to use efficiently most of the elements of traditional GUI such as buttons, icons, scrollbars, etc. The accuracy achieved by advanced users is only slightly below as if they used the traditional mouse.
Recognition of car make and model from frontal images is a common problem in computer vision. We refined existing approaches based on ROIs defined relative to the number plate. Square-Mapped-Gradient features are extr...
详细信息
Recognition of car make and model from frontal images is a common problem in computer vision. We refined existing approaches based on ROIs defined relative to the number plate. Square-Mapped-Gradient features are extracted from the ROI and recognition is accomplished by classification utilizing a learning set. The classifier is evaluated using ground truth data provided manually. Via numerical simulations we evaluated the detection tolerance of the method and proposed semi-automatic and fully automatic methods. The SMG-based classification is able to give nearly perfect results when there is no outlier class, which decreases to 92% and 87% in case of the semi-automatic and fully automatic methods, respectively. Separation between outliers and known types can be balanced by a threshold. Since the size of the learning set can be kept low and the size of the SMG features are small, this approach can be successfully used to solve mobile client-server scenarios.
We have been developing a hand gesture based computer mouse controlling system, whose central issue is the precise controllability and the comfort of the user. The Kinect for Windows sensor provides the data to proces...
详细信息
We have been developing a hand gesture based computer mouse controlling system, whose central issue is the precise controllability and the comfort of the user. The Kinect for Windows sensor provides the data to process. We have investigated 11 different hand signs inside a special zone to choose appropriate gestures for controlling. In this paper we point out the advantages and disadvantages of the potential solutions and present the preliminary results of our system.
In this paper a linear registration framework is used for medical image registration using segmented binary objects. The method is best suited for problems where the segmentation is available, but we also propose a...
详细信息
In this paper a linear registration framework is used for medical image registration using segmented binary objects. The method is best suited for problems where the segmentation is available, but we also propose a general bone segmentation approach for CT images. We focus on the case when the objects to be registered differ considerably because of segmentation errors. We check the applicability of the method to bone segmentation of pelvic and thoracic CT images. Comparison is also made against a classical mutual information-based registration method.
The digitalization processes of bills produce frequently digital images with a mass of redundant background caused by rotation angles and misoperations in the scanning process. The redundant background occupies much s...
详细信息
The digitalization processes of bills produce frequently digital images with a mass of redundant background caused by rotation angles and misoperations in the scanning process. The redundant background occupies much storage space. First, this paper proposes a method that combines Hough transform and mathematical morphology to detect the skew. Secondly, the redundant background of bill images is removed after recognizing the coordinates of the top-left and bottom-right points of the target area. Last, experimental results are given on different types of bill images. The method provided in this paper is automated and simple to implement.
The reading process of visual codes consists of two steps, localization and data decoding. This paper presents a novel method for QR code localization using deep rectifier neural networks, trained directly in the JPEG...
详细信息
ISBN:
(纸本)9789897580413
The reading process of visual codes consists of two steps, localization and data decoding. This paper presents a novel method for QR code localization using deep rectifier neural networks, trained directly in the JPEG DCT domain, thus making image decompression unnecessary. This approach is efficient with respect to both storage and computation cost, being convenient, since camera hardware can provide JPEG stream as their output in many cases. The structure of the neural networks, regularization, and training data parameters, like input vector length and compression level, are evaluated and discussed. The proposed approach is not exclusively for QR codes, but can be adapted to Data Matrix codes or other two-dimensional code types as well.
The micro-structure of wood-based insulation materials is analyzed to gain insight into how features on microscopic scales influence macroscopic thermal conductivity. Three-dimensional (3D) image data obtained by micr...
详细信息
In this paper, we propose a lossless binary image compression scheme that can achieve high compression ratio via partitioning the black regions (one's) of the input image into rectangles. After partitioning, the t...
详细信息
This paper describes a fast image segmentation approach designed for pavement detection in a moving camera. The method is based on a graph-oriented segmentation approach where gradient information is used temporally a...
详细信息
暂无评论