This article describes the use of gesture recognition techniques from computer vision as a natural interface for video content navigation, and the design of a navigation and browsing system that caters to these natural means of human-computer interaction. For consumer applications, video content navigation presents two challenges: (1) how to parse and summarize multiple video streams in an intuitive and efficient manner, and (2) what type of interface will make video browsing and navigation easier to use in a living room setting or an interactive environment. We address these issues and propose techniques that combine video content navigation with gestures, seamlessly and intuitively, in an integrated system. The current framework can also incorporate speech recognition technology. We present a new type of browser for browsing and navigating video content, as well as a gesture recognition interface for this browser.
A pattern locating tool called SmART™ (Smart Alignment and Registration Tool) Search has been developed for the accurate and precise location of patterns despite normal process variations. SmART Search features Training Wizard™, a time-saving utility that takes the guesswork and uncertainty out of the pattern training process. With state-of-the-art GeoSearch-assist, SmART Search enables manufacturers of vision-automated equipment to build more robust machines that automatically adapt to changes in object appearance caused by normal process variations.
Matching sets of points or segments is a well-studied problem with applications in image processing and computer vision, and also in areas such as bioinformatics and astronomy. We present an approximate solution to the segment matching problem in 3D that can be used to recognize planar-faced objects from range data. Our main contributions in the area of geometric matching are: a) a new definition of the Hausdorff distance between two sets of segments, which appears to be better suited to comparisons between sets of geometric entities; b) an efficient and practical strategy for approximate matching of sets of segments using this distance definition. Our solution extends results obtained for the simpler case of point sets, with the same time efficiency and within the same error bounds.
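As background for the point-set case mentioned above, the classical Hausdorff distance between two finite point sets can be sketched as follows. This is the standard textbook definition, not the segment-based definition the abstract introduces, which is not spelled out here. The function names and sample coordinates are illustrative assumptions.

```python
import math


def directed_hausdorff(a, b):
    """Largest distance from any point in `a` to its nearest point in `b`."""
    return max(min(math.dist(p, q) for q in b) for p in a)


def hausdorff(a, b):
    """Symmetric Hausdorff distance: the larger of the two directed distances."""
    return max(directed_hausdorff(a, b), directed_hausdorff(b, a))


# Hypothetical 3D point sets, chosen only for illustration.
set_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
set_b = [(0.0, 0.0, 0.0), (1.0, 0.0, 2.0)]
print(hausdorff(set_a, set_b))
```

This brute-force version costs O(|a|·|b|) distance evaluations and performs no matching under transformation; it only measures how far apart two already-aligned sets are.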
Recent developments in computer vision and pattern recognition have enabled the development of sophisticated vision-based quality control systems for automatic inspection. We present a new geometrical analysis of rigid body transformation parameters based on properties of reflected correspondence vectors. Based on this analysis, we propose a novel algorithm to calibrate the transformation parameters of 3D objects in a fast production line of filter components. The algorithm performs a rigidity analysis on range images acquired from two cameras, making full use of distance and angle information to provide a closed-form solution for all parameters of interest. The method is used in conjunction with a decision support system in which components falling outside specified thresholds are rejected. For a comparative study of algorithm performance, we also implemented a well-known procedure for rigidity analysis based on quaternions. Experimental results demonstrate that our algorithm has a number of advantages over the quaternion method and that its performance is similar or superior to it.
The need for accurate and efficient object localization arises in many industrial applications, such as automated visual inspection and factory automation. The image reference approach is very popular in automatic visual inspection because it applies to a wide variety of inspection tasks. However, it requires very precise alignment of the inspection pattern in the image. Achieving such precise alignment with traditional template matching is extremely time-consuming when the search space is large. In this paper, we present a new FLASH (Fast Localization with Advanced Search Hierarchy) algorithm for fast and accurate object localization in a large search space. This object localization algorithm is very useful for applications in automated visual inspection and in pick-and-place systems for automatic factory assembly. It is based on the assumption that the surrounding regions of the pattern within the search range are always fixed, which is valid for most industrial inspection applications. The FLASH algorithm comprises a hierarchical nearest-neighbor search algorithm and an optical-flow-based energy minimization algorithm. The hierarchical nearest-neighbor search produces a rough estimate of the transformation parameters, which serves as the initial guess for the iterative optical-flow-based energy minimization; the latter provides very accurate estimates and associated confidence measures. Experimental results demonstrate the accuracy and efficiency of the proposed FLASH algorithm.
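The two-stage idea of a rough estimate followed by refinement can be illustrated with a generic coarse-to-fine template search. This is not the FLASH algorithm: the sketch below assumes a pure translation, replaces the hierarchical nearest-neighbor search with a strided grid scan, and replaces the optical-flow-based energy minimization with an exhaustive local search. All names and the `stride` parameter are hypothetical.

```python
def ssd(image, template, top, left):
    """Sum of squared differences between the template and an image patch."""
    return sum(
        (image[top + r][left + c] - template[r][c]) ** 2
        for r in range(len(template))
        for c in range(len(template[0]))
    )


def coarse_to_fine_locate(image, template, stride=4):
    """Return the (top, left) offset that best matches the template."""
    max_top = len(image) - len(template)
    max_left = len(image[0]) - len(template[0])

    def score(pos):
        return ssd(image, template, pos[0], pos[1])

    # Stage 1 (assumption): rough estimate from a strided grid of offsets.
    coarse = [
        (t, l)
        for t in range(0, max_top + 1, stride)
        for l in range(0, max_left + 1, stride)
    ]
    t0, l0 = min(coarse, key=score)

    # Stage 2 (assumption): exhaustive refinement around the rough estimate.
    fine = [
        (t, l)
        for t in range(max(0, t0 - stride + 1), min(max_top, t0 + stride - 1) + 1)
        for l in range(max(0, l0 - stride + 1), min(max_left, l0 + stride - 1) + 1)
    ]
    return min(fine, key=score)
```

The coarse stage can only succeed when the matching cost varies smoothly enough that the best strided offset lies near the true one; on highly textured images a smaller stride or a smoothed image pyramid would be needed.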
ISBN: 0818685123 (print)
Macro features such as triangles, quadrilaterals, and polygons play a very important role in many computer vision applications, including matching, visual inspection, and object tracking. However, effective ways to extract such macro features from images are still not available in the literature, and there are few reports on the matter. This paper proposes a randomized Hough technique to directly extract general triangles (i.e., with unknown size and orientation) from images. Extensive simulations as well as experiments with real images show that the results are satisfactory.
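The randomized Hough idea is easiest to see for straight lines: instead of letting every edge point vote for all parameter cells, random pairs of points are sampled and each pair votes for the single line it defines. The sketch below covers only this line case and is not the triangle extraction proposed in the paper. The function names, bin sizes, and sample points are assumptions.

```python
import math
import random
from collections import Counter


def line_through(p, q):
    """Normal-form parameters (theta, rho) of the line through p and q,
    where x*cos(theta) + y*sin(theta) = rho and theta lies in [0, pi)."""
    dx, dy = q[0] - p[0], q[1] - p[1]
    theta = math.atan2(dx, -dy) % math.pi  # direction of the line normal
    rho = p[0] * math.cos(theta) + p[1] * math.sin(theta)
    return theta, rho


def randomized_hough_lines(points, samples=200, theta_step_deg=1.0,
                           rho_step=1.0, seed=0):
    """Accumulate votes for quantized (theta, rho) cells from random pairs."""
    rng = random.Random(seed)
    votes = Counter()
    for _ in range(samples):
        p, q = rng.sample(points, 2)
        if p == q:  # duplicate coordinates define no line
            continue
        theta, rho = line_through(p, q)
        key = (round(math.degrees(theta) / theta_step_deg),
               round(rho / rho_step))
        votes[key] += 1
    return votes


# Hypothetical edge points: ten on the line y = x plus three outliers.
points = [(float(i), float(i)) for i in range(10)]
points += [(0.0, 9.0), (9.0, 0.0), (3.0, 7.0)]
votes = randomized_hough_lines(points)
print(votes.most_common(1))
```

Cells are keyed by integer bin indices, so with the default one-degree bins the line y = x falls in the cell whose first index is 135. The sketch does not handle the wrap-around between angles near 0 and near 180 degrees.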
In this paper, a structural representation and fuzzy matching scheme is proposed for off-line multi-font Chinese character recognition. First, a Chinese character is decomposed into strokes of eight types. Second, a complete set of structural attribute feature codes among the different types of strokes is defined and extracted. Last, a fuzzy matching scheme and a dynamic programming algorithm are used for detailed matching between an input character and candidate characters. Experiments on about 5140 daily-used characters show that our method achieves 96.23% recognition accuracy.
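A dynamic programming match between two stroke sequences can be sketched as a weighted edit distance. The abstract does not give the paper's feature codes or its fuzzy membership functions, so the single-letter stroke labels and the `soft_cost` function below are purely hypothetical stand-ins.

```python
def stroke_edit_distance(a, b, sub_cost=None):
    """Minimum total cost of turning stroke sequence `a` into `b`.

    Insertions and deletions cost 1; `sub_cost(x, y)` prices a substitution
    and defaults to 0 for equal strokes and 1 otherwise.
    """
    if sub_cost is None:
        sub_cost = lambda x, y: 0.0 if x == y else 1.0
    prev = [float(j) for j in range(len(b) + 1)]
    for i, x in enumerate(a, start=1):
        cur = [float(i)]
        for j, y in enumerate(b, start=1):
            cur.append(min(
                prev[j] + 1.0,               # delete x
                cur[j - 1] + 1.0,            # insert y
                prev[j - 1] + sub_cost(x, y)  # substitute x with y
            ))
        prev = cur
    return prev[-1]


def soft_cost(x, y):
    """Hypothetical graded cost: any two different stroke types cost 0.5."""
    return 0.0 if x == y else 0.5


# Hypothetical labels: H = horizontal stroke, V = vertical stroke.
print(stroke_edit_distance("HVH", "HH"))
print(stroke_edit_distance("H", "V", soft_cost))
```

A recognizer built on this would score the input against each candidate character and keep the candidate with the smallest distance; a graded substitution cost is one simple way to let similar strokes match partially.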
Image sequences are often encountered in computer vision and image processing research. Exploiting the information concealed in a sequence of images may make some algorithms, such as enhancement algorithms, more effective and efficient. This paper derives a novel formula for enhancing an image sequence. Starting from the heat diffusion equation of a deformable object, we derive a practical form for the enhancement of image sequences. Our method may converge faster while still keeping the locations of edges precise and sharp.
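For readers unfamiliar with diffusion-based smoothing, a minimal sketch of one explicit finite-difference step of the plain heat equation on a 1D signal is given below. It shows only the standard linear diffusion that such methods start from, not the formula derived in the paper. The function name, the replicated-border handling, and the `alpha` step size are assumptions.

```python
def diffuse(signal, alpha=0.25, steps=1):
    """Apply explicit heat-diffusion steps to a 1D signal.

    Each step computes u[i] += alpha * (u[i-1] - 2*u[i] + u[i+1]).
    The explicit scheme is stable for alpha <= 0.5.
    """
    u = list(signal)
    for _ in range(steps):
        padded = [u[0]] + u + [u[-1]]  # replicate borders (no flux at ends)
        u = [
            padded[i] + alpha * (padded[i - 1] - 2 * padded[i] + padded[i + 1])
            for i in range(1, len(padded) - 1)
        ]
    return u


print(diffuse([0, 0, 4, 0, 0]))
```

Plain linear diffusion of this kind smooths edges along with noise, which is why edge preservation is a relevant property for a diffusion-derived enhancement method.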
In this paper, we introduce a new pre-classification scheme for printed Chinese characters. It first extracts secondary feature codes of four particular strokes for each Chinese character: the topmost and bottommost horizontal strokes and the leftmost and rightmost vertical strokes. Then four corresponding index key values are generated from the four feature codes. An index search scheme is designed for the pre-classification of Chinese characters. The index search gives high pre-classification speed for normal characters, while its range search maintains accuracy for distorted characters. Experiments on more than 4175 daily-used Chinese characters achieve an average pre-classification rate of 99.3%.
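The combination of a fast exact index lookup with a tolerant range search can be sketched as follows. The abstract does not say how the four key values are computed or how the range is defined, so the integer key tuples, the per-key tolerance `tol`, and the character labels below are all hypothetical.

```python
from collections import defaultdict


def build_index(entries):
    """Map each 4-tuple of key values to the characters that share it."""
    index = defaultdict(list)
    for char, keys in entries:
        index[keys].append(char)
    return index


def exact_candidates(index, keys):
    """Fast path: characters whose keys equal the query keys exactly."""
    return list(index.get(keys, []))


def range_candidates(index, keys, tol=1):
    """Tolerant path: characters whose every key is within `tol` of the query."""
    return [
        char
        for stored, chars in index.items()
        if all(abs(s - k) <= tol for s, k in zip(stored, keys))
        for char in chars
    ]


# Hypothetical reference set: (label, (top, bottom, left, right) key values).
index = build_index([
    ("char_a", (1, 2, 3, 4)),
    ("char_b", (1, 2, 3, 5)),
    ("char_c", (7, 7, 7, 7)),
])
print(exact_candidates(index, (1, 2, 3, 4)))
print(range_candidates(index, (1, 2, 4, 4)))
```

In this sketch the range search scans every stored key tuple; a practical system would keep the keys sorted or bucketed so that only nearby entries are visited.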
The use of a genetic algorithm to select the optimal structuring element is discussed. An optimal structuring element extraction for mathematical morphology signature transform (MST) based shapes is determined. The new method was found to be able to find the global optimum while keeping the advantages of genetic algorithms and tabu search. A brief description of the MST, its shape, and the application of the structuring element is also given.