The segmentation task refers to the preliminary stage of image preprocessing. Further object detection, feature recognition, scene analysis and prediction of the situations depends on its results. Modern segmentation ...
详细信息
作者:
Potter, Jerry L.Kent State Univ
Mathematical Science Dep Kent OH USA Kent State Univ Mathematical Science Dep Kent OH USA
Scene analysis includes both the relatively simple but very repetitive pixel processingalgorithms of imageprocessing and the search-intensive task of rule-based recognition and analysis. A number of special-purpose ...
详细信息
ISBN:
(纸本)0818605638
Scene analysis includes both the relatively simple but very repetitive pixel processingalgorithms of imageprocessing and the search-intensive task of rule-based recognition and analysis. A number of special-purpose architectures have been built and studied for the pixel processing task, but the search aspects have been largely ignored. The author proposes an architecture that is well suited for both pixel processing and rule-based analysis. This architecture is based on an analysis of the nature of pixel data and rule-based data. The architecture emphasizes content addressable techniques for rule-based data storage and manipulation.
This paper first analysed the state-of-the-art corner detection algorithms and then proposed a novel corner detection approach based on a maximum point-to-chord distance. The proposed corner detector consists of three...
详细信息
This paper describes experimental and theoretical investigations concerning the transport behaviour of insulation material in multidimensional aqueous flow. Modern technologies of digital imageprocessing are presente...
详细信息
ISBN:
(纸本)8476538723
This paper describes experimental and theoretical investigations concerning the transport behaviour of insulation material in multidimensional aqueous flow. Modern technologies of digital imageprocessing are presented. Experimental results were used for the construction of several Takagi-Sugeno fuzzy models which allow the calculation of sink rates for different particle classes. The process of modelling comes along with a nested usage of clustering algorithms.
The ability to capture good quality images in the dark and near-zero lux conditions has been a long-standing pursuit of the computer vision community. The seminal work by Chen et al. [5] has especially caused renewed ...
详细信息
An improved method for visibility enhancement of foggy based degraded images is presented. Proposed technique consists of two phases: firstly applied the visibility enhancement algorithm and then automatic color enhan...
详细信息
ISBN:
(纸本)9781509021185
An improved method for visibility enhancement of foggy based degraded images is presented. Proposed technique consists of two phases: firstly applied the visibility enhancement algorithm and then automatic color enhancement algorithm. Quantitative metric and qualitative result of proposed technique is evaluated and compared with other existing visibility restoration algorithms. In this paper quantitative results are presented in terms of measure of enhancement and measure of enhancement factor. Simulation results on foggy images from database demonstrates that proposed technique provides better visibility enhancement results as compared to the others existing visibility enhancement algorithms. A result reveals that proposed technique is an efficient method for visibility enhancement of foggy based degraded images.
To address the problems of poor adaptability, high computational complexity and low operational efficiency of Massive MIMO detection algorithm on reconfigurable array structure in Massive MIMO system, a parallelizatio...
详细信息
The article presents the idea of a distributed system for industrial and medical tomography. The paper shows examples of reconstruction of images made by the author using various tomographic techniques and reconstruct...
详细信息
Zhang-Suen parallel thinning algorithm with the feature of rapidity and practicality ensures the connectivity of the refined curve. However, the refined skeleton cannot be guaranteed in a single pixel wide, and redund...
详细信息
image Caption Generation (ICG), situated at the confluence of computer vision and natural language processing, empowers machines to comprehend visual content and express it in human-like language. This research offers...
详细信息
ISBN:
(数字)9798350372748
ISBN:
(纸本)9798350372748
image Caption Generation (ICG), situated at the confluence of computer vision and natural language processing, empowers machines to comprehend visual content and express it in human-like language. This research offers a comprehensive overview of key concepts, methodologies, and challenges in ICG. The process involves developing algorithms for the automatic generation of contextually relevant captions, utilizing deep neural networks for feature extraction, and employing natural language processing techniques for coherent composition. Recent advancements, particularly in convolutional neural networks for imageprocessing and recurrent neural networks for language modelling, have significantly elevated the performance of image captioning systems. The study delves into the core components of an ICG system, including pre-processing techniques for image data, feature extraction mechanisms, and the integration of language models. Attention mechanisms, a key innovation in this field, enable the model to focus on relevant image regions while generating captions, closely mirroring human attention patterns. Despite notable progress, ICG faces several challenges, such as handling diverse and complex visual scenes, ensuring cross-modal coherence between images and captions, and addressing biases present in training data. Ethical considerations, particularly in applications like automated content generation, are also discussed. The study concludes by highlighting potential future directions in ICG research, including the incorporation of multimodal learning approaches, enhancing the interpretability of generated captions, and addressing societal concerns related to bias and fairness. As ICG continues to evolve, it holds promise for various applications, ranging from accessibility for the visually impaired to improving content indexing and retrieval in multimedia databases. The research also underscores the significance of the accuracy attainments, showcasing the success of the pr
暂无评论