Ocean exploration is a major challenge that we are facing today. With advancements to fields of marine engineering and aquatic robotics, we are capable of performing autonomous and complex decision making deep underwa...
详细信息
ISBN:
(纸本)9781728188768
Ocean exploration is a major challenge that we are facing today. With advancements to fields of marine engineering and aquatic robotics, we are capable of performing autonomous and complex decision making deep underwater. Significance of online Underwater computervision Algorithms is ever increasing. Underwater images, however, suffer from inaccurate colors, hazing, colour cast and degradations because of unequal absorption of light by water. Algorithms designed for detection/enhancement in the air are of no use underwater. Although a lot of underwater image enhancement algorithms have come up in recent times, most of them are not suitable for real-time applications like AUV, due to their high computational times. these algorithms are more suitable for offline analysis. In this paper, we propose an algorithm which is fast enough for real-time systems such as AUVs/ROVs and is comparable to the offline state of the art image enhancement algorithms. We will be exploring histogram equalization techniques for dehazing and automatic white balancing algorithms for color correction. UIEB (Underwater image Enhancement Benchmark) is used for evaluation of our algorithm. the codes and results are available at https://***/opgp/underwater-image-processing.
Fine-grained visual classification (FGVC) is a challenging task in image classification due to the small differences between classes and the large differences within subclasses. In the early works, some methods mainly...
详细信息
the fast compressed sensing (CS) reconstruction for natural imageprocessing aims to infer the original pixel vector from randomized measurements as correctly as possible. Current first-order proximal mainstream schem...
详细信息
indian Sign Language (ISL) is a form of communication used in India by the speech and hearing impaired community. It conveys linguistic information through gestures of the hands, arms, face, and head. However, the ges...
详细信息
Document digitization is an active area of research especially involving handwritten manuscripts. While the most common use cases involve digital libraries, there are other important applications in the area of electr...
详细信息
ISBN:
(数字)9783031113468
ISBN:
(纸本)9783031113451
Document digitization is an active area of research especially involving handwritten manuscripts. While the most common use cases involve digital libraries, there are other important applications in the area of electronic health records where handwritten text is predominant in developing worlds. the state-of-the-art approaches are domain-specific, and scaling across domains is still an open research problem. We report here a platform for real-time annotation and training of sub-region models in scanned documents using model pools and plug-n-play of annotation services. Given a document, sub-regions are annotated with textual labels. the textual regions themselves may correspond to characters or words or any other pattern of interest. For a given sub-region category, several sub-regions may be present in a given page or across pages. In the proposed system, a user needs to annotate only some of the sub-regions. A convolutional neural network (CNN) model is built for each of the sub-region categories, and named sets or pools of such models are prepared for application on any new document. We observe that a sub-region label may be provided by an existing optical character recognition system instead of a human annotator. In this regard, we have provisioned annotation as a service where any third-party system can be integrated into a plug-n-play mechanism. the state-of-the-art systems focused on having a pre-trained monolithic model which suffers from the problem of catastrophic forgetting when new sub-region classes are added over time. In our approach, due to sub-region specific models, the previous data models are not touched and hence providing a truly incremental learning solution. We have carried out the validation by choosing handwritten data sets belonging to different languages such as Devanagari, Kannada, Telugu, English that span diverse text patterns and the models produced by our sub-region detection algorithm were evaluated on documents containing hundreds
computervision algorithms are used in applications which require the given system to process and display images or video inputs. However, this will be computationally intensive on the machine that is performing the a...
详细信息
In the recent era, image tampering has become one of the threatening security problems in digital platforms. there are many software's available for tampering with an imagethat depicts as an original image. Diffe...
详细信息
the optimal technique for object detection directly in the compressed domain of a video sequence or an image is presented in this study, we delve into the application of Convolutional Neural Networks (CNNs) for object...
the optimal technique for object detection directly in the compressed domain of a video sequence or an image is presented in this study, we delve into the application of Convolutional Neural Networks (CNNs) for object recognition, employing image filtering and edge detection techniques within the framework of deep learning. the paper leverages the power of CNNs to automatically learn and extract essential features from images, enabling effective object recognition. We explore the significance of image filtering and edge detection as preprocessing steps to enhance feature extraction, leading to improved classification accuracy. through comprehensive experiments and analyses, we demonstrate the efficacy of our approach in various real-world scenarios, highlighting the potential for CNNs to play a pivotal role in advancing object recognition technology. this paper offers valuable insights into the synergy between CNNs and imageprocessing techniques, with promising implications for diverse applications, including computervision, robotics, and autonomous systems.
3D multi-object detection and tracking is an essential constituent for many applications in today's world. Object detection is a technology related to computervision and imageprocessingthat allows us to detect ...
详细信息
Optical Coherence Tomography (OCT) is a non-invasive imaging technology for diagnosing various macular pathologies. It assists ophthalmologists to detect abnormalities in the retina and thereby avoid many sight-threat...
详细信息
暂无评论