The human skin is a remarkable structure, susceptible to numerous known and unknown diseases. Thus, diagnosing skin conditions is one of the most uncertain and complex areas in medical science, making clinical image a...
详细信息
This work is devoted to the development of a novel deep learning encoder-decoder algorithm for real-time noise and blur elimination in video frames, received from UAV. This work improves on existing algorithms by prov...
详细信息
ISBN:
(纸本)9798350372557
This work is devoted to the development of a novel deep learning encoder-decoder algorithm for real-time noise and blur elimination in video frames, received from UAV. This work improves on existing algorithms by providing a more flexible blind deblurring solution than existing kernel-based methods. The proposed method can be applied to both improve the drone operator's capabilities and to improve the performance of autonomous imageprocessing tasks, such as object identification and visual navigation systems. Different types of blur as well as possible types of noise are presented. A brief overview of existing methods is provided. The problem of frame alignment due to the object's movement and associated noise is considered. Existing deblurring and image restoration methods are reviewed, including state-of-the-art. Their limitations are highlighted. To solve the limitations a method based on a fully convolutional encoder-decoder network with residual connections is presented. Dataset generation and training procedures are discussed. The approach is then compared to existing state-of-the-art deep learning methods. The proposed method enables up to 9 times faster blind image restoration with comparable quality in comparison to existing state-of-the-art image restoration methods.
This article describes the creation and development of a Web Framework for the processing of images and Patterns based on the power of the HTML5 language and the object-oriented capabilities of the Javascript language...
详细信息
ISBN:
(纸本)9783031298592;9783031298608
This article describes the creation and development of a Web Framework for the processing of images and Patterns based on the power of the HTML5 language and the object-oriented capabilities of the Javascript language, as well as the importance of its functionality on any operating system, thereby improving and simplifying its use. The object-oriented implementation will make it easier to access image variables and characteristics (width, height, color depth and bitmap), the calculation algorithms and matrix processing have been modified to perform convolution and color discretization processes, as well as the use of Gaussian filters and the Sobel, Cannis, Roberts Operator. Leaving the data ready to search for patterns and generate color histograms. As a result, a clear, compact implementation was obtained. to be able to derive and run on any browser that supports HTML5 technology and, more importantly, on any operating system. We provide greater flexibility and portability than other CGI frameworks based on C++ and Python.
Successful Artificial Intelligence systems often require numerous labeled data to extract information from document images. In this paper, we investigate the problem of improving the performance of Artificial Intellig...
详细信息
ISBN:
(纸本)9789819916474;9789819916481
Successful Artificial Intelligence systems often require numerous labeled data to extract information from document images. In this paper, we investigate the problem of improving the performance of Artificial Intelligence systems in understanding document images, especially in cases where training data is limited. We address the problem by proposing a novel finetuning method using reinforcement learning. Our approach treats the Information Extraction model as a policy network and uses policy gradient training to update the model to maximize combined reward functions that complement the traditional cross-entropy losses. Our experiments on four datasets using labels and expert feedback demonstrate that our finetuning mechanism consistently improves the performance of a state-of-the-art information extractor, especially in the small training data regime.
Maintaining road infrastructure is essential to effective transportation systems and public safety. This research provides a new method for pothole depth estimation and automatic road crack detection using computer vi...
详细信息
Polycystic Ovary Syndrome (PCOS) is a common hormonal disorder among women of reproductive age and it can lead to infertility, metabolic disorders and other health problems. Ultrasound is an important tool for the dia...
详细信息
Machine vision technology has shown great potential for development and application in the coal heat utilization and coal chemical production process It is important to carry out image analysis-based stability adaptat...
详细信息
The real-time obstacle detection and path adjustment system for autonomous robots presented in this paper was created using OpenCV. The combination of imageprocessing techniques enables the robot to identify and navi...
详细信息
This paper introduces a novel method for RGB-Guided Resolution Enhancement of infrared (IR) images called Guided IR Resolution Enhancement (GIRRE). In the area of single image super resolution (SISR) there exists a wi...
详细信息
Adverse weather conditions such as haze and fog significantly impact the visibility and quality of outdoor images, leading to errors in various computer vision systems. Traditional defogging methods, often fall short ...
详细信息
暂无评论