Business analysis plays an essential role in numerous companies and industries, which involves data collection, data analysis, and the construction of intelligent systems to make critical business decisions. By using ...
详细信息
image segmentation is of great importance in many areas including artificial intelligence, imageprocessing, computer vision, etc. The optimization of image segmentation can be converted to seek the best threshold val...
详细信息
As a method of imageprocessing, edge detection is widely used in object detection, image recognition, machine vision and other fields. At present, the research of edge detection algorithm is still a hot problem in th...
As a method of imageprocessing, edge detection is widely used in object detection, image recognition, machine vision and other fields. At present, the research of edge detection algorithm is still a hot problem in the field of imageprocessing. At the same time, in order to solve the problem that the edge detection operator is difficult to identify due to noise interference and low edge contrast in the image, in this paper, a contrast enhancement processing algorithm, based on Gaussian filter, is used to preprocess the image. In this paper, an FPGA-based grayscale conversion algorithm is used to convert the original image into a grayscale image, and then an improved algorithm based on Gaussian filtering is used to successfully enhance the slope of the edge in the original image. In order to process the edge detection of the image faster, use FPGA to realize the algorithm of edge detection of the image, and the result is output and displayed through VGA display. Finally, by comparing the experimental results of Roberts, Prewitt and Sobel operators, the advantages and disadvantages of several different algorithms are obtained. After enhancing the slope, using the edge detection algorithm, it is found that the three edge detection operators after enhancing have better detection results without wasting extra time.
Communicating online without fearing third-party interventions is becoming a challenge in the modern world. Especially the sectors like the military, and government organizations or private companies sharing sensitive...
详细信息
Surface water resource identification is one of the main techniques used in remote sensing image analysis. This is necessary to stop calamities like floods and droughts. Feature selection based on prior information an...
详细信息
Aiming at the problems of slow detection speed and low detection accuracy in existing fatigue driving detection algorithms, a fatigue driving detection algorithm based on YOLOv5 is proposed. In order to improve the fe...
详细信息
In recent years, although the task of fine-grained image classification has achieved remarkable results, these algorithms need to be trained on large datasets in order to obtain good results, otherwise it is easy to c...
详细信息
Total p-norm Variation (TpV) is a well-established technique in imageprocessing, used to denoise and preserve edges. However, the related non-convex minimization is still a challenging task in optimization, both for ...
详细信息
To article proposes an approach to improve the quality of data used in various processes based on machine vision systems. The paper proposes a combined approach to applying the method of multi-criteria processing base...
详细信息
ISBN:
(数字)9781510662117
ISBN:
(纸本)9781510662100;9781510662117
To article proposes an approach to improve the quality of data used in various processes based on machine vision systems. The paper proposes a combined approach to applying the method of multi-criteria processing based on the use of a combined criterion in order to implement an edge detector, smoothing and separation areas of the background / object in the image. The application of the method allows eliminating the noise caused by external factors (such as dust and water suspension on the lens or space). The generated data make it possible to form an adaptive criterion for changing the correction parameters for a non-linear change in color balance in areas of increased detail or selected masks of changes blocks. The proposed algorithms make it possible to increase the visibility of small elements, reduce the noise component, while maintaining the boundaries of objects, increase the accuracy of selecting the boundaries of objects and the visual quality of data. As test data used to evaluate the effectiveness, nature data and expert evaluation results for test images obtained by a machine vision system with a sensor with a resolution of 1024x768 (8-bit, color image, visible range) are used. images of simple shapes are used as analyzed objects.
A vital and rapidly growing application, remote sensing offers vast yet sparsely labeled, spatially aligned multimodal data;this makes self-supervised learning algorithms invaluable. We present CROMA: a framework that...
详细信息
ISBN:
(纸本)9781713899921
A vital and rapidly growing application, remote sensing offers vast yet sparsely labeled, spatially aligned multimodal data;this makes self-supervised learning algorithms invaluable. We present CROMA: a framework that combines contrastive and reconstruction self-supervised objectives to learn rich unimodal and multimodal representations. Our method separately encodes masked-out multispectral optical and synthetic aperture radar samples-aligned in space and time-and performs cross-modal contrastive learning. Another encoder fuses these sensors, producing joint multimodal encodings that are used to predict the masked patches via a lightweight decoder. We show that these objectives are complementary when leveraged on spatially aligned multimodal data. We also introduce X- and 2D-ALiBi, which spatially biases our cross- and self-attention matrices. These strategies improve representations and allow our models to effectively extrapolate to images up to 17.6x larger at test-time. CROMA outperforms the current SoTA multispectral model, evaluated on: four classification benchmarks-finetuning (*** arrow 1.8%), linear (*** arrow 2.4%) and nonlinear (*** arrow 1.4%) probing, kNN classification (*** arrow 3.5%), and K-means clustering (*** arrow 8.4%);and three segmentation benchmarks (*** arrow 6.4%). CROMA's rich, optionally multimodal representations can be widely leveraged across remote sensing applications.
暂无评论