This paper describes a new application of the technique known as Gradient Pattern Analysis (GPA), focused here on computer vision. In the GPA domain, the image is translated into a tessellation triangulation field bas...
详细信息
The plant diseases have a direct impact on the quality and quantity of the crop, and by diagnosing them, the market value of agricultural products increases. This exemplifies the significance of healthy plants as well...
详细信息
This paper constructs an energy model based on local features used in stereo matching. The local features include the similarity between different image areas, the matching cost function pattern, the connection betwee...
详细信息
This paper constructs an energy model based on local features used in stereo matching. The local features include the similarity between different image areas, the matching cost function pattern, the connection between neighbor pixels, and the occlusion geometric relationship. Based on these features, we define the weight of each data term and smoothing term in the energy function and then design an algorithm to solve the energy model and get disparity results. The significant improvements of this paper include as following. 1) We modify the structure of the energy function. First, we define the weight of the data term based on the reliability of its corresponding disparity result, which is obtained by cost function features and the occlusion geometric relationship. Then we define the weight of the smoothing term by analyzing the characteristic relation between neighbor super-pixels. We can also reduce the computational complexity by detecting and reducing some low-strength connections. 2) We proposed an algorithm based on pairwise Markov random field (MRF) (Taniai et al., IEEE Trans Pattern Anal machine Intell 40(11): 2725-2739, 2017) and local greedy iteratively, which can be used to solve the energy model. 3) In post-optimation, we select some areas with severe occlusion and fewer matching clues for post-interpolation fitting to optimize the results. The experiment shows that the proposed method reduced the average percentage of bad pixels (in bad 3) to 6.06 on the Middlebury dataset and 1.42 on the KITTI dataset. Finally, we compare our results with those of MC-Cnn (Zbontar and LeCun 2015), CF-Net (Shen et al., 2021), Guided-Stereo (Poggi et al., 2019), Gwc-Net (Guo et al., 2019) and Patchmatch-Net(PM-Net) (Wang et al., 2021) to verify the improved speed and accuracy of our algorithm, especially at recognizing the depth of changing edges and small objects. This paper's relevant research can contribute to practical engineering practices such as assisted vision, i
The explosive growth of image data facilitates the fast development of imageprocessing and computer vision methods for emerging visual applications, meanwhile introducing novel distortions to processed images. This p...
详细信息
The explosive growth of image data facilitates the fast development of imageprocessing and computer vision methods for emerging visual applications, meanwhile introducing novel distortions to processed images. This poses a grand challenge to existing blind image quality assessment (BIQA) models, which are weak at adapting to subpopulation shift. Recent work suggests training BIQA methods on the combination of all available human-rated IQA datasets. However, this type of approach is not scalable to a large number of datasets and is cumbersome to incorporate a newly created dataset as well. In this paper, we formulate continual learning for BIQA, where a model learns continually from a stream of IQA datasets, building on what was learned from previously seen data. We first identify five desiderata in the continual setting with three criteria to quantify the prediction accuracy, plasticity, and stability, respectively. We then propose a simple yet effective continual learning method for BIQA. Specifically, based on a shared backbone network, we add a prediction head for a new dataset and enforce a regularizer to allow all prediction heads to evolve with new data while being resistant to catastrophic forgetting of old data. We compute the overall quality score by a weighted summation of predictions from all heads. Extensive experiments demonstrate the promise of the proposed continual learning method in comparison to standard training techniques for BIQA, with and without experience replay. We made the code publicly available at https://***/zwx8981/BIQA_CL.
Writing in air has become a significant research area in imageprocessing and pattern recognition, contributing to automation and improving human-machine interfaces in various applications. Object tracking, a crucial ...
详细信息
In recent 10 years, deep learning has successfully shown its effectiveness in various computer vision fields such as autonomous vehicles, robotics, and AI surveillance. Numerous machinevision AI systems have been acc...
详细信息
Medicinal plants have long been the foundation of the medical system and a source of health and healing, but many people nowadays are unaware of these priceless natural resources or the range of possible applications ...
详细信息
This paper investigates the potential of advanced object detection technologies to automate and enhance the accuracy and efficiency of the vote counting process in democratic elections that utilize paper-based ballots...
详细信息
Automated segmentation of medical imagevolumes promises to reduce costly medical experts' time for annotation. However, using machine learning for the task is challenging due to variations in imaging modalities a...
详细信息
ISBN:
(纸本)9798400710759
Automated segmentation of medical imagevolumes promises to reduce costly medical experts' time for annotation. However, using machine learning for the task is challenging due to variations in imaging modalities and scarcity of patient data. While interactive image segmentation methods and foundational models incorporating user-provided prompts to refine segmentation masks have shown promise, they overlook crucial sequential information between the slices in 3D medical imagevolumes and videos, resulting in discontinuities in the segmentation results. This paper proposes a new framework that dynamically updates model parameters during inference in a test time training framework using user-provided scribbles. Our framework preserves acquired knowledge from the previous slices of the current medical volume and the training dataset via student-teacher learning. We evaluate our method on diverse CT, MRI, and microscopic cell datasets. Our framework significantly reduces user annotation time by a factor of 6.72x. Compared to other interactive segmentation methods, we reduce the time by a factor of 2.64x. Our method also outperforms prompting foundation models for segmentation by achieving a dice score of 0.9 in 3-4 interactions compared to 5-8 user interactions for the foundation model, significantly reducing annotation time for the CT and MRI volumes.
Municipal solid waste (MSW) management currently requires critical attention in ensuring the best principles of socio-economic attributes such as environmental protection, economic sustainability, and mitigation of hu...
详细信息
Municipal solid waste (MSW) management currently requires critical attention in ensuring the best principles of socio-economic attributes such as environmental protection, economic sustainability, and mitigation of human health problems. Numerous surveys on the waste management system reveal that approximately 90% of the MSW systems are improperly disposing the wastages in open dumps and landfills. Classifying the wastages into biodegradable and non-biodegradable helps converting them into usable energy and disposing properly. The advancements of effective computational approaches like artificial intelligence and imageprocessing provide wide range of solutions for the present problem identified in MSW management. The computational approaches can be programmed to classify wastes that help to convert them into usable energy. Existing methods of waste classification in MSW remain unresolved due to poor accuracy and higher error rate. This paper presents an experimented effective computer vision-based MSW management solution with the help of the Internet of Things (IoT), and machine learning (ML) techniques namely regression, classification, clustering, and correlation rules for the perception of solid waste images. A ground-up built convolutional neural network (CNN) and CNN by the inception of ResNet v2 models trained through transfer learning for image classification. ResNet v2 supports training large datasets in deep neural networks to achieve improved accuracy and reduced error rate in identity mapping. In addition, batch normalization and mixed hybrid pooling techniques are incorporated in CNN to improve stability and yield state of art performance. The proposed model identifies the type of waste and classifies them as biodegradable or non-biodegradable to collect in respective waste bins precisely. Furthermore, observation of performance metrics, accuracy, and loss ensures the effective functions of the proposed model compared to other existing models. The propo
暂无评论