Collaborative filtering techniques have been applied to personalised recommendation systems in recommender systems. However, with the gradual increase in the number of platform users and goods, the user rating data on...
详细信息
Accurate detail extraction from infrared images is fundamental for subsequent analysis tasks such as target detection, fault diagnosis, and thermal anomaly identification, as these details often contain critical infor...
详细信息
ISBN:
(纸本)9798400712647
Accurate detail extraction from infrared images is fundamental for subsequent analysis tasks such as target detection, fault diagnosis, and thermal anomaly identification, as these details often contain critical information about object boundaries, thermal variations, and potential defects. However, extracting fine details from such images remains challenging due to their inherent characteristics of low contrast, high noise, and complex parameter tuning requirements in traditional processing methods. This paper proposes a novel high dynamic range infrared image detail extraction method addressing the difficulties of traditional infrared imageprocessing in detail extraction and parameter adjustment. Based on grouped deformable convolution and depth-wise separable convolution, we propose a lightweight adaptive feature sampling module (LAFSM) that can effectively achieve precise detail and basic component separation of high-bit infrared images. Compared with traditional algorithms, our method requires no manual parameter adjustment and significantly reduces parameter count. Experimental results demonstrate that this method performs excellently in high-bit infrared image detail extraction, with significant advantages in edge preservation and noise suppression, providing a new technical approach for high dynamic range infrared image preprocessing.
With the increase of information, the traditional imageprocessing technology still has a big shortage in image quality and processing speed. Therefore, how to improve the image quality and processing speed is a resea...
详细信息
With the increase of information, the traditional imageprocessing technology still has a big shortage in image quality and processing speed. Therefore, how to improve the image quality and processing speed is a research hotspot. To solve the above problems, we adopt the method of combining the generative countermeasure network with the super-resolution reconstruction, and use the open data set to train the model. Through the reasonable selection and optimization of the imageprocessing algorithm, we can achieve the goal of improving the image super-resolution reconstruction effect. In order to improve the slow imageprocessing speed, we set up Spark cluster to integrate computing resources. The experimental results show that the method based on the generation of countermeasure network has a good effect for high resolution image reconstruction, and this method can expand the resolution of low resolution image by four times; The cluster based on the Spark platform shows better performance advantages in processing a large number of image data. When the hardware indicators of the cluster environment and the stand-alone environment are consistent, the cluster based on the Spark platform can increase the running speed of the program by about 10%. When the data volume reaches the tera-byte level or even the petabyte level, the operation speed of the system can still be improved by increasing the overall running memory of the cluster.
Recent accelerations in multi-modal applications have been made possible with the plethora of image and text data available online. However, the scarcity of analogous data in the medical field, specifically in histopa...
详细信息
Because gesture recognition is more and more useful in life, gesture recognition should not only consider the changes of gestures, but also be affected by the environment. Therefore, there are high requirements for th...
详细信息
Deep Learning and imageprocessing is a key concept in today's world of computational art, where artists employed AI algorithms to generate visuals. This paper explores AI-generated images, using Convolutional Neu...
详细信息
ISBN:
(纸本)9781450384209
Deep Learning and imageprocessing is a key concept in today's world of computational art, where artists employed AI algorithms to generate visuals. This paper explores AI-generated images, using Convolutional Neural Networks software as a paradigm of symbolic AI creative systems, and contextualizes the use of modern imageprocessing technologies to create visual artworks. It discusses the methodologies and strategies used to make art using AI algorithms, manipulating them with processing software tool. The discussion focuses on CNN (Convolutional Neural Network) and processing software (Java) as the main technologies used in distinct fields to generate images. My conception of technical images provides a conceptual framework for examining the qualities and attributes of AI-generated images.
Polynomial Networks (PNs) have demonstrated promising performance on face and image recognition recently. However, robustness of PNs is unclear and thus obtaining certificates becomes imperative for enabling their ado...
详细信息
ISBN:
(纸本)9781713871088
Polynomial Networks (PNs) have demonstrated promising performance on face and image recognition recently. However, robustness of PNs is unclear and thus obtaining certificates becomes imperative for enabling their adoption in real-world applications. Existing verification algorithms on ReLU neural networks (NNs) based on classical branch and bound (BaB) techniques cannot be trivially applied to PN verification. In this work, we devise a new bounding method, equipped with BaB for global convergence guarantees, called Verification of Polynomial Networks or VPN for short. One key insight is that we obtain much tighter bounds than the interval bound propagation (IBP) and DeepT-Fast [Bonaert et al., 2021] baselines. This enables sound and complete PN verification with empirical validation on MNIST, CIFAR10 and STL10 datasets. We believe our method has its own interest to NN verification. The source code is publicly available at https://***/megaelius/PNVerification.
This project aims to develop a solar powered auto irrigation system that incorporates imageprocessing techniques to monitor and maintain the health of the plants. The system utilizes solar energy to power the irrigat...
详细信息
ISBN:
(数字)9798331508692
ISBN:
(纸本)9798331508708
This project aims to develop a solar powered auto irrigation system that incorporates imageprocessing techniques to monitor and maintain the health of the plants. The system utilizes solar energy to power the irrigation process, making it environmentally friendly and cost-effective. imageprocessingalgorithms are employed to analyze the visual characteristics of the plants and detect any signs of distress or disease. This information is then used to automatically adjust the irrigation schedule and provide necessary nutrients to ensure optimal plant health. Furthermore, the system integrates imageprocessing techniques to analyze the health of the plants. images of the plants are captured using a camera module and processed to detect any signs of diseases or abnormalities. The system then provides real-time feedback on the plant health status. Through the implementation of this project, the efficiency and effectiveness of irrigation systems can be improved, leading to better crop yields and reduced water usage.
Deep unfolding compressive sensing (CS) has experienced remarkable advancements. However, there still exist two challenges: (1) Many algorithms either use uniform block-based sampling, which ignore the fact that the c...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
Deep unfolding compressive sensing (CS) has experienced remarkable advancements. However, there still exist two challenges: (1) Many algorithms either use uniform block-based sampling, which ignore the fact that the content of different blocks is different, or allocate the sampling rate referring to complete signal before CS sampling, which is not always feasible in real-world scenarios. (2) Traditional CNN is difficult to capture broader contextual priors during iterative recovery. In this paper, we propose a novel network ASMFNet to solve the above two issues. Specifically, to address the first issue, we introduce a dual-branch network featuring a basic sampling branch to acquire reference image and an adaptive sampling branch by median filtering for allocating remaining sampling rate adaptively. For the second problem, we use Swin Transformer and feature fusion block to increase the feature interactions. Experimental results demonstrate that our proposed method outperforms existing methods.
Frozen pretrained models have become a viable alternative to the pretraining-then-finetuning paradigm for transfer learning. However, with frozen models there are relatively few parameters available for adapting to do...
详细信息
ISBN:
(纸本)9781713871088
Frozen pretrained models have become a viable alternative to the pretraining-then-finetuning paradigm for transfer learning. However, with frozen models there are relatively few parameters available for adapting to downstream tasks, which is problematic in computer vision where tasks vary significantly in input/output format and the type of information that is of value. In this paper, we present a study of frozen pretrained models when applied to diverse and representative computer vision tasks, including object detection, semantic segmentation and video action recognition. From this empirical analysis, our work answers the questions of what pretraining task fits best with this frozen setting, how to make the frozen setting more flexible to various downstream tasks, and the effect of larger model sizes. We additionally examine the upper bound of performance using a giant frozen pretrained model with 3 billion parameters (SwinV2-G) and find that it reaches competitive performance on a varied set of major benchmarks with only one shared frozen base network: 60.0 box mAP and 52.2 mask mAP on COCO object detection test-dev, 57.6 val mIoU on ADE20K semantic segmentation, and 81.7 top-1 accuracy on Kinetics-400 action recognition. With this work, we hope to bring greater attention to this promising path of freezing pretrained image models.
暂无评论