image compression and denoising are two fundamental problems in imageprocessing and have many real-world applications. In recent years, learning-based compression methods have achieved promising rate-distortion perfo...
详细信息
The paper explores what is 2 Dimensional Fourier Transform and how it can be used for imageprocessing, specifically denoising and edge enhancing. Starting from the Fourier theorem and Fourier series, the paper traver...
详细信息
With the advent of the digital age, the fields of computer vision and digital imageprocessing have developed rapidly. Process image plays an important role in the automatic production process. However, due to the com...
详细信息
Unlike ordinary omnidirectional imaging, which produces underexposure or overexposure of some areas under wide range of lighting conditions, tone mapping high dynamic range omnidirectional image (denoted as TM-HOI) pr...
详细信息
ISBN:
(纸本)9798350319439
Unlike ordinary omnidirectional imaging, which produces underexposure or overexposure of some areas under wide range of lighting conditions, tone mapping high dynamic range omnidirectional image (denoted as TM-HOI) presents more detailed information with high dynamic range (HDR) techniques. Aiming at the multiple distortions in TM-HOI processing, this paper proposes a blind TM-HOI quality metric with multi-regions and multi-levels analysis. Considering the conversion of image projection formats and the unique distortion caused by user behavior in immersive environments, feature extraction in the proposed metric is divided into the viewport based module and the Crasher parabolic projection based module. At the same time, considering the coding distortion, tone mapping (TM) distortion and mixed distortion in the TM-HOI and the different manifestations of this mixed distortion in different regions, this paper further extracts perceptual features to characterize image distortion by performing bit plane layer decomposition and detail/basic layer decomposition on different regions of TM-HOI. Finally, the extracted perceptual features are used as the input of Random forest to construct the nonlinear relationship between the feature space and subjective opinion scores. The experimental results indicate that the proposed metric has better consistency with the human visual system compared to representative blind quality metrics.
Deep learning techniques have demonstrated their ability to facilitate medical image diagnostics by offering more precise and accurate predictions. Convolutional neural network (CNN) architectures have been employed f...
详细信息
ISBN:
(纸本)9798350343557
Deep learning techniques have demonstrated their ability to facilitate medical image diagnostics by offering more precise and accurate predictions. Convolutional neural network (CNN) architectures have been employed for a decade as the primary approach to enable automated diagnosis. On the other hand, recently proposed vision transformers (ViTs) based architectures have shown success in various computer vision tasks. However, their efficacy in medical image classification tasks remains largely unexplored due to their requirement for large datasets. Nevertheless, significant performance gains can be achieved by leveraging self-supervised learning techniques through pretraining. This paper analyzes performance of self-supervised pretrained networks in medical image classification tasks. Results on colon histopathology images revealed that CNN based architectures are more effective when trained from scratch, while pretrained models could achieve similar levels of performance with limited data.
As an important part of the mobile robot platform to perceive the external environment, the computer transmits the collected real-time images to the processing unit. After the image information is analyzed and process...
As an important part of the mobile robot platform to perceive the external environment, the computer transmits the collected real-time images to the processing unit. After the image information is analyzed and processed, different functions are realized according to the actual needs. imageprocessing technology is a widely used technology type derived from the continuous development of computer technology. Using it in fruit detection and grading can improve the efficiency of fruit detection and grading, optimize the standard of fruit detection and grading, achieve the purpose of controlling the cost of manual detection and grading, and ensure the profit of fruit industry. We know that the human brain is the most effective biological intelligence system known, and the ability of human visual system to recognize, process and process images far exceeds any existing computer and information processing system. This paper implements imageprocessing technology based on computer vision algorithm, so as to express the actual coordinates of objects in 3D space through 3D voxels, and can also correct the distorted images caused by projection. Compared with traditional BP neural network, imageprocessing technology based on computer vision algorithm has more advantages and higher accuracy.
Multi-institutional collaboration is an emerging deployment of medical imaging processing with the goal to address the scarce annotation problem. While most of the efforts in this domain focus on the supervised machin...
详细信息
ISBN:
(纸本)9798400701023
Multi-institutional collaboration is an emerging deployment of medical imaging processing with the goal to address the scarce annotation problem. While most of the efforts in this domain focus on the supervised machine learning models and the model performance improvement, there lack the discussion about the distributed system performance, such as the trade-off between collaboration and efficiency, i.e., communication cost and processing time. In this work, we propose a distributed system based on deep reinforcement learning for medical image segmentation. Preliminary experiments are conducted on single and multiple CPU and GPU environments to demonstrate the system performance and the trade-off. We highlight some insights for better designs of multi-institutional collaboration in the future.
This paper designs a digital signal processor based on computer virtual reality technology. The system uses multi-core cooperative work to complete data processing and storage. The system consists of four layers: hard...
This paper designs a digital signal processor based on computer virtual reality technology. The system uses multi-core cooperative work to complete data processing and storage. The system consists of four layers: hardware abstraction layer, system core layer, distributed message communication mechanism and system service layer. Its multi-task kernel implements preemptive scheduling based on priority, synchronization and communication primitives between tasks, real-time interrupt processing and application-oriented cache management mechanism. The system also designed LCD module and keyboard module as man-machine interaction interface. The hardware interface diagram is given. The realization of software programming is also discussed. The experimental results show that multi-core collaborative processing is used to complete data processing and storage functions. The single-core resource utilization rate is effectively reduced by 26%.
Sonar image segmentation technique is crucial for underwater target tracking, among other things. Due to the undersea environment's influence, noise is easily absorbed, which leads to a poor tracking performance. ...
详细信息
Accurate assessment of medical image quality is of paramount importance as it forms the cornerstone of diagnoses, where even minor discrepancies can lead to misdiagnoses. To achieve this, two key aspects, realism, and...
Accurate assessment of medical image quality is of paramount importance as it forms the cornerstone of diagnoses, where even minor discrepancies can lead to misdiagnoses. To achieve this, two key aspects, realism, and comprehensibility, are employed to evaluate the quality of medical images. Factors such as image richness, clarity, and saliency are analyzed to precisely reflect the image's reliability. However, due to variations in data originating from different hospitals, medical professionals, and equipment, internal structures may exhibit significant differences. To address this issue, this study proposes a multimodal data-based medical image fusion model that captures the correlation and complementary information between heterogeneous data. By combining the regional Laplacian pyramid (LLP) with convolutional neural networks, the convolutional network is enhanced. The source images are decomposed using LLP, and an improved deep convolutional neural network generates an optimal weight map to guide the fusion process, resulting in a fused image. Experimental results demonstrate that the proposed method achieves favorable fusion effects in both subjective visual assessments and objective evaluation metrics.
暂无评论