Generative models are widely utilized to model the distribution of fused images in the field of infrared and visible image fusion. However, current generative models based fusion methods often suffer from unstable tra...
详细信息
ISBN:
(纸本)9789819786848;9789819786855
Generative models are widely utilized to model the distribution of fused images in the field of infrared and visible image fusion. However, current generative models based fusion methods often suffer from unstable training and slow inference speed. To tackle this problem, a novel fusion method based on consistency model is proposed, termed as CoMoFusion, which can generate high-quality images and achieve fast image inference speed. In specific, consistency model is used to construct multi-modal joint features in the latent space with the forward and reverse process. Then, the infrared and visible features extracted by the trained consistency model are fed into fusion module to generate the final fused image. In order to enhance the texture and salient information of fused images, a novel loss based on pixel value selection is also designed. Extensive experiments on public datasets illustrate that our method obtains the SOTA fusion performance compared with the existing fusion methods. The code is available at https://***/ZhimingMeng/CoMoFusion.
Noise removal is one of the most commonly used processes in computer vision. Noise removal improves the quality of the image, thereby improving the performance of computer vision algorithms and providing user pleasing...
详细信息
This paper presents experimental research aimed at comparing the academic achievement of students who have studied with an interactive digital lesson on using basic image editing programs designed for vocational certi...
详细信息
Sample adaptive offset (SAO) is applied for reducing sample distortion and attenuating ringing artifacts in both HEVC and VVC standards. The rate-distortion optimization process is used to select the best SAO paramete...
详细信息
Global shared memories of petabytes are increasingly useful for applications, but traditional page-based techniques do not scale (limited reach), and scalable techniques such as segments fail to provide needed localit...
详细信息
Discriminative Dimension Selection (DDS) has emerged as a powerful tool for identifying the most relevant features in high-dimensional datasets, enabling interpretable data analysis and visualization. This paper explo...
详细信息
Streaming graph has been broadly employed across various application domains. It involves updating edges to the graph and then performing analytics on the updated graph. However, existing solutions either suffer from ...
详细信息
Agent-Based Modeling and Simulation (ABMS) has been increasingly applied in various research fields, thanks to the capability of these models to describe fine-grained realworld behavior and to the ease of interpretati...
详细信息
Precise 3D object detection plays a pivotal role within the perception module of autonomous vehicles. Many approaches have shown promising results for 3D object detection with lidar. Voxel-based methods are computatio...
详细信息
ISBN:
(纸本)9798350373141;9798350373158
Precise 3D object detection plays a pivotal role within the perception module of autonomous vehicles. Many approaches have shown promising results for 3D object detection with lidar. Voxel-based methods are computationally efficient but suffer from information loss, while point-based methods handle irregular data but require extensive computations and optimization processes. In this paper, a high-performance 3D object detection framework named Frustum PointVoxel-RCNN has been developed. Using 2D object detection with frustums effectively mitigates the presence of irrelevant point clouds associated with the objects. Afterwards, 3D Voxel CNN and PointNet-based set abstraction are used to learn the features of the frustum-processed point clouds. Our method significantly reduces the computation cost of frustum-processed point clouds in 3D Voxel CNN and PointNet-based networks. Extensive experiments on the KITTI dataset shown our proposed method surpassed stae-of-the-art 3D detection methods when accurate 2D detections are precise. Specifically, it yielded significant speed advantages and improved accuracy when the scene was at a hard level. Our proposed method achieves a 50% improvement in computational efficiency compared to PV-RCNN.
Music genre classification is essential for organizing music libraries and enhancing recommendation systems. This paper evaluates four lightweight models combining Mel Frequency Cepstral Coefficients (MFCCs) and Chrom...
详细信息
暂无评论