Hyperspectral remote sensing images (HSIs) capture detailed spectral characteristics of features, while multi- spectral remote sensing images (MSIs) provide clear spatial distribution. Fusing these two types of images...
详细信息
Hyperspectral remote sensing images (HSIs) capture detailed spectral characteristics of features, while multi- spectral remote sensing images (MSIs) provide clear spatial distribution. Fusing these two types of images can enhance feature identification and classification accuracy. Current deep learning algorithms achieve high fusion quality but struggle with balancing global effective perception and lightweight computation. Moreover, these algorithms typically discretely handle data mapping, which contrasts with the continuous nature of the world. Recently, the Mamba has shown significant potential for complex long-range modeling, addressing the computational complexity of global perception. Concurrently, implicit neural representation (INR) offers high-quality solutions for continuous domain modeling. To this end, this study introduces a novel network architecture that combines Mamba and INR, termed the Mamba cooperative INR fusion network (MCIFNet). MCIFNet effectively captures global image information and generates fused images in a continuous domain through pointto-point processing. The network comprises two main units: potential space projection and semantic extraction and fusion. The potential space projection unit performs shallow encoding of hyperspectral and MSIs, mapping them to a latent feature space. The semantic extraction and fusion unit (SEFU) uses scale adaptive residual state spatial and implicit spatial-spectral fusion (ISSF) modules to extract deep features from the bimodal images, generating fused images point-by-point. A series of fusion experiments with 4x, 8x, and 16x scale factors demonstrate that MCIFNet surpasses popular algorithms in both spatial detail and spectral information reconstruction, while also providing more lightweight performance. The code for MCIFNet will be shared on https://***/chunyuzhu/MCIFNet.
computational tomography (CT) provides high-resolution medical imaging, but it can expose patients to high radiation. X-ray scanners have low radiation exposure, but their resolutions are low. This paper proposes a ne...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
computational tomography (CT) provides high-resolution medical imaging, but it can expose patients to high radiation. X-ray scanners have low radiation exposure, but their resolutions are low. This paper proposes a new conditional diffusion model, DX2CT, that reconstructs three-dimensional (3D) CT volumes from bi or mono-planar X-ray image(s). Proposed DX2CT consists of two key components: 1) modulating feature maps extracted from two-dimensional (2D) X-ray(s) with 3D positions of CT volume using a new transformer and 2) effectively using the modulated 3D position-aware feature maps as conditions of DX2CT. In particular, the proposed transformer can provide conditions with rich information of a target CT slice to the conditional diffusion model, enabling high-quality CT reconstruction. Our experiments with the bi or mono-planar X-ray(s) benchmark datasets show that proposed DX2CT outperforms several state-of-the-art methods. Our codes and model will be available at: https://***/intyeger/DX2CT.
Constructive understanding of computational principles of multimedia, signal and visual information processing, perception and cognition is one of the most fundamental challenges of contemporary science. Deeper insigh...
Constructive understanding of computational principles of multimedia, signal and visual information processing, perception and cognition is one of the most fundamental challenges of contemporary science. Deeper insight into such computationalintelligence helps to advance intelligent systems research to achieve robust performance. Implementing integrated principles in artificial systems may help us achieve better, faster and more efficient intelligent systems. The symposium will address theory and applications of non-traditional computationalintelligence approaches in multimedia, signal and vision processing.
This paper outlines a solution to the multi-channel synthetic aperture radar (SAR) moving target indication and detection (MTI/MTD) by means of inverse systems approach. A novel model of the problem is presented and a...
详细信息
ISBN:
(纸本)9781424407071
This paper outlines a solution to the multi-channel synthetic aperture radar (SAR) moving target indication and detection (MTI/MTD) by means of inverse systems approach. A novel model of the problem is presented and an approximate analytic solution to it will be given. It will be demonstrated how a moving target indicator can benefit from a multichannel SAR system as opposed to a traditional approach that separates MTI and SAR systems. It will be shown that the problem of separation of moving targets from stationary ones can be solved completely by using multi-channel approach and in such a way that a spatial distribution of the stationary targets does not play a role.
This paper tackles the fish motion synthesis problem with the aim to increase the temporal resolution by generating a high frame rate video from a given video captured at a lower frame rate. The proposed approach reco...
详细信息
ISBN:
(纸本)9781467359177
This paper tackles the fish motion synthesis problem with the aim to increase the temporal resolution by generating a high frame rate video from a given video captured at a lower frame rate. The proposed approach reconstructs the virtual frames via estimating motion-vector fields that represent the correspondences between consecutive frames. To be more specific, the proposed approach utilizes two regularization functions: (i) an intra regularization function that is proposed to impose a local smoothness constraint on the estimated motion vector fields;and (ii) an inter regularization function that considers both intensity consistency and kinematic consistency between the reconstructed frame and its neighboring frames. Real-world fish motion video dataset is used to demonstrate the superior performance of the proposed approach in high temporal resolution fish motion synthesis, as well as its application in fish motion analysis.
Due to frost and insufficient exposure to sunlight, some grape bunches remain undeveloped during harvesting. For automation of harvesting, it is required to automatically identify the mature grape bunches. This paper ...
详细信息
ISBN:
(纸本)9781479945030
Due to frost and insufficient exposure to sunlight, some grape bunches remain undeveloped during harvesting. For automation of harvesting, it is required to automatically identify the mature grape bunches. This paper presents a sequence of imageprocessing and computationalintelligence methods to identify mature grape bunches. It's a two-step process where in the first step the grape bunches are separated from the background of an image and in the second step the grape bunch is classified into mature and undeveloped group. We achieved 96.88% accuracy on the images obtained from a strip of vineyard in Cambridge, Tasmania.
In this paper, a new thresholding approach for data denoising is presented. The approach is based minimum noiseless description length (MNDL), a new method for optimum sub-space selection in data representation. By us...
详细信息
ISBN:
(纸本)9781424407071
In this paper, a new thresholding approach for data denoising is presented. The approach is based minimum noiseless description length (MNDL), a new method for optimum sub-space selection in data representation. By using the observed noisy data, this information theoretic approach provides the optimum threshold that minimizes the description length of the noiseless signal. Comparison of the new method with the existing thresholding methods is provided.
This paper presents a neural network based approach for vehicle classification. The proposed vehicle classification approach extracts various features from a vehicle image, normalises and classifies them into one of t...
详细信息
ISBN:
(纸本)9781424407071
This paper presents a neural network based approach for vehicle classification. The proposed vehicle classification approach extracts various features from a vehicle image, normalises and classifies them into one of the known classes. It is based on structural features and a direct solution training method. The preliminary experiments on training and testing of 4 types of vehicles patterns were conducted. The experimental results are very promising and demonstrate the effectiveness and usefulness of the proposed approach.
We have used advanced signalprocessing and innovative imageprocessing methods that are used outside the operating room. The software is written in C++ in a windows environment and can be used on any PC. Applications...
详细信息
ISBN:
(纸本)0780377834
We have used advanced signalprocessing and innovative imageprocessing methods that are used outside the operating room. The software is written in C++ in a windows environment and can be used on any PC. Applications will be discussed along with computationalintelligence as used for diagnostic purposes and as visualization aids inside and outside the operating room. Subjects to be discussed, include blood cell classification, mammography, evoked potentials, ophthalmology, EEG and field potentials in Parkinson's disease. The latter methods along with 3D reconstruction of MRI images of Parkinson's patients, are currently used in the operating room for target assessment and electrode placement.
暂无评论