Search Results - Inner Mongolia University Library

2025 IEEE Symposium on Trustworthy, Explainable and Responsible Computational Intelligence, CITREx 2025, 2025

Authors: Vellido, Alfredo

Mamba Collaborative Implicit Neural Representation for Hyperspectral and Multispectral Remote Sensing Image Fusion

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, vol. 63

Authors: Zhu, Chunyu; Deng, Shangqi; Song, Xuan; Li, Yachao; Wang, Qi
Affiliations: Xidian Univ Hangzhou Inst Technol Hangzhou 311231 Zhejiang Peoples R China; Xi An Jiao Tong Univ Natl Key Lab Human Machine Hybrid Augmented Intell Natl Engn Res Ctr Visual Informat & Applicat Inst Artificial Intelligence & Robot Xian 710049 Shaanxi Peoples R China; Northwestern Polytech Univ Sch Astronaut Xian 710072 Peoples R China; Xidian Univ Natl Key Lab Radar Signal Proc Xian 710071 Shaanxi Peoples R China

Hyperspectral remote sensing images (HSIs) capture detailed spectral characteristics of features, while multispectral remote sensing images (MSIs) provide clear spatial distribution. Fusing these two types of images can enhance feature identification and classification accuracy. Current deep learning algorithms achieve high fusion quality but struggle with balancing global effective perception and lightweight computation. Moreover, these algorithms typically discretely handle data mapping, which contrasts with the continuous nature of the world. Recently, the Mamba has shown significant potential for complex long-range modeling, addressing the computational complexity of global perception. Concurrently, implicit neural representation (INR) offers high-quality solutions for continuous domain modeling. To this end, this study introduces a novel network architecture that combines Mamba and INR, termed the Mamba cooperative INR fusion network (MCIFNet). MCIFNet effectively captures global image information and generates fused images in a continuous domain through point-to-point processing. The network comprises two main units: potential space projection and semantic extraction and fusion. The potential space projection unit performs shallow encoding of hyperspectral and MSIs, mapping them to a latent feature space. The semantic extraction and fusion unit (SEFU) uses scale adaptive residual state spatial and implicit spatial-spectral fusion (ISSF) modules to extract deep features from the bimodal images, generating fused images point-by-point. A series of fusion experiments with 4×, 8×, and 16× scale factors demonstrate that MCIFNet surpasses popular algorithms in both spatial detail and spectral information reconstruction, while also providing more lightweight performance. The code for MCIFNet will be shared on https://***/chunyuzhu/MCIFNet.

Keywords: Feature extraction; Accuracy; Spatial resolution; Matrix decomposition; Optimization; Deep learning; Sparse approximation; Image fusion; Hyperspectral imaging; Transformers; Hyperspectral and multispectral image fusion; implicit neural representation (INR); Mamba; Mamba cooperative INR fusion network (MCIFNet); state-space model (SSM)

IEEE Symposium on CI for Financial Engineering and Economics (IEEE CiFer 2025)

2025 IEEE Symposium on Computational Intelligence in Natural Language Processing and Social Media, CI-NLPSoMe 2025, 2025

Authors: Mizuta, Takanobu

IEEE Symposium on CI in Health and Medicine (IEEE CIHM 2025)

2025 IEEE Symposium on Computational Intelligence in Natural Language Processing and Social Media, CI-NLPSoMe Companion 2025, 2025

Authors: Plagianakos, Vassilis P.

TROI: Cross-Subject Pretraining with Sparse Voxel Selection for Enhanced fMRI Visual Decoding

2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025

Authors: Wang, Ziyu; Pan, Tengyu; Li, Zhenyu; Wu, Ji; Li, Xiuxing; Wang, Jianyong
Affiliations: Department of Computer Science and Technology, Tsinghua University, Beijing, China; School of Computer Science and Engineering, Beihang University, Beijing, China; School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China

ISBN: (print) 9798350368741

fMRI (functional Magnetic Resonance Imaging) visual decoding involves decoding the original image from brain signals elicited by visual stimuli. This often relies on manually labeled ROIs (Regions of Interest) to select brain voxels. However, these ROIs can contain redundant information and noise, reducing decoding performance. Additionally, the lack of automated ROI labeling methods hinders the practical application of fMRI visual decoding technology, especially for new subjects. This work presents TROI (Trainable Region of Interest), a novel two-stage, data-driven ROI labeling method for cross-subject fMRI decoding tasks, particularly when subject samples are limited. TROI leverages labeled ROIs in the dataset to pretrain an image decoding backbone on a cross-subject dataset, enabling efficient optimization of the input layer for new subjects without retraining the entire model from scratch. In the first stage, we introduce a voxel selection method that combines sparse mask training and low-pass filtering to quickly generate the voxel mask and determine input layer dimensions. In the second stage, we apply a learning rate rewinding strategy to fine-tune the input layer for downstream tasks. Experimental results on the same small sample dataset as the baseline method for brain visual retrieval and reconstruction tasks show that our voxel selection method surpasses the state-of-the-art method MindEye2 with an annotated ROI mask. © 2025 IEEE.

Keywords: computational neuroscience; multi-modal learning; region of interest analysis; small sample learning

Latent Weight Quantization for Integerized Training of Deep Neural Networks

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, vol. 47, no. 4, pp. 2816-2832

Authors: Fei, Wen; Dai, Wenrui; Zhang, Liang; Zhang, Luoming; Li, Chenglin; Zou, Junni; Xiong, Hongkai
Affiliations: Shanghai Jiao Tong Univ Dept Elect Engn Shanghai 200240 Peoples R China; Shanghai Jiao Tong Univ Dept Comp Sci & Engn Shanghai 200240 Peoples R China; Donghua Univ Sch Comp Sci & Technol Shanghai 201620 Peoples R China; Zhejiang Univ Key Lab Biomed Engn Minist Educ Hangzhou 310027 Peoples R China

Existing methods for integerized training speed up deep learning by using low-bitwidth integerized weights, activations, gradients, and optimizer buffers. However, they overlook the issue of full-precision latent weights, which consume excessive memory to accumulate gradient-based updates for optimizing the integerized weights. In this paper, we propose the first latent weight quantization schema for general integerized training, which minimizes quantization perturbation to training process via residual quantization with optimized dual quantizer. We leverage residual quantization to eliminate the correlation between latent weight and integerized weight for suppressing quantization noise. We further propose dual quantizer with optimal nonuniform codebook to avoid frozen weight and ensure statistically unbiased training trajectory as full-precision latent weight. The codebook is optimized to minimize the disturbance on weight update under importance guidance and achieved with a three-segment polyline approximation for hardware-friendly implementation. Extensive experiments show that the proposed schema allows integerized training with lowest 4-bit latent weight for various architectures including ResNets, MobileNetV2, and Transformers, and yields negligible performance loss in image classification and text generation. Furthermore, we successfully fine-tune Large Language Models with up to 13 billion parameters on one single GPU using the proposed schema.

Keywords: Quantization (signal); Training; Perturbation methods; Memory management; Hardware; Trajectory; Random access memory; Graphics processing units; Computational modeling; Noise; Integerized training; deep neural network quantization; latent weight; dual quantizer; large language models

Image Test Libraries for the in-field test of ultra-low-power devices

26th IEEE Latin American Test Symposium, LATS 2025

Authors: Porsia, Antonio; Perlo, Giacomo; Ruospo, Annachiara; Sanchez, Ernesto
Affiliations: Politecnico di Torino, DAUIN, Turin, Italy

ISBN: (print) 9781665477635

In recent years, research and technology advancements have driven exponential growth in the adoption of Artificial Intelligence (AI)-based systems, even in safety-critical contexts such as autonomous driving and healthcare applications. The joint effort of academia and industry has yielded techniques and standards with the objective of ensuring the safe operation of AI-based technology. In the specific context of Convolutional Neural Networks (CNNs) running on GPUs, Image Test Libraries (ITLs) have been proposed as an effective method for performing on-line functional testing of GPU multipliers. This is achieved by launching the inference of a set of test images containing a set of ATPG-generated functional test patterns. However, while the demand for computational power for DNN models is constantly increasing, another branch of Machine Learning (ML) research, namely TinyML, focuses on minimizing the computational requirements of DNN models in order to bring AI capabilities to edge devices, whose constraints on power usage, memory space and processing power do not allow for the deployment of conventional DNN models. This research work aims to adapt the ITL technique to CNNs running on ultra-low-power edge hardware, while also overcoming some limitations of GPU ITLs. Experimental results demonstrate that a single test image generated using the proposed method is capable of detecting 96.01% of stuck-at faults occurring in the 32-bit integer multiplier of a RISC-V-based ultra-low-power System-on-Chip executing a quantized CNN. © 2025 IEEE.

Keywords: Convolutional neural networks

DX2CT: Diffusion Model for 3D CT Reconstruction from Bi or Mono-planar 2D X-ray(s)

2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025

Authors: Jeong, Yun Su; Yoo, Hye Bin; Chun, Il Yong
Affiliations: Department of Electrical and Computer Engineering, Sungkyunkwan University, Korea, Republic of; Departments of Artificial Intelligence and Advanced Display Engineering, Sungkyunkwan University, Korea, Republic of; Suwon 16419, Korea, Republic of

ISBN: (print) 9798350368741

Computational tomography (CT) provides high-resolution medical imaging, but it can expose patients to high radiation. X-ray scanners have low radiation exposure, but their resolutions are low. This paper proposes a new conditional diffusion model, DX2CT, that reconstructs three-dimensional (3D) CT volumes from bi or mono-planar X-ray image(s). Proposed DX2CT consists of two key components: 1) modulating feature maps extracted from two-dimensional (2D) X-ray(s) with 3D positions of CT volume using a new transformer and 2) effectively using the modulated 3D position-aware feature maps as conditions of DX2CT. In particular, the proposed transformer can provide conditions with rich information of a target CT slice to the conditional diffusion model, enabling high-quality CT reconstruction. Our experiments with the bi or mono-planar X-ray(s) benchmark datasets show that proposed DX2CT outperforms several state-of-the-art methods. Our codes and model will be available at: https://***/intyeger/DX2CT. © 2025 IEEE.

Keywords: Computed tomography (CT); Diffusion models; Three-dimensional (3D) reconstruction; X-ray radiography

Deep Joint Semantic Coding and Beamforming for Near-Space Airship-Borne Massive MIMO Network

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2025, vol. 43, no. 1, pp. 260-278

Authors: Wu, Minghui; Gao, Zhen; Wang, Zhaocheng; Niyato, Dusit; Karagiannidis, George K.; Chen, Sheng
Affiliations: Beijing Inst Technol Sch Informat & Elect Beijing 100081 Peoples R China; Beijing Inst Technol BIT Zhuhai 519088 Peoples R China; BIT State Key Lab CNS ATM Beijing 100081 Peoples R China; BIT MIIT Key Lab Complex Field Intelligent Sensing Beijing 100081 Peoples R China; BIT Jinan Adv Technol Res Inst Jinan 250307 Peoples R China; BIT Jiaxing Yangtze Delta Reg Acad Jiaxing 314019 Peoples R China; Tsinghua Univ Beijing Natl Res Ctr Informat Sci & Technol Dept Elect Engn Beijing 100084 Peoples R China; Tsinghua Shenzhen Int Grad Sch Shenzhen 518055 Peoples R China; Nanyang Technol Univ Sch Comp Sci & Engn Singapore 639798 Singapore; Aristotle Univ Thessaloniki Dept Elect & Comp Engn Thessaloniki 54124 Greece; Lebanese Amer Univ LAU Artificial Intelligence & Cyber Syst Res Ctr Beirut 03797751 Lebanon; Univ Southampton Sch Elect & Comp Sci Southampton SO17 1BJ England

Near-space airship-borne communication network is recognized to be an indispensable component of the future integrated ground-air-space network thanks to airships' advantage of long-term residency at stratospheric altitudes, but it urgently needs reliable and efficient Airship-to-X link. To improve the transmission efficiency and capacity, this paper proposes to integrate semantic communication with massive multiple-input multiple-output (MIMO) technology. Specifically, we propose a deep joint semantic coding and beamforming (JSCBF) scheme for airship-based massive MIMO image transmission network in space, in which semantics from both source and channel are fused to jointly design the semantic coding and physical layer beamforming. First, we design two semantic extraction networks to extract semantics from image source and channel state information, respectively. Then, we propose a semantic fusion network that can fuse these semantics into complex-valued semantic features for subsequent physical-layer transmission. To efficiently transmit the fused semantic features at the physical layer, we then propose the hybrid data and model-driven semantic-aware beamforming networks. At the receiver, a semantic decoding network is designed to reconstruct the transmitted images. Finally, we perform end-to-end deep learning to jointly train all the modules, using the image reconstruction quality at the receivers as a metric. The proposed deep JSCBF scheme fully combines the efficient source compressibility and robust error correction capability of semantic communication with the high spectral efficiency of massive MIMO, achieving a significant performance improvement over existing approaches.

Keywords: Semantics; Array signal processing; Massive MIMO; Communication systems; Image reconstruction; Iterative decoding; Decoding; Airship base station; beamforming; massive MIMO; deep learning; semantic communication

Design Exploration of DWT-Based Feature Extraction Using FPGA for High-Performance Signal Processing

16th IEEE Latin American Symposium on Circuits and Systems, LASCAS 2025

Authors: Trabes, Emanuel; Zayed, Aymen; Valderrama, Carlos; Tarrillo, Jimmy
Affiliations: Universidad Nacional de San Luis, Department of Electronics, Argentina; University of Mons, Service d'Électronique et de Microélectronique, Mons, Belgium; Research Laboratory of Technology and Medical Imaging Ltim LR12ES06, Faculty of Medicine of Monastir, Tunisia; National Engineering School of Sousse, University of Sousse, Tunisia; Universidad de Ingenieria y Tecnologia, Electrical and Mechatronic Department, Peru

ISBN: (print) 9798331522124

The discrete wavelet transform (DWT) is commonly used for feature extraction in machine learning applications. Since these applications are frequently deployed in portable systems with limited computational resources, FPGA-based hybrid hardware/software solutions might be a viable choice. This article provides an analysis of various 4-level db4 DWT and feature extraction techniques implemented on the Zynq 7020 device. Alternative DWT versions include fixed-point and floating-point implementations, cascade and single-core reuse architectures, as well as designs in HDL and VHDL. The feature extraction process considers mean, energy, and entropy. It has also been implemented in an architecture that efficiently reuses these computational cores. These versions are compared in terms of accuracy, resources used, performance, and power consumption. © 2025 IEEE.

Keywords: Image reconstruction
