检索结果-内蒙古大学图书馆

International Joint Conference on Neural Networks (IJCNN)

作者： Songlin Li Zhe Li Boyuan Li Xiuhong Li Jiabao Sheng School of Computer Science and Technology Xinjiang University Urumqi China Xinjiang Key Laboratory of Signal Detection and Processing Xinjiang University Urumqi China Department of Electrical and Electronic Engineering Hong Kong Polytechnic University HongKong China

ISBN: (数字)9798350359312

ISBN: (纸本)9798350359329

Camouflaged objects, exhibiting high similarity with their surroundings, pose a substantial challenge for both humans and machines to detect when concealed within the environment. Existing methods for camouflage object detection (COD) struggle in accurately segmenting the overall structure of camouflaged objects. To address this issue, we propose a novel boundary-guided fusion of multi-level features network (BGFM-Net) for COD. In contrast to existing boundary-guided methods, we pay more attention to addressing the significant imbalance in the pixel quantities between boundary and background features, allowing for a more comprehensive representation of boundary features. BGFM-Net primarily consists of a multi-scale aggregation module (MSAM), a boundary-guided feature module (BFM), and a cross-Level fusion module (CLFM). MSAM effectively integrates contextual semantics at different scales, achieving a powerful and efficient feature representation. BFM adeptly combines edge features while constraining interference from background features, guiding the learning of camouflaged object boundary representation. CLFM integrates multi-level features for predicting camouflaged objects while adaptively adjusting channel weights to emphasize important channels and diminish the impact of less relevant channels for the task. Extensive experiments on three benchmark camouflage datasets demonstrate that our BGFM-Net outperforms other state-of-the-art COD models.

关键词： Adaptation models Semantics Neural networks Object detection Interference Benchmark testing Feature extraction

来源：评论

学校读者我要写书评

暂无评论

Multimodal Associative Memory Based on Forgetting Memristor Bridge Synapse Circuit

SSRN

引用

SSRN 2023年

作者： Li, Ke Chen, Ling Li, Chuandong Qiu, Haoyang Liu, Chenchen Electronic Information and Engineering Chongqing Key Laboratory of Nonlinear Circuits and Intelligent Information Processing Southwest University 400715 China University of Maryland Baltimore County Computer Science and Electrical Engineering Department United States

Memristor is frequently used to construct synapses and memristor bridge synapse is a typical example of such a synapse. Unlike the traditional memristor bridge synapse, the forgetting memristor bridge synapse can express positive and negative weights, as well as dual weights for long-term and short-term memory. In this study, we constructed the forgetting memristor bridge synapse using the forgetting memristor SPICE model, and then simulated 1*1, 1*3, and 3*3 forgetting memristor bridge synapse networkswith neurons. We demonstrated the circuit and network’s functionality and effectiveness through simulation results and noise ***, we built the bipolar and gray-level BAM (BGBAM) network based on the forgetting memristor bridge. Owing to its dual weight characteristic, the BGBAM associative memory network uses time division multiplexing mode on the same architecture, which enables the realization of multi-modal associative memories between text, images, and audio. © 2023, The Authors. All rights reserved.

关键词： Memristors

来源：评论

学校读者我要写书评

暂无评论

Spatial and Contextual Path Network for Image Inpainting

引用

Intelligent Automation & Soft Computing 2024年第2期39卷 115-133页

作者： Dengyong Zhang Yuting Zhao Feng Li Arun Kumar Sangaiah Hunan Provincial Key Laboratory of Intelligent Processing of Big Data on Transportation Changsha University of Science and TechnologyChangsha410114China School of Computer and Communication Engineering Changsha University of Science and TechnologyChangsha410114China International Graduate Institute of AI National Yunlin University of Science and TechnologyYunlinTaiwan Department of Electrical and Computer Engineering Lebanese American UniversityByblosLebanon

Image inpainting is a kind of use known area of information technology to repair the loss or damage to the *** feature extraction is the core of image *** enough space for information and a larger receptive field is very important to realize high-precision image ***,in the process of feature extraction,it is difficult to meet the two requirements of obtaining sufficient spatial information and large receptive fields at the same *** order to obtain more spatial information and a larger receptive field at the same time,we put forward a kind of image restoration based on space path and context path *** the space path,we stack three convolution layers for 1/8 of the figure,the figure retained the rich spatial *** the context path,we use the global average pooling layer,where the accept field is the maximum of the backbone network,and the pooling module can provide global context information for the maximum accept *** order to better integrate the features extracted from the spatial and contextual paths,we study the fusion module of the two *** fusionmodule first path output of the space and context path,and then through themass normalization to balance the scale of the characteristics,finally the characteristics of the pool will be connected into a feature vector and calculate the weight *** of images in order to extract context information,we add attention to the context path refinement *** modules respectively from channel dimension and space dimension to weighted images,in order to obtain more effective *** show that our method is better than the existing technology in the quality and quantity of themethod,and further to expand our network to other inpainting networks,in order to achieve consistent performance improvements.

关键词： Image inpainting attention deep learning convolutional network

来源：评论

学校读者我要写书评

暂无评论

Graph Convolutional Networks based Muti-Label Deep Cross-Modal Hashing 13

Graph Convolutional Networks based Muti-Label Deep Cross-Mod...

引用

13th International Conference on Information Science and Technology, ICIST 2023

作者： Peng, Yi Zhang, Nian Jiang, Xin Xiong, Jiang School of Electronic and Information Engineering Chongqing Three Gorges University Chongqing40044 China Department of Electrical and Computer Engineering University of the District of Columbia WashingtonDC20008 United States Key Laboratory of Intelligent Information Processing and Control of Chongqing Municipal Institutions of Higher Education Chongqing Three Gorges University Chongqing40044 China College of Mathematics and Statistics Chongqing Three Gorges University Chongqing40044 China

ISBN: (纸本)9798350313925

Recently, multi-label deep cross-modal hashing (MDCH), which incorporates deep neural networks, hashing and multi-label learning for cross-modal retrieval tasks, has achieved excellent cross-modal retrieval results and thus became a highly popular area of research. Nevertheless, many existing MDCH methods concentrate on extracting information from multi-modal data, while neglecting the abundant semantic information in multiple labels. Few MDCH methods incorporate multi-label information, but they often treat labels as independent entities, ignoring the relationships between categories, which hinders the establishment of semantic connections among multi-modal data. In order to tackle the aforementioned challenges, we propose a graph convolutional networks based multi-label deep cross-modal hashing method (GMCH) in this paper. GMCH leverages two deep neural networks to generate hash representations from the original image-text pairs, during this process, a graph convolutional network is introduced to capture the category correlations of multi-labels and supervise the training of hash mapping. Experimental results on two commonly employed datasets validate the efficacy of the proposed GMCH method. You can find the code for our proposed GCMH at https://***/licher12/***. © 2023 IEEE.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

Multi-Task Sparse Signal Recovery with Dirichlet Process Priors Based on Expectation Propagation Technique 7

Multi-Task Sparse Signal Recovery with Dirichlet Process Pri...

引用

7th International Conference on Signal and Image processing, ICSIP 2022

作者： Fu, Yin Wu, Qisong Zhang, Yimin D. Amin, Moeness G. Southeast University Key Laboratory of Underwater Acoustic Signal Processing of Ministry of Education Nanjing210096 China Temple University Department of Electrical and Computer Engineering PhiladelphiaPA19122 United States Villanova University Center for Advanced Communications VillanovaPA19085 United States

ISBN: (数字)9781665495639

ISBN: (纸本)9781665495639

Exploiting the shared information among tasks to significantly improve the sparse reconstruction performance lays the essence of multi-task compressive sensing. In this paper, a novel generative model of multi-task compressive sensing with Dirichlet process (DP) priors is proposed and the sharing mechanisms among tasks are revealed, yielding a principled means of inferring the clusters as well as performing compressive sensing inversion simultaneously. The spike-and-slab priors are first used to model the group sparsity among tasks within the identical cluster, and the DP priors are then introduced to automatically perform clustering for the tasks. The expectation propagation method is finally carried out to take the inference for posterior distribution approximation. The superiority of the proposed method over state-of-the-art algorithms is demonstrated by using experimental results on both numerical data and real data sets. © 2022 IEEE.

关键词： Compressed sensing

来源：评论

学校读者我要写书评

暂无评论

MN-Net: Speech Enhancement Network via Modeling the Noise

IEEE Transactions on Audio, Speech and Language Processing

引用

IEEE Transactions on Audio, Speech and Language processing 2025年 33卷 1208-1219页

作者： Ying Hu Qin Yang Wenbing Wei Li Lin Liang He Zhijian Ou Wenzhong Yang School of Computer Science and Technology Key Laboratory of Signal Detection and Processing Xinjiang University Urumqi China Xinjiang Institute of Electronics Research Shares Company Ltd. Urumqi China Department of Electrical Engineering Tsinghua University Beijing China Beijing National Research Center for information Science and Technology Department of Electronics Tsinghua University Beijing China

Currently, deep learning-based speech enhancement methods generally focus on target speech extraction while neglecting modeling the other sound sources in the mixture. These methods still can't distinguish the target speech from the interference well. In this paper, we present a monaural speech enhancement network via Modeling the Noise (MN-Net), which includes a shared Encoder and three separate Decoders for parallel modeling the magnitude and phase spectrogram of target speech, and the complex spectrogram of noise. Specifically, we propose a Multi-Branch Feature Extractor (MBFE) module to capture the richer contextual information in mixture, and a Spatial Reconstruction Unit (SRU) to remove the redundancy from extracted features. We compared our proposed MN-Net with 18 classical speech enhancement methods on the VoiceBank+DEMAND dataset, and with 9 ones on DNS-Challenge dataset for denoising task, and with 7 ones on the WHAMR! dataset for simultaneous denoising & de-reverberation task. Our proposed MBFE module was applied to two classical speech enhancement methods, DB-AIAT and CMGAN, replacing their DenseBlocks module. The results demonstrate that applying the MBFE module can boost their performances while keeping smaller model size. A series of visualization analysis intuitively verify that modeling the noise can enable the network to distinguish the target speech from noise and other interference more accurately.

关键词： Noise Feature extraction Spectrogram Speech enhancement Decoding Kernel Transformers Noise reduction Data mining Training

来源：评论

学校读者我要写书评

暂无评论

GEM: Context-Aware Gaze EstiMation with Visual Search Behavior Matching for Chest Radiograph

arXiv

引用

arXiv 2024年

作者： Liu, Shaonan Chen, Wenting Liu, Jie Luo, Xiaoling Shen, Linlin Computer Vision Institute College of Computer Science and Software Engineering Shenzhen University China Department of Electrical Engineering City University of Hong Kong Hong Kong AI Research Center for Medical Image Analysis and Diagnosis Shenzhen University China Guangdong Provincial Key Laboratory of Intelligent Information Processing China

Gaze estimation is pivotal in human scene comprehension tasks, particularly in medical diagnostic analysis. Eye-tracking technology facilitates the recording of physicians’ ocular movements during image interpretation, thereby elucidating their visual attention patterns and information-processing strategies. In this paper, we initially define the context-aware gaze estimation problem in medical radiology report settings. To understand the attention allocation and cognitive behavior of radiologists during the medical image interpretation process, we propose a context-aware Gaze EstiMation (GEM) network that utilizes eye gaze data collected from radiologists to simulate their visual search behavior patterns throughout the image interpretation process. It consists of a context-awareness module, visual behavior graph construction, and visual behavior matching. Within the context-awareness module, we achieve intricate multimodal registration by establishing connections between medical reports and images. Subsequently, for a more accurate simulation of genuine visual search behavior patterns, we introduce a visual behavior graph structure, capturing such behavior through high-order relationships (edges) between gaze points (nodes). To maintain the authenticity of visual behavior, we devise a visual behavior-matching approach, adjusting the high-order relationships between them by matching the graph constructed from real and estimated gaze points. Extensive experiments on four publicly available datasets demonstrate the superiority of GEM over existing methods and its strong generalizability, which also provides a new direction for the effective utilization of diverse modalities in medical image interpretation and enhances the interpretability of models in the field of medical imaging. https://***/Tiger-SN/GEM. Copyright © 2024, The Authors. All rights reserved.

关键词： Eye movements

来源：评论

学校读者我要写书评

暂无评论

Fuzzy Granule Density-Based Outlier Detection with Multi-Scale Granular Balls

arXiv

引用

arXiv 2025年

作者： Gao, Can Tan, Xiaofeng Zhou, Jie Ding, Weiping Pedrycz, Witold The College of Computer Science and Software Engineering Shenzhen University Shenzhen518060 China Guangdong Key Laboratory of Intelligent Information Processing Shenzhen518060 China The National Engineering Laboratory for Big Data System Computing Technology Shenzhen University Shenzhen518060 China The School of Artificial Intelligence and Computer Science Nantong University Nantong226019 China The Faculty of Data Science City University of Macau Macau999078 China The Department of Electrical and Computer Engineering University of Alberta Edmonton Canada Systems Research Institute Polish Academy of Sciences Warsaw Poland The Department of Electrical and Computer Engineering King Abdulaziz University Jeddah Saudi Arabia The Department of Computer Engineering Istinye University Istanbul Turkey

Outlier detection refers to the identification of anomalous samples that deviate significantly from the distribution of normal data and has been extensively studied and used in a variety of practical tasks. However, most unsupervised outlier detection methods are carefully designed to detect specified outliers, while real-world data may be entangled with different types of outliers. In this study, we propose a fuzzy rough sets-based multi-scale outlier detection method to identify various types of outliers. Specifically, a novel fuzzy rough sets-based method that integrates relative fuzzy granule density is first introduced to improve the capability of detecting local outliers. Then, a multi-scale view generation method based on granular-ball computing is proposed to collaboratively identify group outliers at different levels of granularity. Moreover, reliable outliers and inliers determined by the three-way decision are used to train a weighted support vector machine to further improve the performance of outlier detection. The proposed method innovatively transforms unsupervised outlier detection into a semi-supervised classification problem and for the first time explores the fuzzy rough sets-based outlier detection from the perspective of multi-scale granular balls, allowing for high adaptability to different types of outliers. Extensive experiments carried out on both artificial and UCI datasets demonstrate that the proposed outlier detection method significantly outperforms the state-of-the-art methods, improving the results by at least 8.48% in terms of the Area Under the ROC Curve (AUROC) index. The source codes are released at https://***/Xiaofeng-Tan/MGBOD. © 2025, CC BY.

关键词： Granulation

来源：评论

学校读者我要写书评

暂无评论

A robust incomplete large-scale group decision-making model for metaverse metro operations and maintenance

引用

Applied Soft Computing 2024年 156卷

作者： Bai, Wenhui Zhang, Chao Zhai, Yanhui Sangaiah, Arun Kumar School of Computer and Information Technology Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education Shanxi University Shanxi Taiyuan030006 China International Graduate Institute of Artificial Intelligence National Yunlin University of Science and Technology Yunlin Taiwan Department of Electrical and Computer Engineering Lebanese American University Byblos Lebanon

The metaverse, constructed through digital technology, serves as a virtual realm intertwining with reality. Within this context, the challenge of evaluating data from diverse sources arises, and the application of large-scale group decision-making (LSGDM) methods emerges as a viable solution. Handling incomplete information and reducing dimensionality for large-scale decision-makers (DMs) is crucial in addressing complex decision-making problems. Moreover, addressing missing data is a fundamental and pivotal concern in tackling real-world decision challenges, given the ubiquitous presence of information gaps that cannot be straightforwardly integrated into decision models. Besides, the intricacies of LSGDM amplify this challenge by introducing a wealth of DMs, thereby augmenting the complexity and diversity of decision-related information. This paper proposes an approach to supplement missing data by double-dimensions. This paper explores various facets of similarity relationships within the data to enhance data completeness. Additionally, this paper categorizes DMs into clusters based on their relevance and establishes a two-stage consensus-reaching process (CRP) that takes into account both group sizes and individual consensus contributions. These CRPs play a crucial role in enhancing the overall consistency and consensus within the decision group. Subsequently, this paper applies a robust decision-making method rooted in MULTIMOORA (Multi-Objective Optimization by Ratio Analysis plus the complete MULTIplicative form) to rank decision objects. Finally, this paper employs this proposed methodology in a practical case study that involves evaluating the operational status of a metaverse's urban construction metro system. Following these considerations, a comprehensive stability analysis of relevant parameters is conducted to guarantee the robustness and reliability of the decision-making process. © 2024 Elsevier B.V.

关键词： Multiobjective optimization

来源：评论

学校读者我要写书评

暂无评论

The NeRF Signature: Codebook-Aided Watermarking for Neural Radiance Fields

arXiv

引用

arXiv 2025年

作者： Luo, Ziyuan Rocha, Anderson Shi, Boxin Guo, Qing Li, Haoliang Wan, Renjie Department of Computer Science Hong Kong Baptist University Hong Kong Institute of Computing University of Campinas Brazil State Key Laboratory of Multimedia Information Processing and National Engineering Research Center of Visual Technology School of Computer Science Peking University Beijing100871 China A*STAR Singapore Department of Electrical Engineering City University of Hong Kong Hong Kong

Neural Radiance Fields (NeRF) have been gaining attention as a significant form of 3D content representation. With the proliferation of NeRF-based creations, the need for copyright protection has emerged as a critical issue. Although some approaches have been proposed to embed digital watermarks into NeRF, they often neglect essential model-level considerations and incur substantial time overheads, resulting in reduced imperceptibility and robustness, along with user inconvenience. In this paper, we extend the previous criteria for image watermarking to the model level and propose NeRF Signature, a novel watermarking method for NeRF. We employ a Codebook-aided Signature Embedding (CSE) that does not alter the model structure, thereby maintaining imperceptibility and enhancing robustness at the model level. Furthermore, after optimization, any desired signatures can be embedded through the CSE, and no fine-tuning is required when NeRF owners want to use new binary signatures. Then, we introduce a joint pose-patch encryption watermarking strategy to hide signatures into patches rendered from a specific viewpoint for higher robustness. In addition, we explore a Complexity-Aware Key Selection (CAKS) scheme to embed signatures in high visual complexity patches to enhance imperceptibility. The experimental results demonstrate that our method outperforms other baseline methods in terms of imperceptibility and robustness. The source code is available at: https://***/luo-ziyuan/NeRF_Signature. Copyright © 2025, The Authors. All rights reserved.

关键词： Image watermarking

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：