ISBN (digital): 9798350390155
ISBN (print): 9798350390162
In semi-supervised medical image segmentation, the scarcity of labeled data makes models prone to learning bias, causing persistent errors in certain regions and eventual overfitting, which significantly degrades segmentation performance. These problematic regions, termed difficult areas, are inadequately addressed by existing methods. To address this, we propose the Difficulty Perception-Processing Heterogeneous Network (DPP-Net), which guides the model to accurately perceive and rectify difficult areas, overcoming learning bias. Specifically, we introduce Global Mutual Perception (GMP) to establish a comprehensive information-perception channel between samples, enabling a more holistic and accurate perception of difficult areas. The Difficulty-Aware Rectification (DAR) structure continuously monitors difficult areas during training, allowing errors to be corrected in a timely manner. Additionally, the Adaptive Competitive Pseudo-Label (ACP) Augmentation strategy enhances pseudo-labels through adaptive confidence competition. Experimental results on two medical image databases (CT and MRI) demonstrate that our approach outperforms several state-of-the-art methods.
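As a rough illustration of the confidence-competition idea behind ACP, the sketch below lets two heterogeneous branches compete per pixel for the right to supply the pseudo-label. The function name, tensor shapes, and the simple max-confidence rule are assumptions for illustration; the paper's actual adaptive competition may differ.

```python
import torch

def competitive_pseudo_label(logits_a: torch.Tensor,
                             logits_b: torch.Tensor) -> torch.Tensor:
    """Per-pixel pseudo-label competition between two branches.

    For each pixel, the branch with the higher predictive confidence
    (max softmax probability) wins and supplies the pseudo-label.
    logits_* have shape (B, C, H, W); the result has shape (B, H, W).
    """
    conf_a, label_a = logits_a.softmax(dim=1).max(dim=1)
    conf_b, label_b = logits_b.softmax(dim=1).max(dim=1)
    return torch.where(conf_a >= conf_b, label_a, label_b)
```

In a training loop this would be called as `pl = competitive_pseudo_label(net_a(x), net_b(x))`, with the winning labels then supervising both branches on unlabeled data.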
While graph neural networks (GNNs) have become the de facto standard for graph-based node classification, they rest on a strong assumption that sufficient labeled samples are available. This assumption restricts the classification performance of prevailing GNNs in many real-world applications operating under low-data regimes: features extracted from scarce labeled nodes cannot provide sufficient supervision for the unlabeled samples, leading to severe overfitting. We point out that leveraging subgraphs to capture long-range dependencies can augment node representations and thus alleviate the low-data regime. To this end, we present a novel self-supervised learning (SSL) framework, called multiview subgraph neural networks (Muse), for handling long-range dependencies. In particular, we propose an information theory-based identification mechanism to identify two types of subgraphs from the views of the input space and the latent space, respectively. The former captures the local structure of the graph, while the latter captures long-range dependencies among nodes. By fusing these two views of subgraphs, the learned representations preserve the topological properties of the graph at large, including both local structure and long-range dependencies, thus maximizing their expressiveness. Theoretically, we provide a generalization error bound showing the effectiveness of capturing complementary information from multiview subgraphs. Empirically, we show a proof of concept of Muse on canonical node classification problems on graph data.
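To make the two views concrete, here is a minimal sketch of how an input-space (local) subgraph and a latent-space (long-range) subgraph could be gathered for an anchor node. Muse's actual identification mechanism is information-theoretic, so this is a simplified stand-in with assumed dense-adjacency inputs.

```python
import torch
import torch.nn.functional as F

def local_subgraph(adj: torch.Tensor, node: int, hops: int = 2) -> torch.Tensor:
    """Input-space view: all nodes within `hops` steps of `node`.

    adj is a dense (N, N) binary adjacency matrix; this view captures
    the local structure of the graph around the anchor.
    """
    reach = adj[node].clone()
    frontier = reach.clone()
    for _ in range(hops - 1):
        frontier = (adj[frontier.bool()].sum(dim=0) > 0).float()
        reach = torch.clamp(reach + frontier, max=1.0)
    return reach.nonzero(as_tuple=True)[0]

def latent_subgraph(z: torch.Tensor, node: int, k: int = 5) -> torch.Tensor:
    """Latent-space view: the k nearest nodes to `node` in embedding space.

    z is an (N, d) embedding matrix; neighbors here need not be close in
    the input graph, so this view can capture long-range dependencies.
    """
    sim = F.cosine_similarity(z[node].unsqueeze(0), z, dim=1)
    sim[node] = float("-inf")  # exclude the anchor node itself
    return sim.topk(k).indices
```

Representations pooled over the two index sets would then be fused into the final node representation.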
Fusing a hyperspectral image (HSI) with a multispectral image (MSI) to produce a super-resolution image (SRI) that possesses both fine spatial and spectral resolution is a widely adopted technique in hyperspectral super-resolution (HSR). Most existing HSR methods accomplish this task within the framework of the linear mixing model (LMM). However, a severe challenge lies in the inherent linearity constraint of the LMM, which hinders the adaptability of these HSR methods to complex real-world scenarios. In this work, the LMM is extended to the generalized bilinear model (GBM), and a novel HSR method based on nonnegative tensor factorization is proposed in the framework of nonlinear unmixing. Beyond the linear part, it additionally models the dominant nonlinear interactions, namely the bilinear interactions between endmembers. Crucially, each decomposition factor has a physical interpretation, enabling the incorporation of prior information to enhance reconstruction performance. Furthermore, an HSR algorithm is devised specifically for scenarios where the spatial degradation operator from SRI to HSI is unknown, which further enhances its practical applicability. The proposed methods overcome the inherent linear limitations of the LMM framework while avoiding the information loss caused by matricizing the HSI and MSI. Their effectiveness is demonstrated on simulated and real data.
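For concreteness, the standard per-pixel forms of the two mixing models are shown below, with R endmembers e_i, abundances a_i, and noise n; the bilinear weights gamma_{ij} follow the common GBM convention, although the paper's tensor factorization may parameterize them differently.

```latex
% Linear mixing model (LMM): each pixel y is a convex combination of endmembers.
\mathbf{y} = \sum_{i=1}^{R} a_i \mathbf{e}_i + \mathbf{n},
\qquad a_i \ge 0, \quad \sum_{i=1}^{R} a_i = 1.

% Generalized bilinear model (GBM): adds pairwise endmember interactions
% (Hadamard products \odot), weighted by coefficients \gamma_{ij} \in [0, 1].
\mathbf{y} = \sum_{i=1}^{R} a_i \mathbf{e}_i
           + \sum_{i=1}^{R-1} \sum_{j=i+1}^{R} \gamma_{ij}\, a_i a_j \,
             (\mathbf{e}_i \odot \mathbf{e}_j)
           + \mathbf{n}.
```

Setting all gamma_{ij} = 0 recovers the LMM, which is why the GBM is a strict generalization rather than a replacement.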
ISBN (digital): 9798350386226
ISBN (print): 9798350386233
Self-supervised pre-training followed by fine-tuning is a potent paradigm for few-shot learning, leveraging extensive unlabeled data with remarkable efficacy. Current self-supervised methods often favor Vision Transformers (ViTs) over CNN-Transformer hybrid architectures, even though the latter generally demonstrate superior performance. This reliance on ViTs can leave the model with a poor perception of local features. The challenge lies in designing a suitable proxy task for hybrid architectures like CNN-Transformers, whose components differ substantially in structure. Additionally, current CNN-Transformer hybrid backbones are typically organized sequentially, hindering collaboration during pre-training and the acquisition of robust representations. To address these issues, we propose the Self-Supervised Collaborative CNN-Transformer (S²CCT) for few-shot medical image segmentation. This framework introduces three innovative designs: (1) a composite proxy task based on image masking and image super-resolution, tailored for CNN-Transformer hybrid architectures, enabling the backbone to acquire robust representations during pre-training that transfer to downstream tasks; (2) a parallel CNN-Transformer architecture that better attends to multi-scale features in images, making it more suitable for dense prediction tasks like image segmentation; and (3) a sparse-and-dense feature fusion module to enhance collaboration between the two encoders. Experiments demonstrate that S²CCT outperforms previous state-of-the-art methods on two public medical image segmentation benchmarks, ACDC and KiTS19. The code and pretrained models will be released soon.
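A minimal sketch of what the composite proxy task could look like as a single pre-training objective is given below; the `reconstruct` and `upsample` heads are hypothetical names, and pixel-level masking is used for brevity where the paper presumably masks patches.

```python
import torch
import torch.nn.functional as F

def composite_proxy_loss(backbone, images: torch.Tensor,
                         mask_ratio: float = 0.6, scale: int = 2) -> torch.Tensor:
    """One pre-training step combining masking and super-resolution.

    images: (B, C, H, W). `backbone.reconstruct` and `backbone.upsample`
    are assumed prediction heads on the hybrid encoder.
    """
    # Task 1: masked image modelling -- hide random pixels, reconstruct
    # them, and score the result only on the hidden positions.
    mask = (torch.rand_like(images[:, :1]) > mask_ratio).float()
    recon = backbone.reconstruct(images * mask)
    loss_mask = F.l1_loss(recon * (1 - mask), images * (1 - mask))

    # Task 2: super-resolution -- downsample, then recover the original.
    lowres = F.interpolate(images, scale_factor=1 / scale, mode="bilinear")
    sr = backbone.upsample(lowres)
    loss_sr = F.l1_loss(sr, images)

    return loss_mask + loss_sr
```

The intuition is that masking exercises global context (a Transformer strength) while super-resolution exercises fine local detail (a CNN strength), so the composite task gives both encoders something to learn.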
This paper fills a gap in urban sewage treatment decision-making by exploring reward and punishment mechanisms within the uncertainty theory framework. Addressing challenges in characterizing sewage treatment capacity...
ISBN (print): 9798331314385
Diffusion models are powerful generative models, and this capability can also be applied to discrimination: the inner activations of a pre-trained diffusion model can serve as features for discriminative tasks, namely diffusion features. We find that diffusion features are hindered by a hidden yet universal phenomenon that we call content shift: there are content differences between the features and the input image, such as the exact shape of a certain object. We trace the cause of content shift to an inherent characteristic of diffusion models, which suggests that the phenomenon is broadly present in diffusion features. Further empirical study indicates that its negative impact is not negligible even when content shift is not visually perceivable. Hence, we propose to suppress content shift to enhance the overall quality of diffusion features. Specifically, content shift is related to the information drift that occurs while recovering an image from the noisy input, which suggests that off-the-shelf generation techniques can be turned into tools for content shift suppression. We further propose a practical guideline named GATE to efficiently evaluate the potential benefit of a technique, and we provide an implementation of our methodology. Despite its simplicity, the proposed approach achieves superior results on various tasks and datasets, validating its potential as a generic booster for diffusion features. Our code is available at https://***/Darkbblue/diffusion-content-shift.
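For readers unfamiliar with diffusion features, the sketch below shows the typical extraction recipe: noise an encoded image to some timestep, run one denoising pass through a pre-trained U-Net, and cache an intermediate activation with a forward hook. The checkpoint, block, and timestep chosen here are illustrative assumptions, not the paper's configuration.

```python
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5")
unet, scheduler = pipe.unet, pipe.scheduler

features = {}
def grab(module, inputs, output):
    features["mid"] = output  # cache the hooked block's activation

hook = unet.mid_block.register_forward_hook(grab)

latents = torch.randn(1, 4, 64, 64)   # stand-in for a VAE-encoded image
t = torch.tensor([100])               # diffusion timestep to noise to
noisy = scheduler.add_noise(latents, torch.randn_like(latents), t)
cond = torch.zeros(1, 77, 768)        # stand-in for a text embedding
with torch.no_grad():
    unet(noisy, t, encoder_hidden_states=cond)
hook.remove()

print(features["mid"].shape)  # this activation is the "diffusion feature"
```

Content shift, in these terms, is the discrepancy between what the cached activation encodes and the image actually fed in, which is why generation-side techniques can help suppress it.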
Dynamic Constrained Multiobjective Optimization Problems (DCMOPs) are characterized by multiple conflicting optimization objectives and constraints that vary over time. The presence of both dynamism and constraints underscores the importance of preserving population diversity, which is essential not only for escaping local optima after environmental changes but also for crossing infeasible barriers to approach feasible regions. However, existing constraint-handling techniques for enhancing solution feasibility can steer infeasible solutions toward partially feasible regions, potentially resulting in a loss of diversity. To maintain both diversity and feasibility, this work establishes two synergistic tasks: one concentrates on exploring the unconstrained search space to preserve diversity, while the other searches the constrained search space to prioritize feasibility. In light of evolutionary transfer optimization, two knowledge transfer modules are designed: a spatial knowledge transfer module and a temporal knowledge transfer module. The spatial module facilitates knowledge transfer between the constrained and unconstrained search spaces to accelerate the exploration of both, while the temporal module leverages historical knowledge to enhance search efficiency in a new environment. To bring the test suite closer to real-world cases, we design fourteen test problems with various properties. Experiments conducted on the proposed test problems and a real-world problem demonstrate the efficacy of the proposed algorithm.
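A minimal sketch of the two transfer directions, under assumed population representations, is given below; the scalar fitness, migration sizes, and replacement rules are illustrative simplifications, not the paper's settings.

```python
import copy
import random

def spatial_transfer(pop_free, pop_constrained, n_migrants: int = 5):
    """Exchange elite individuals between the two task populations.

    pop_* are lists of (solution, fitness) pairs, lower fitness better.
    Diversity found by the unconstrained task seeds the constrained
    task and vice versa, replacing each population's worst members.
    """
    pop_free.sort(key=lambda ind: ind[1])
    pop_constrained.sort(key=lambda ind: ind[1])
    elites_free = copy.deepcopy(pop_free[:n_migrants])
    elites_cons = copy.deepcopy(pop_constrained[:n_migrants])
    pop_free[-n_migrants:] = elites_cons
    pop_constrained[-n_migrants:] = elites_free

def temporal_transfer(archive, pop, n_seeds: int = 5):
    """After an environmental change, reseed part of the population
    from an archive of historical elites (temporal knowledge reuse)."""
    seeds = random.sample(archive, min(n_seeds, len(archive)))
    pop[-len(seeds):] = copy.deepcopy(seeds)
```

In a dynamic loop, `temporal_transfer` would fire whenever a change is detected, and `spatial_transfer` would run every few generations between the two co-evolving populations.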
Probabilistic electricity price forecasting (EPF) is paramount for stakeholder scheduling and trading in deregulated energy markets. However, during the process of establishing a probability prediction model, the cons...
Heterogeneous information networks (HINs) have become a popular tool to capture complicated user-item relationships in recommendation problems in recent years. As a typical instantiation of HINs, meta-path is introduc...
In reasoning tasks, even a minor error can cascade into inaccurate results, leading to suboptimal performance of large language models in such domains. Earlier fine-tuning approaches sought to mitigate this by leverag...