Search Results - Inner Mongolia University Library

arXiv, 2025

Authors: Yu, Peilin; Wu, Yuwei; Gao, Zhi; Fan, Xiaomeng; Jia, Yunde. Affiliations: Beijing Key Laboratory of Intelligent Information Technology, School of Computer Science & Technology, Beijing Institute of Technology, China; Guangdong Laboratory of Machine Perception and Intelligent Computing, Shenzhen MSU-BIT University, China

Riemannian meta-optimization provides a promising approach to solving non-linear constrained optimization problems, which trains neural networks as optimizers to perform optimization on Riemannian manifolds. However, existing Riemannian meta-optimization methods take up huge memory footprints in large-scale optimization settings, as the learned optimizer can only adapt gradients of a fixed size and thus cannot be shared across different Riemannian parameters. In this paper, we propose an efficient Riemannian meta-optimization method that significantly reduces the memory burden for large-scale optimization via a subspace adaptation scheme. Our method trains neural networks to individually adapt the row and column subspaces of Riemannian gradients, instead of directly adapting the full gradient matrices in existing Riemannian meta-optimization methods. In this case, our learned optimizer can be shared across Riemannian parameters with different sizes. Our method reduces the model memory consumption by six orders of magnitude when optimizing an orthogonal mainstream deep neural network (e.g. ResNet50). Experiments on multiple Riemannian tasks show that our method can not only reduce the memory consumption but also improve the performance of Riemannian meta-optimization. © 2025, CC BY-NC-ND.

Keywords: Constrained optimization

XRL-SHAP-Cache:an explainable reinforcement learning approach for intelligent edge service caching in content delivery networks

Science China (Information Sciences), 2024, Vol. 67, No. 7, pp. 46-71

Authors: Xiaolong XU; Fan WU; Muhammad BILAL; Xiaoyu XIA; Wanchun DOU; Lina YAO; Weiyi ZHONG. Affiliations: School of Software, Nanjing University of Information Science and Technology; School of Computing and Communications, Lancaster University; School of Computing Technologies, Royal Melbourne Institute of Technology; State Key Laboratory for Novel Software Technology, Nanjing University; School of Computer Science and Engineering, University of New South Wales; Data 61, Commonwealth Scientific and Industrial Research Organization; School of Computer Science, Qufu Normal University

Content delivery networks (CDNs) play a pivotal role in the modern internet infrastructure by enabling efficient content delivery across diverse geographical regions. As an essential component of CDNs, the edge caching scheme directly influences the user experience by determining the caching and eviction of content on edge servers. With the emergence of 5G technology, traditional caching schemes have faced challenges in adapting to increasingly complex and dynamic network environments. Consequently, deep reinforcement learning (DRL) offers a promising solution for intelligent zero-touch network governance. However, the black-box nature of DRL models poses challenges in understanding and making trusting decisions. In this paper, we propose an explainable reinforcement learning (XRL)-based intelligent edge service caching approach, namely XRL-SHAP-Cache, which combines DRL with an explainable artificial intelligence (XAI) technique for cache management in CDNs. Instead of focusing solely on achieving performance gains, this study introduces a novel paradigm for providing interpretable caching strategies, thereby establishing a foundation for future transparent and trustworthy edge caching solutions. Specifically, a multi-level cache scheduling framework for CDNs was formulated theoretically, with the D3QN-based caching scheme serving as the targeted interpretable model. Subsequently, by integrating Deep-SHAP into our framework, the contribution of each state input feature to the agent's Q-value output was calculated, thereby providing valuable insights into the decision-making process. The proposed XRL-SHAP-Cache approach was evaluated through extensive experiments to demonstrate the behavior of the scheduling agent in the face of different environmental *** results demonstrate its strong explainability under various real-life scenarios while maintaining superior performance compared to traditional caching schemes in terms of cache hit ratio, quality of service (QoS), a

Keywords: deep reinforcement learning (DRL); explainable artificial intelligence (XAI); multi-level cache; content delivery network (CDN); D3QN algorithm; Deep-SHAP
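
As a rough illustration of the attribution idea in the abstract above (the contribution of each state input feature to the agent's Q-value), the sketch below computes exact Shapley values by enumerating feature subsets for a toy, hand-written Q-function. It is not Deep-SHAP and not the authors' code: `toy_q`, the feature names, the state, and the baseline are all hypothetical, and Deep-SHAP approximates such values for deep networks instead of enumerating coalitions.

```python
from itertools import combinations
from math import factorial


def shapley_values(f, x, baseline):
    """Exact Shapley values of f at x relative to baseline (brute force).

    Features outside a coalition are set to their baseline value.
    Cost grows as 2^n, so this only works for a handful of features.
    """
    n = len(x)
    values = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            for subset in combinations(others, size):
                # Shapley weight for a coalition of this size
                weight = factorial(size) * factorial(n - size - 1) / factorial(n)
                without_i = [x[j] if j in subset else baseline[j] for j in range(n)]
                with_i = list(without_i)
                with_i[i] = x[i]
                values[i] += weight * (f(with_i) - f(without_i))
    return values


def toy_q(state):
    """Hypothetical Q-value for a 'cache this item' action (illustrative only)."""
    popularity, size_mb, recency = state
    return 2.0 * popularity - 0.5 * size_mb + 1.0 * popularity * recency


state = [0.8, 1.2, 0.5]      # hypothetical state features
baseline = [0.0, 0.0, 0.0]   # hypothetical reference state
phi = shapley_values(toy_q, state, baseline)
for name, value in zip(["popularity", "size_mb", "recency"], phi):
    print(f"{name}: {value:+.3f}")
```

The attributions sum to the difference between the Q-value at the state and at the baseline, which is the property that makes this family of methods convenient for explaining a single decision.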

Multi-Label Stereo Matching for Transparent Scene Depth Estimation

arXiv, 2025

Authors: Liu, Zhidan; Yao, Chengtang; Zeng, Jiaxi; Wu, Yuwei; Jia, Yunde. Affiliations: Beijing Key Laboratory of Intelligent Information Technology, School of Computer Science & Technology, Beijing Institute of Technology, China; Guangdong Laboratory of Machine Perception and Intelligent Computing, Shenzhen MSU-BIT University, China

In this paper, we present a multi-label stereo matching method to simultaneously estimate the depth of the transparent objects and the occluded background in transparent scenes. Unlike previous methods that assume a unimodal distribution along the disparity dimension and formulate the matching as a single-label regression problem, we propose a multi-label regression formulation to estimate multiple depth values at the same pixel in transparent scenes. To resolve the multi-label regression problem, we introduce a pixel-wise multivariate Gaussian representation, where the mean vector encodes multiple depth values at the same pixel, and the covariance matrix determines whether a multi-label representation is necessary for a given pixel. The representation is iteratively predicted within a GRU framework. In each iteration, we first predict the update step for the mean parameters and then use both the update step and the updated mean parameters to estimate the covariance matrix. We also synthesize a dataset containing 10 scenes and 89 objects to validate the performance of transparent scene depth estimation. The experiments show that our method greatly improves the performance on transparent surfaces while preserving the background information for scene reconstruction. Code is available at https://***/BFZD233/TranScene. © 2025, CC BY.

Keywords: Multiple linear regression

CESFusion: Cross-Frequency Enhanced Spatial—Spectral Fusion Network for Hyperspectral and Multispectral Image Fusion

IEEE Transactions on Geoscience and Remote Sensing, 2025, Vol. 63

Authors: Zhang, Haozheng; Yang, Yanhong; Li, Chaoyang; Lu, Yanjie; Zhang, Guodao; Chen, Shengyong. Affiliations: Tianjin University of Technology, School of Computer Science and Engineering, Tianjin Key Laboratory of Intelligence Computing and Novel Software Technology, Tianjin 300384, China; School of Computer Science and Engineering, Tianjin 300384, China; Hangzhou Dianzi University, Institute of Intelligent Media Computing, Hangzhou 310018, China

The fusion of hyperspectral and multispectral images (MSIs) involves integrating a high spectral resolution hyperspectral image (HSI) and a high spatial resolution MSI to generate a high-resolution HSI (HR-HSI). Existing HSI-MSI fusion methods primarily focus on information fusion within the spatial domain; however, few solutions have explored the employment of frequency analysis to enhance spatial resolution, limiting their capability for global perception. In this article, we propose an efficient and novel paradigm for HSI-MSI fusion through the cross-frequency enhanced spatial-spectral fusion network, named CESFusion, exploring the complementary fusion of information between the spatial and frequency domains. Specifically, we first present the cross-frequency fusion module (CFFM) to perform global analysis through the Fourier transform (FT) and effectively integrate and enhance the frequency domain information from both HSI and MSI. Subsequently, we propose the spectral modeling module (SpeMM) based on a state space model (SSM) to capture long-range spectral dependencies with linear complexity, and integrate it with the spatial residual block-based module (SRM) for joint spatial-spectral feature extraction. Finally, to enable sufficient interaction between the spatial and frequency domains, we adopt the cross-domain interaction module (CDIM), capturing and integrating complementary information from both domains. Moreover, a frequency-based loss function is purposely designed to further improve the restoration of global information. Extensive experiments conducted on both synthetic and real datasets demonstrate the superiority of our CESFusion, as evidenced by both quantitative and qualitative evaluation results. © 1980-2012 IEEE.

Keywords: Image fusion

Task2Morph: Differentiable Task-inspired Framework for Contact-Aware Robot Design

arXiv, 2024

Authors: Cai, Yishuai; Yang, Shaowu; Li, Minglong; Chen, Xinglin; Mao, Yunxin; Yi, Xiaodong; Yang, Wenjing. Affiliations: The Institute for Quantum Information, State Key Laboratory of High Performance Computing, College of Computer Science and Technology, National University of Defense Technology, Changsha, China

Optimizing the morphologies and the controllers that adapt to various tasks is a critical issue in the field of robot design, also known as embodied intelligence. Previous works typically model it as a joint optimization problem and use search-based methods to find the optimal solution in the morphology space. However, they ignore the implicit knowledge of task-to-morphology mapping, which can directly inspire robot design. For example, flipping heavier boxes tends to require more muscular robot arms. This paper proposes a novel and general differentiable task-inspired framework for contact-aware robot design called Task2Morph. We abstract task features highly related to task performance and use them to build a task-to-morphology mapping. Further, we embed the mapping into a differentiable robot design process, where the gradient information is leveraged for both the mapping learning and the whole optimization. The experiments are conducted on three scenarios, and the results validate that Task2Morph outperforms DiffHand, which lacks a task-inspired morphology module, in terms of efficiency and effectiveness. Copyright © 2024, The Authors. All rights reserved.

Keywords: Morphology

Video Face Recognition System: RetinaFace-Mnet-Faster and Secondary Search

Future of Information and Communication Conference, FICC 2021

Authors: Li, Qian; Guo, Nan; Ye, Xiaochun; Fan, Dongrui; Tang, Zhimin. Affiliations: State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China

ISBN: (print) 9783030731021

Face recognition is widely used in real-world scenes. However, different visual environments require different methods, and face recognition remains difficult in complex environments. Therefore, this paper focuses on experiments with complex faces in video. First, we design an image pre-processing module that enhances images of blurry scenes or under-exposed faces. Our experimental results demonstrate that effective image pre-processing improves accuracy by 0.11%, 0.2% and 1.4% on LFW, WIDER FACE and our datasets, respectively. Second, we propose RetinaFace-mnet-faster for detection and a confidence threshold specification for face recognition, reducing the lost rate. Our experimental results show that RetinaFace-mnet-faster at 640 × 480 resolution improves speed by 16.7% on the Tesla P40 and by 70.2% in single-thread mode. Finally, we design a secondary search mechanism with HNSW to improve performance. Our method is suitable for large-scale datasets, and experimental results show that it is 82% faster than brute-force retrieval for single-frame detection. © 2021, The Author(s), under exclusive license to Springer Nature Switzerland AG.

Keywords: Face recognition

Prototype Guided Personalized Federated Intrusion Detection System

10th IEEE Smart World Congress, SWC 2024

Authors: Cheng, Long; Yan, Huiru; Zhou, Hanlin; Wang, Ying; Tang, Haichuan; Fang, Fang. Affiliations: North China Electric Power University, State Key Laboratory of Alternate Electrical Power System with Renewable Energy Sources, School of Control and Computer Engineering, Beijing, China; Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; AI Lab of CRRC Academy, Beijing, China

ISBN: (print) 9798331520861

Preventing network attacks and protecting user privacy are consistently hot research topics in the Internet of Things (IoT) and edge computing fields. Recent advancements in Federated Learning (FL) have shown promise in addressing these challenges. FL allows various clients to collaboratively build an intrusion detection system (IDS) without sharing their private data. However, most existing methods train a single intrusion detection model for all clients and assume that the training and test data distribution are identical, leading to unsatisfactory detection accuracy and generalization abilities in practice. To overcome these challenges, this paper introduces a prototype-guided personalized FL approach named PG-FedIDS. We propose two novel mechanisms within this method. Firstly, we utilize class prototypes as auxiliary information carriers to generate personalized models for each client, rather than generating a single global model as in previous works. Secondly, we propose a prototype-guided ensemble learning strategy, which can leverage the global knowledge in prototypes to enhance detection accuracy and generalization abilities for each client. We conduct extensive experiments on two benchmark datasets with different evaluation test settings. The results demonstrate that our PG-FedIDS achieves promising detection accuracy and consistently outperforms other FL baselines. © 2024 IEEE.

Keywords: Federated learning
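
The abstract above uses class prototypes as the information shared between clients but does not say how they are built or combined. The sketch below assumes the common definition (a prototype is the mean feature vector of a class) and a sample-count-weighted average across clients, followed by nearest-prototype classification. It is not PG-FedIDS itself; every function name, the aggregation rule, and the toy traffic features are hypothetical.

```python
from collections import defaultdict
from math import dist


def class_prototypes(features, labels):
    """Return {label: (mean feature vector, sample count)} for one client."""
    sums, counts = {}, defaultdict(int)
    for vec, label in zip(features, labels):
        if label not in sums:
            sums[label] = [0.0] * len(vec)
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    return {c: ([s / counts[c] for s in sums[c]], counts[c]) for c in sums}


def aggregate(client_protos):
    """Count-weighted average of per-client prototypes (assumed aggregation rule)."""
    total, weight = {}, defaultdict(int)
    for protos in client_protos:
        for c, (vec, n) in protos.items():
            if c not in total:
                total[c] = [0.0] * len(vec)
            total[c] = [t + n * v for t, v in zip(total[c], vec)]
            weight[c] += n
    return {c: [t / weight[c] for t in total[c]] for c in total}


def nearest_prototype(x, prototypes):
    """Classify x by the closest prototype in Euclidean distance."""
    return min(prototypes, key=lambda c: dist(x, prototypes[c]))


# Two hypothetical clients with 2-D traffic features.
client_a = class_prototypes([[0.0, 0.0], [0.2, 0.0], [1.0, 1.0]],
                            ["benign", "benign", "attack"])
client_b = class_prototypes([[0.9, 1.1], [1.1, 0.9]], ["attack", "attack"])
global_protos = aggregate([client_a, client_b])
print(global_protos)
print(nearest_prototype([0.8, 0.9], global_protos))
```

Only the per-class means and counts leave each client in this sketch, which is what makes prototypes attractive as a compact carrier of knowledge when raw data must stay private.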

A Distributed Framework for Large-scale Protein-protein Interaction Data Analysis and Prediction Using MapReduce

IEEE/CAA Journal of Automatica Sinica, 2022, Vol. 9, No. 1, pp. 160-172

Authors: Lun Hu; Shicheng Yang; Xin Luo; Huaqiang Yuan; Khaled Sedraoui; MengChu Zhou. Affiliations: School of Computer Science and Technology, Dongguan University of Technology, Dongguan 523808, China; School of Computer Science and Technology, Wuhan University of Technology, Wuhan 430070, China; Chongqing Engineering Research Center of Big Data Application for Smart Cities and Chongqing Key Laboratory of Big Data and Intelligent Computing, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing 400714, China; Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi 830000, China; Center of Research Excellence in Renewable Energy and Power Systems and the Department of Electrical and Computer Engineering, Faculty of Engineering, King Abdulaziz University, Jeddah 21589, Saudi Arabia; Department of Electrical and Computer Engineering, New Jersey Institute of Technology, Newark, NJ 07102, USA

Protein-protein interactions are of great significance for humans to understand the functional mechanisms of *** the rapid development of high-throughput genomic technologies, massive protein-protein interaction (PPI) data have been generated, making it very difficult to analyze them *** address this problem, this paper presents a distributed framework by reimplementing one of the state-of-the-art algorithms, i.e., CoFex, using *** do so, an in-depth analysis of its limitations is conducted from the perspectives of efficiency and memory consumption when applying it for large-scale PPI data analysis and *** solutions are then devised to overcome these *** particular, we adopt a novel tree-based data structure to reduce the heavy memory consumption caused by the huge sequence information of *** that, its procedure is modified by following the MapReduce framework to carry out the prediction task in a distributed manner. A series of extensive experiments have been conducted to evaluate the performance of our framework in terms of both efficiency and *** results well demonstrate that the proposed framework can considerably improve its computational efficiency by more than two orders of magnitude while retaining the same high accuracy.

Keywords: Distributed computing; large-scale prediction; machine learning; MapReduce; protein-protein interaction (PPI)
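
For readers unfamiliar with the MapReduce pattern named in the abstract above, the sketch below runs the three stages (map, shuffle, reduce) in a single process on toy protein sequences. It is not CoFex and not the authors' framework: counting length-3 subsequences is only a stand-in workload, no cluster runtime is involved, and all function names and sequences are hypothetical.

```python
from collections import defaultdict
from itertools import chain


def map_phase(record, k=3):
    """Map: emit (k-mer, 1) for every length-k window of one protein sequence."""
    _protein_id, sequence = record
    return [(sequence[i:i + k], 1) for i in range(len(sequence) - k + 1)]


def shuffle(pairs):
    """Shuffle: group emitted values by key, as the framework does between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: sum the counts for one k-mer."""
    return key, sum(values)


def run_job(records, k=3):
    """Run map, shuffle, and reduce sequentially in one process."""
    mapped = chain.from_iterable(map_phase(r, k) for r in records)
    return dict(reduce_phase(key, vals) for key, vals in shuffle(mapped).items())


# Hypothetical toy sequences; real inputs would be split across worker nodes.
records = [("P1", "MKVLA"), ("P2", "KVLAM")]
counts = run_job(records)
print(counts)
```

Because each call to `map_phase` touches one record and each call to `reduce_phase` touches one key, both stages can be spread over many machines without shared state, which is where the distributed speed-up comes from.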

Degradation analysis and optimization of temperature effect on MEMRISTOR-based Neural Network Accelerators by electro-thermal simulation

2020 International Conference on Electronics, Communications and Information technology, CECIT 2020

Authors: Shang, Mengjun; Yin, Longxiang; Xu, Ning. Affiliations: School of Information Engineering, Wuhan University of Technology, Wuhan, China; State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China

Nowadays, memristor-based neural network accelerators have been widely studied due to their outstanding performance in massively parallel vector-matrix multiplication. However, the memristor is sensitive to temperature, and its on/off state operation window can be seriously degraded by increasing temperature, which may lead to computation failures in memristor-based NN accelerators. In this work, we establish an electro-thermal simulation platform to evaluate the temperature impact on memristor-based NN accelerators. With this platform, we first investigate the impact on computation accuracy of temperature increases in different NN layers in the accelerators. We then apply a temperature-aware NN weight mapping scheme to the most temperature-sensitive layer and achieve a 28.89% improvement in computation accuracy, which differs by only 0.06% from the improvement achieved by applying the mapping scheme to the whole NN model. This finding can help to simplify the temperature-aware hardware optimization design in memristor-based neural network accelerators and reduce the power consumption. © Published under licence by IOP Publishing Ltd.

Keywords: Memristors

FIRE: a dataset for feedback integration and refinement evaluation of multimodal models

Proceedings of the 38th International Conference on Neural Information Processing Systems

Authors: Pengxiang Li; Zhi Gao; Bofei Zhang; Tao Yuan; Yuwei Wu; Mehrtash Harandi; Yunde Jia; Song-Chun Zhu; Qing Li. Affiliations: Beijing Key Laboratory of Intelligent Information Technology School of Computer Science & Technology Beijing Institute of Technology and State Key Laboratory of General Artificial Intelligence BIGAI State Key Laboratory of General Artificial Intelligence BIGAI and State Key Laboratory of General Artificial Intelligence Peking University State Key Laboratory of General Artificial Intelligence BIGAI Beijing Key Laboratory of Intelligent Information Technology School of Computer Science & Technology Beijing Institute of Technology and Guangdong Laboratory of Machine Perception and Intelligent Computing Shenzhen MSU-BIT University Department of Electrical and Computer System Engineering Monash University Guangdong Laboratory of Machine Perception and Intelligent Computing Shenzhen MSU-BIT University and Beijing Key Laboratory of Intelligent Information Technology School of Computer Science & Technology Beijing Institute of Technology State Key Laboratory of General Artificial Intelligence BIGAI and State Key Laboratory of General Artificial Intelligence Peking University and Department of Automation Tsinghua University

ISBN: (print) 9798331314385

Vision language models (VLMs) have achieved impressive progress in diverse applications, becoming a prevalent research direction. In this paper, we build FIRE, a feedback-refinement dataset, consisting of 1.1M multi-turn conversations that are derived from 27 source datasets, empowering VLMs to spontaneously refine their responses based on user feedback across diverse tasks. To scale up the data collection, FIRE is collected in two components: FIRE-100K and FIRE-1M, where FIRE-100K is generated by GPT-4V, and FIRE-1M is freely generated via models trained on FIRE-100K. Then, we build FIRE-Bench, a benchmark to comprehensively evaluate the feedback-refining capability of VLMs, which contains 11K feedback-refinement conversations as the test data, two evaluation settings, and a model to provide feedback for VLMs. We develop the FIRE-LLaVA model by fine-tuning LLaVA on FIRE-100K and FIRE-1M, which shows remarkable feedback-refining capability on FIRE-Bench and outperforms untrained VLMs by 50%, making more efficient user-agent interactions and underscoring the significance of the FIRE dataset.
