Refine search results

Document type

  • 204 journal articles
  • 170 conference papers

Holdings

  • 374 electronic documents
  • 0 print holdings

Subject classification

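  • 235 records: Engineering
    • 192 records: Computer Science and Technology...
    • 168 records: Software Engineering
    • 49 records: Information and Communication Engineering
    • 29 records: Control Science and Engineering
    • 22 records: Optical Engineering
    • 18 records: Mechanical Engineering
    • 16 records: Architecture
    • 16 records: Civil Engineering
    • 15 records: Cyberspace Security
    • 13 records: Electrical Engineering
    • 11 records: Electronic Science and Technology (...
    • 11 records: Surveying and Mapping Science and Technology
    • 11 records: Bioengineering
    • 10 records: Chemical Engineering and Technology
    • 8 records: Safety Science and Engineering
    • 6 records: Transportation Engineering
    • 6 records: Biomedical Engineering (...
    • 5 records: Instrument Science and Technology
    • 4 records: Power Engineering and Engineering Thermo...
  • 76 records: Management
    • 43 records: Library, Information and Archives Manage...
    • 40 records: Management Science and Engineering (...
    • 8 records: Business Administration
  • 67 records: Science
    • 35 records: Mathematics
    • 17 records: Statistics (degrees in science, ...
    • 12 records: Physics
    • 12 records: Biology
    • 10 records: Chemistry
  • 4 records: Economics
    • 4 records: Applied Economics
  • 4 records: Law
    • 4 records: Sociology
    • 3 records: Law
  • 2 records: Agriculture
  • 2 records: Medicine
  • 1 record: Education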

Topics

  • 15 records: semantics
  • 12 records: image segmentati...
  • 10 records: training
  • 9 records: reinforcement le...
  • 9 records: contrastive lear...
  • 8 records: visual languages
  • 7 records: speech processin...
  • 7 records: convolution
  • 7 records: computer vision
  • 7 records: image reconstruc...
  • 6 records: semantic segment...
  • 6 records: distillation
  • 6 records: visualization
  • 6 records: pipelines
  • 5 records: costs
  • 5 records: benchmarking
  • 5 records: codes
  • 4 records: object detection
  • 4 records: signal processin...
  • 4 records: redundancy

Institutions

  • 214 records: key laboratory o...
  • 62 records: institute of art...
  • 40 records: key laboratory o...
  • 36 records: tencent youtu la...
  • 33 records: peng cheng labor...
  • 32 records: school of inform...
  • 25 records: key laboratory o...
  • 22 records: fujian key labor...
  • 20 records: fujian key labor...
  • 18 records: youtu lab tencen...
  • 17 records: key laboratory o...
  • 10 records: school of inform...
  • 10 records: national univers...
  • 8 records: tencent
  • 8 records: skywork ai
  • 8 records: school of comput...
  • 8 records: the key laborato...
  • 8 records: department of ar...
  • 7 records: key laboratory o...
  • 6 records: department of co...

Authors

  • 152 records: ji rongrong
  • 67 records: sun xiaoshuai
  • 49 records: rongrong ji
  • 40 records: cao liujuan
  • 37 records: ji jiayi
  • 30 records: zhou yiyi
  • 30 records: zhang shengchuan
  • 28 records: wang cheng
  • 25 records: ma yiwei
  • 24 records: zhang yuxin
  • 24 records: zheng xiawu
  • 22 records: chao fei
  • 19 records: luo gen
  • 18 records: lin mingbao
  • 18 records: zhang yan
  • 17 records: xiaoshuai sun
  • 16 records: wang haowei
  • 15 records: shen yunhang
  • 15 records: jiang guannan
  • 15 records: li hui

Language

  • 318 records: English
  • 56 records: other

Search condition: "Institution=Key Laboratory of Multimedia Trusted Perception and Efficient Computing"
374 records; showing 151-160

DVHGNN: Multi-Scale Dilated Vision HGNN for Efficient Vision Recognition
arXiv, 2025
Authors: Li, Caoshuo; Li, Tanzhe; Hu, Xiaobin; Luo, Donghao; Jin, Taisong
Affiliations: Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, Xiamen University, China; School of Informatics, Xiamen University, China; Tencent Youtu Lab, China
Recently, Vision Graph Neural Network (ViG) has gained considerable attention in computer vision. Despite its groundbreaking innovation, Vision Graph Neural Network encounters key issues including the quadratic comput...

Not All Attention is Needed: Parameter and Computation Efficient Tuning for Multi-modal Large Language Models via Effective Attention Skipping
arXiv, 2024
Authors: Wu, Qiong; Ye, Weihao; Zhou, Yiyi; Sun, Xiaoshuai; Ji, Rongrong
Affiliations: Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, Xiamen University, 361005, China; Institute of Artificial Intelligence, Xiamen University, 361005, China
Recently, Multi-modal Large Language Models (MLLMs) have garnered an influx of interest from both academia and industry. However, for the downstream task applications, MLLMs not only require to update a large number o...

Cross-Modality Perturbation Synergy Attack for Person Re-identification
38th Conference on Neural Information Processing Systems, NeurIPS 2024
Authors: Gong, Yunpeng; Zhong, Zhun; Qu, Yansong; Luo, Zhiming; Ji, Rongrong; Jiang, Min
Affiliations: School of Informatics, Xiamen University, China; School of Computer Science and Information Engineering, Hefei University of Technology, China; The Department of Artificial Intelligence, Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, School of Informatics, Key Laboratory of Digital Protection and Intelligent Processing of Intangible Cultural Heritage of Fujian and Taiwan, Ministry of Culture and Tourism, Xiamen University, Fujian, Xiamen 361005, China
In recent years, there has been significant research focusing on addressing security concerns in single-modal person re-identification (ReID) systems that are based on RGB images. However, the safety of cross-modality...

The dormant neuron phenomenon in multi-agent reinforcement learning value factorization
Proceedings of the 38th International Conference on Neural Information Processing Systems
Authors: Haoyuan Qin; Chennan Ma; Mian Deng; Zhengzhu Liu; Songzhu Mei; Xinwang Liu; Cheng Wang; Siqi Shen
Affiliations: Fujian Key Laboratory of Sensing and Computing for Smart Cities, School of Informatics, Xiamen University (XMU), China and Key Laboratory of Multimedia Trusted Perception and Efficient Computing, XMU, China; School of Computer, National University of Defense Technology, China
In this work, we study the dormant neuron phenomenon in multi-agent reinforcement learning value factorization, where the mixing network suffers from reduced network expressivity caused by an increasing number of inac...

Fit and Prune: Fast and Training-free Visual Token Pruning for Multi-modal Large Language Models
arXiv, 2024
Authors: Ye, Weihao; Wu, Qiong; Lin, Wenhao; Zhou, Yiyi
Affiliations: Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, Xiamen University, 361005, China; Institute of Artificial Intelligence, Xiamen University, 361005, China
Recent progress in Multimodal Large Language Models (MLLMs) often use large image tokens to compensate the visual shortcoming of MLLMs, which not only exhibits obvious redundancy but also greatly exacerbates the alrea...

U-SAM: Upgrade Segment Anything Model With Semantic-Aware and Memory-efficient
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Xiaofeng Jin; Jie Hu; Jianghang Lin; Shengchuan Zhang; Liujuan Cao
Affiliations: Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, Xiamen University, P.R. China; Learning and Vision Lab, National University of Singapore, Singapore
Segment Anything Model (SAM) has achieved remarkable success in the field of class-agnostic image segmentation by utilizing points or boxes as prompts. However, we identify two significant limitations when compared to...

Proposal Distillation of Multi-Modal Feature Aggregation Network for Video Object Detection
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Authors: Zhenyu Qiu; Qiang Qi; Yang Lu; Yan Yan; Hanzi Wang
Affiliations: Fujian Key Laboratory of Sensing and Computing for Smart City, School of Informatics, Xiamen University, Xiamen, China; The Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, Xiamen University, Xiamen, China
Video object detection is a challenging task due to deteriorated object appearances. In order to bolster per-frame feature representations, one way is to aggregate features from relevant frames. However, relying exclu...

SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation
arXiv, 2024
Authors: Yang, Danni; Ji, Jiayi; Ma, Yiwei; Guo, Tianyu; Wang, Haowei; Sun, Xiaoshuai; Ji, Rongrong
Affiliations: Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, School of Informatics, Xiamen University, 361005, China; Youtu Lab, Tencent, Shanghai, China
In this paper, we introduce SemiRES, a semi-supervised framework that effectively leverages a combination of labeled and unlabeled data to perform RES. A significant hurdle in applying semi-supervised techniques to RE...

Edge Guided Network with Motion Enhancement for Few-Shot Action Recognition
IEEE Transactions on Circuits and Systems for Video Technology, 2025, vol. 35, no. 6, pp. 5331-5342
Authors: Du, Kaiwen; Ye, Weirong; Guo, Hanyu; Yan, Yan; Wang, Hanzi
Affiliations: Huawei Technologies, Hangzhou 310051, China; Ministry of Education of China, Xiamen University, Fujian Key Laboratory of Sensing and Computing for Smart City, School of Informatics, Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Xiamen 361005, China; Shanghai Artificial Intelligence Laboratory, Shanghai 200232, China
Existing state-of-the-art methods for few-shot action recognition (FSAR) achieve promising performance by spatial and temporal modeling. However, most current methods ignore the importance of edge information and moti...

Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model
arXiv, 2024
Authors: Yang, Danni; Dong, Ruohan; Ji, Jiayi; Ma, Yiwei; Wang, Haowei; Sun, Xiaoshuai; Ji, Rongrong
Affiliations: Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, School of Informatics, Xiamen University, China; Youtu Lab., Tencent, Shanghai, China
Recently, diffusion models have increasingly demonstrated their capabilities in vision understanding. By leveraging prompt-based learning to construct sentences, these models have shown proficiency in classification a...