咨询与建议

限定检索结果

文献类型

  • 203 篇 期刊文献
  • 200 篇 会议

馆藏范围

  • 403 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 266 篇 工学
    • 224 篇 计算机科学与技术...
    • 168 篇 软件工程
    • 59 篇 控制科学与工程
    • 51 篇 信息与通信工程
    • 22 篇 光学工程
    • 18 篇 机械工程
    • 17 篇 网络空间安全
    • 15 篇 电气工程
    • 15 篇 建筑学
    • 15 篇 土木工程
    • 11 篇 电子科学与技术(可...
    • 11 篇 生物工程
    • 10 篇 测绘科学与技术
    • 10 篇 化学工程与技术
    • 10 篇 安全科学与工程
    • 8 篇 交通运输工程
    • 5 篇 仪器科学与技术
    • 5 篇 生物医学工程(可授...
    • 4 篇 动力工程及工程热...
  • 78 篇 管理学
    • 43 篇 图书情报与档案管...
    • 42 篇 管理科学与工程(可...
    • 8 篇 工商管理
  • 65 篇 理学
    • 34 篇 数学
    • 16 篇 统计学(可授理学、...
    • 12 篇 生物学
    • 11 篇 物理学
    • 10 篇 化学
  • 4 篇 经济学
    • 4 篇 应用经济学
  • 4 篇 法学
    • 4 篇 社会学
    • 3 篇 法学
  • 2 篇 农学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 15 篇 semantics
  • 12 篇 image segmentati...
  • 11 篇 semantic segment...
  • 10 篇 contrastive lear...
  • 10 篇 training
  • 9 篇 reinforcement le...
  • 8 篇 visual languages
  • 7 篇 speech processin...
  • 7 篇 convolution
  • 7 篇 computer vision
  • 7 篇 image reconstruc...
  • 6 篇 distillation
  • 6 篇 visualization
  • 6 篇 pipelines
  • 5 篇 object detection
  • 5 篇 graph neural net...
  • 5 篇 costs
  • 5 篇 benchmarking
  • 5 篇 codes
  • 4 篇 signal processin...

机构

  • 240 篇 key laboratory o...
  • 69 篇 institute of art...
  • 42 篇 key laboratory o...
  • 41 篇 school of inform...
  • 38 篇 tencent youtu la...
  • 33 篇 peng cheng labor...
  • 27 篇 fujian key labor...
  • 25 篇 key laboratory o...
  • 20 篇 fujian key labor...
  • 18 篇 youtu lab tencen...
  • 16 篇 key laboratory o...
  • 12 篇 national univers...
  • 11 篇 school of inform...
  • 8 篇 tencent
  • 8 篇 school of comput...
  • 8 篇 skywork ai
  • 8 篇 school of comput...
  • 8 篇 the key laborato...
  • 8 篇 department of ar...
  • 7 篇 key laboratory o...

作者

  • 153 篇 ji rongrong
  • 69 篇 sun xiaoshuai
  • 49 篇 rongrong ji
  • 40 篇 cao liujuan
  • 38 篇 ji jiayi
  • 34 篇 wang cheng
  • 33 篇 zhang shengchuan
  • 32 篇 zhou yiyi
  • 26 篇 zheng xiawu
  • 26 篇 ma yiwei
  • 25 篇 zhang yuxin
  • 23 篇 chao fei
  • 20 篇 lin mingbao
  • 20 篇 zhang yan
  • 19 篇 luo gen
  • 17 篇 wen chenglu
  • 17 篇 shen yunhang
  • 17 篇 xiaoshuai sun
  • 16 篇 jiang guannan
  • 16 篇 wang haowei

语言

  • 269 篇 英文
  • 134 篇 其他
检索条件"机构=The Key Laboratory of Multimedia Trusted Perception and Efficient Computing"
403 条 记 录,以下是131-140 订阅
排序:
ControlMLLM: training-free visual prompt learning for multimodal large language models  24
ControlMLLM: training-free visual prompt learning for multim...
收藏 引用
Proceedings of the 38th International Conference on Neural Information Processing Systems
作者: Mingrui Wu Xinyue Cai Jiayi Ji Jiale Li Oucheng Huang Gen Luo Hao Fei Guannan Jiang Xiaoshuai Sun Rongrong Ji Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University P.R. China National University of Singapore CATL
In this work, we propose a training-free method to inject visual prompts into Multimodal Large Language Models (MLLMs) through learnable latent variable optimization. We observe that attention, as the core module of M...
来源: 评论
AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction  39
AdaCo: Overcoming Visual Foundation Model Noise in 3D Semant...
收藏 引用
39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025
作者: Zou, Pufan Zhao, Shijia Huang, Weijie Xia, Qiming Wen, Chenglu Li, Wei Wang, Cheng Fujian Key Laboratory of Sensing and Computing for Smart Cities Xiamen University China Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University China Inceptio United States
Recently, Visual Foundation Models (VFMs) have shown a remarkable generalization performance in 3D perception tasks. However, their effectiveness in large-scale outdoor datasets remains constrained by the scarcity of ... 详细信息
来源: 评论
Semi-Supervised Panoptic Narrative Grounding
arXiv
收藏 引用
arXiv 2023年
作者: Yang, Danni Ji, Jiayi Sun, Xiaoshuai Wang, Haowei Li, Yinan Ma, Yiwei Ji, Rongrong Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University Fujian Xiamen China
Despite considerable progress, the advancement of Panoptic Narrative Grounding (PNG) remains hindered by costly annotations. In this paper, we introduce a novel Semi-Supervised Panoptic Narrative Grounding (SS-PNG) le... 详细信息
来源: 评论
StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization
arXiv
收藏 引用
arXiv 2024年
作者: Zhang, Jinlu Tang, Jiji Zhang, Rongsheng Lv, Tangjie Sun, Xiaoshuai Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University 361005 China Fuxi AI Lab Netease Inc
Story visualization has gained increasing attention in artificial intelligence. However, existing methods still struggle with maintaining a balance between character identity preservation and text-semantics alignment,... 详细信息
来源: 评论
UniPTS: A Unified Framework for Proficient Post-Training Sparsity
arXiv
收藏 引用
arXiv 2024年
作者: Xie, Jingjing Zhang, Yuxin Lin, Mingbao Lin, Zhihang Cao, Liujuan Ji, Rongrong Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China School of Informatics Xiamen University China Tencent Youtu Lab China
Post-training Sparsity (PTS) is a recently emerged avenue that chases efficient network sparsity with limited data in need. Existing PTS methods, however, undergo significant performance degradation compared with trad...
来源: 评论
DMAD: Dual Memory Bank for Real-World Anomaly Detection
arXiv
收藏 引用
arXiv 2024年
作者: Hu, Jianlong Chen, Xu Gan, Zhenye Peng, Jinlong Zhang, Shengchuan Zhang, Jiangning Wang, Yabiao Wang, Chengjie Cao, Liujuan Ji, Rongrong Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China School of Informatics Xiamen University China Youtu Lab Tencent China
Training a unified model is considered to be more suitable for practical industrial anomaly detection scenarios due to its generalization ability and storage efficiency. However, this multi-class setting, which exclus... 详细信息
来源: 评论
DSMNet: Deep High-Precision 3-D Surface Modeling from Sparse Point Cloud Frames
收藏 引用
IEEE Geoscience and Remote Sensing Letters 2023年 20卷 1-1页
作者: Qiu, Changjie Wang, Zhiyong Lin, Xiuhong Zang, Yu Wang, Cheng Liu, Weiquan Xiamen University Fujian Key Laboratory of Sensing and Computing for Smart Cities The Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China School of Informatics Xiamen361005 China
Existing point cloud modeling datasets primarily express the modeling precision by pose or trajectory precision rather than the point cloud modeling effect itself. Under this demand, we first independently construct a... 详细信息
来源: 评论
Improving Multilingual Sign Language Translation with Automatically Clustered Language Family Information  31
Improving Multilingual Sign Language Translation with Automa...
收藏 引用
31st International Conference on Computational Linguistics, COLING 2025
作者: Zhang, Ruiquan Hu, Cong Yu, Pei Chen, Yidong Department of Artificial Intelligence School of Informatics Xiamen University 361005 China Ministry of Culture and Tourism 361005 China National Language Resources Monitoring and Research Center for Education and Teaching Media Xiamen University 361005 China Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University 361005 China
Sign Language Translation (SLT) bridges the communication gap between deaf and hearing individuals by converting sign language videos into spoken language texts. While most SLT research has focused on bilingual transl... 详细信息
来源: 评论
efficient Infrared Image Super-Resolution Reconstruction via Guided Filter Coefficients Estimation with Parallax Attention Mechanism
Efficient Infrared Image Super-Resolution Reconstruction via...
收藏 引用
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
作者: Qingyao Wu Bosheng Chen Chen Li Xiaotong Tu Xinghao Ding Yue Huang Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University P.R. China National Key Laboratory of Infrared Detection Technologies Shanghai China
Due to the spectral range mismatch between the images, building an efficient infrared (IR) image super-resolution algorithm suitable for embedded devices remains a significant challenge. Given that visible images poss... 详细信息
来源: 评论
What Kind of Visual Tokens Do We Need? Training-free Visual Token Pruning for Multi-modal Large Language Models from the Perspective of Graph
arXiv
收藏 引用
arXiv 2025年
作者: Jiang, Yutao Wu, Qiong Lin, Wenhao Yu, Wei Zhou, Yiyi Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University 361005 China Institute of Artificial Intelligence Xiamen University 361005 China
Recent Multimodal Large Language Models (MLLMs) often use a large number of visual tokens to compensate their visual shortcoming, leading to excessive computation and obvious visual redundancy. In this paper, we inves... 详细信息
来源: 评论