Search query: Institution = "The Key Laboratory of Multimedia Trusted Perception and Efficient Computing"
374 records; showing results 61-70
Traffic Simulator: A Traffic Risk Hotspot Identification Platform
9th IEEE Smart World Congress, SWC 2023
Authors: Tang, Qingxian; Liu, Changzhen; Gao, Jiannan; Si, Liwei; Chen, Longbiao; Wang, Cheng
Affiliations: Xiamen University Fujian Key Laboratory of Sensing and Computing for Smart Cities 361005 China; Xiamen University Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China 361005 China; Xiamen University School of Information 361005 China
With the increment of urban population, the number of vehicles on the road is also constantly increasing, leading to a rise in traffic risks. Frequent urban traffic accidents significantly threaten people's lives,...
Diverse Consensuses Paired with Motion Estimation-Based Multi-Model Fitting
32nd ACM International Conference on Multimedia, MM 2024
Authors: Yin, Wenyu; Lin, Shuyuan; Lu, Yang; Wang, Hanzi
Affiliations: Fujian Key Laboratory of Sensing and Computing for Smart City School of Informatics Xiamen University Xiamen China; Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University Xiamen China; College of Cyber Security College of Information Science and Technology Jinan University Guangzhou China
Multi-model fitting aims to robustly estimate the parameters of various model instances in data contaminated by noise and outliers. Most previous works employ only a single type of consensus or implicit fusion model t...
Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models
arXiv 2024
Authors: Wu, Mingrui; Ji, Jiayi; Huang, Oucheng; Li, Jiale; Wu, Yuhang; Sun, Xiaoshuai; Ji, Rongrong
Affiliations: Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University 361005 China
The issue of hallucinations is a prevalent concern in existing Large Vision-Language Models (LVLMs). Previous efforts have primarily focused on investigating object hallucinations, which can be easily alleviated by in...
Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings
arXiv 2024
Authors: Wu, Qiong; Lin, Wenhao; Ye, Weihao; Zhou, Yiyi; Sun, Xiaoshuai; Ji, Rongrong
Affiliations: Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University 361005 China
The excessive use of visual tokens in existing Multimodal Large Language Models (MLLMs) often exhibits obvious redundancy and brings in prohibitively expensive computation. To gain insights into this problem, we first ...
Cantor: Inspiring Multimodal Chain-of-Thought of MLLM
32nd ACM International Conference on Multimedia, MM 2024
Authors: Gao, Timin; Chen, Peixian; Zhang, Mengdan; Fu, Chaoyou; Shen, Yunhang; Zhang, Yan; Zhang, Shengchuan; Zheng, Xiawu; Sun, Xing; Cao, Liujuan; Ji, Rongrong
Affiliations: Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University Xiamen China; Tencent Youtu Lab Shanghai China; State Key Laboratory for Novel Software Technology Nanjing University China; School of Intelligence Science and Technology Nanjing University China
With the advent of large language models (LLMs) enhanced by the chain-of-thought (CoT) methodology, the visual reasoning problem is usually decomposed into manageable sub-tasks and tackled sequentially with various exte...
CaM: Cache Merging for Memory-efficient LLMs Inference
Proceedings of the 41st International Conference on Machine Learning
Authors: Yuxin Zhang; Yuxuan Du; Gen Luo; Yunshan Zhong; Zhenyu Zhang; Shiwei Liu; Rongrong Ji
Affiliations: Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University and Peng Cheng Laboratory Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University University of Texas at Austin Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University and University of Oxford and Eindhoven University of Technology Institute of Artificial Intelligence Xiamen University
Despite the exceptional performance of Large Language Models (LLMs), the substantial volume of key-value (KV) pairs cached during inference presents a barrier to their efficient deployment. To ameliorate this, recent ...
FocSAM: Delving Deeply into Focused Objects in Segmenting Anything
arXiv 2024
Authors: Huang, You; Lan, Zongyu; Cao, Liujuan; Lin, Xianming; Zhang, Shengchuan; Jiang, Guannan; Ji, Rongrong
Affiliations: Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University China China
The Segment Anything Model (SAM) marks a notable milestone in segmentation models, highlighted by its robust zero-shot capabilities and ability to handle diverse prompts. SAM follows a pipeline that separates interact...
Any-to-3D Generation via Hybrid Diffusion Supervision
arXiv 2024
Authors: Fan, Yijun; Ma, Yiwei; Ji, Jiayi; Sun, Xiaoshuai; Ji, Rongrong
Affiliations: Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University 361005 China
Recent progress in 3D object generation has been fueled by the strong priors offered by diffusion models. However, existing models are tailored to specific tasks, accommodating only one modality at a time and necessit...
GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane
arXiv 2024
Authors: Qu, Yansong; Dai, Shaohui; Li, Xinyang; Lin, Jianghang; Cao, Liujuan; Zhang, Shengchuan; Ji, Rongrong
Affiliations: Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University Fujian China
3D open-vocabulary scene understanding, crucial for advancing augmented reality and robotic applications, involves interpreting and locating specific regions within a 3D space as directed by natural language instructi...
TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning
arXiv 2024
Authors: Xie, Jingjing; Zhang, Yuxin; Peng, Jun; Huang, Zhaohong; Cao, Liujuan
Affiliations: Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University Xiamen China
Despite the efficiency of prompt learning in transferring vision-language models (VLMs) to downstream tasks, existing methods mainly learn the prompts in a coarse-grained manner where the learned prompt vectors are sh...