咨询与建议

限定检索结果

文献类型

  • 199 篇 期刊文献
  • 158 篇 会议

馆藏范围

  • 357 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 233 篇 工学
    • 191 篇 计算机科学与技术...
    • 168 篇 软件工程
    • 49 篇 信息与通信工程
    • 29 篇 控制科学与工程
    • 22 篇 光学工程
    • 18 篇 机械工程
    • 16 篇 建筑学
    • 16 篇 土木工程
    • 15 篇 网络空间安全
    • 12 篇 电气工程
    • 11 篇 电子科学与技术(可...
    • 11 篇 测绘科学与技术
    • 11 篇 生物工程
    • 10 篇 化学工程与技术
    • 8 篇 安全科学与工程
    • 6 篇 交通运输工程
    • 6 篇 生物医学工程(可授...
    • 5 篇 仪器科学与技术
    • 4 篇 动力工程及工程热...
  • 76 篇 管理学
    • 43 篇 图书情报与档案管...
    • 40 篇 管理科学与工程(可...
    • 8 篇 工商管理
  • 67 篇 理学
    • 35 篇 数学
    • 17 篇 统计学(可授理学、...
    • 12 篇 物理学
    • 12 篇 生物学
    • 10 篇 化学
  • 4 篇 经济学
    • 4 篇 应用经济学
  • 4 篇 法学
    • 4 篇 社会学
    • 3 篇 法学
  • 2 篇 农学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 14 篇 semantics
  • 12 篇 image segmentati...
  • 9 篇 reinforcement le...
  • 9 篇 contrastive lear...
  • 9 篇 training
  • 8 篇 visual languages
  • 7 篇 speech processin...
  • 7 篇 convolution
  • 7 篇 computer vision
  • 7 篇 image reconstruc...
  • 6 篇 semantic segment...
  • 6 篇 distillation
  • 6 篇 pipelines
  • 5 篇 visualization
  • 5 篇 costs
  • 5 篇 benchmarking
  • 5 篇 codes
  • 4 篇 object detection
  • 4 篇 signal processin...
  • 4 篇 redundancy

机构

  • 212 篇 key laboratory o...
  • 60 篇 institute of art...
  • 39 篇 key laboratory o...
  • 35 篇 tencent youtu la...
  • 31 篇 peng cheng labor...
  • 30 篇 school of inform...
  • 22 篇 fujian key labor...
  • 20 篇 fujian key labor...
  • 20 篇 key laboratory o...
  • 17 篇 key laboratory o...
  • 17 篇 youtu lab tencen...
  • 10 篇 school of inform...
  • 8 篇 tencent
  • 8 篇 skywork ai
  • 8 篇 school of comput...
  • 8 篇 the key laborato...
  • 7 篇 key laboratory o...
  • 7 篇 national univers...
  • 7 篇 department of ar...
  • 6 篇 shanghai jiao to...

作者

  • 151 篇 ji rongrong
  • 66 篇 sun xiaoshuai
  • 41 篇 rongrong ji
  • 39 篇 cao liujuan
  • 36 篇 ji jiayi
  • 30 篇 zhou yiyi
  • 30 篇 zhang shengchuan
  • 28 篇 wang cheng
  • 25 篇 ma yiwei
  • 24 篇 zhang yuxin
  • 24 篇 zheng xiawu
  • 22 篇 chao fei
  • 19 篇 luo gen
  • 18 篇 lin mingbao
  • 18 篇 zhang yan
  • 15 篇 shen yunhang
  • 15 篇 jiang guannan
  • 15 篇 wang haowei
  • 15 篇 li hui
  • 14 篇 wen chenglu

语言

  • 301 篇 英文
  • 56 篇 其他
检索条件"机构=Key Laboratory of Multimedia Trusted Perception and Efficient Computing"
357 条 记 录,以下是41-50 订阅
排序:
HmPEAR: A Dataset for Human Pose Estimation and Action Recognition  24
HmPEAR: A Dataset for Human Pose Estimation and Action Recog...
收藏 引用
32nd ACM International Conference on multimedia, MM 2024
作者: Lin, Yitai Wei, Zhijie Zhang, Wanfa Lin, Xiping Dai, Yudi Wen, Chenglu Shen, Siqi Xu, Lan Wang, Cheng Fujian Key Laboratory of Sensing and Computing for Smart Cities Xiamen University Xiamen China Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University Xiamen China Xiamen University Xiamen China ShanghaiTech University Shanghai China
We introduce HmPEAR, a novel dataset crafted for advancing research in 3D Human Pose Estimation (3D HPE) and Human Action Recognition (HAR), with a primary focus on outdoor environments. This dataset offers a synchron... 详细信息
来源: 评论
Deep Instruction Tuning for Segment Anything Model  24
Deep Instruction Tuning for Segment Anything Model
收藏 引用
32nd ACM International Conference on multimedia, MM 2024
作者: Huang, Xiaorui Luo, Gen Zhu, Chaoyang Tong, Bo Zhou, Yiyi Sun, Xiaoshuai Ji, Rongrong Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University Fujian Xiamen China The Department of Computer Science and Engineering The Hong Kong University of Science and Technology Hong Kong
Recently, Segment Anything Model (SAM) has become a research hotspot in the fields of multimedia and computer vision, which exhibits powerful yet versatile capabilities on various (un) conditional image segmentation t... 详细信息
来源: 评论
SimCLIP: Refining Image-Text Alignment with Simple Prompts for Zero-/Few-shot Anomaly Detection  24
SimCLIP: Refining Image-Text Alignment with Simple Prompts f...
收藏 引用
32nd ACM International Conference on multimedia, MM 2024
作者: Deng, Chenghao Xu, Haote Chen, Xiaolu Xu, Haodi Tu, Xiaotong Ding, Xinghao Huang, Yue Institute of Artificial Intelligence Xiamen University Xiamen China School of Informatics Xiamen University Xiamen China School of Informatics Xiamen University Key Laboratory of Multimedia Trusted Perception and Efficient Computing Xiamen China
Recently, large pre-trained vision-language models, such as CLIP, have demonstrated significant potential in zero-/few-shot anomaly detection tasks. However, existing methods not only rely on expert knowledge to manua... 详细信息
来源: 评论
LIGHTMOTION: A LIGHT AND TUNING-FREE METHOD FOR SIMULATING CAMERA MOTION IN VIDEO GENERATION
arXiv
收藏 引用
arXiv 2025年
作者: Song, Quanjian Lin, Zhihang Zeng, Zhanpeng Zhang, Ziyue Cao, Liujuan Ji, Rongrong Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University China
Existing camera motion-controlled video generation methods face computational bottlenecks in fine-tuning and inference. This paper proposes LightMotion, a light and tuning-free method for simulating camera motion in v... 详细信息
来源: 评论
Evolving High-Quality Rendering and Reconstruction in a Unified Framework with Contribution-Adaptive Regularization
arXiv
收藏 引用
arXiv 2025年
作者: Shen, You Zhang, Zhipeng Li, Xinyang Qu, Yansong Lin, Yu Zhang, Shengchuan Cao, Liujuan Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University China
Representing 3D scenes from multiview images is a core challenge in computer vision and graphics, which requires both precise rendering and accurate reconstruction. Recently, 3D Gaussian Splatting (3DGS) has garnered ... 详细信息
来源: 评论
Training-Free Hierarchical Scene Understanding for Gaussian Splatting with Superpoint Graphs
arXiv
收藏 引用
arXiv 2025年
作者: Dai, Shaohui Qu, Yansong Li, Zheyan Li, Xinyang Zhang, Shengchuan Cao, Liujuan Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University China
Bridging natural language and 3D geometry is a crucial step toward flexible, language-driven scene understanding. While recent advances in 3D Gaussian Splatting (3DGS) have enabled fast and high-quality scene reconstr... 详细信息
来源: 评论
Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text  38
Director3D: Real-world Camera Trajectory and 3D Scene Genera...
收藏 引用
38th Conference on Neural Information Processing Systems, NeurIPS 2024
作者: Li, Xinyang Lai, Zhangyu Xu, Linning Qu, Yansong Cao, Liujuan Zhang, Shengchuan Dai, Bo Ji, Rongrong Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University China Shanghai Artificial Intelligence Laboratory China The Chinese University of Hong Kong Hong Kong University of Hong Kong Hong Kong
Recent advancements in 3D generation have leveraged synthetic datasets with ground truth 3D assets and predefined camera trajectories. However, the potential of adopting real-world datasets, which can produce signific...
来源: 评论
ESR-DDLN : Enhanced Single Image Super-Resolution Via Dual-Domain Learning Network
ESR-DDLN : Enhanced Single Image Super-Resolution Via Dual-D...
收藏 引用
IEEE International Conference on multimedia and Expo (ICME)
作者: Zihao He Shengchuan Zhang Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University
Most existing CNN-based super-resolution (SR) methods focus solely on the spatial domain. We argue that frequency domain details are essential for reconstructing fine textures and patterns. To leverage the frequency i... 详细信息
来源: 评论
Representation Purification for End-to-End Speech Translation  31
Representation Purification for End-to-End Speech Translatio...
收藏 引用
31st International Conference on Computational Linguistics, COLING 2025
作者: Zhang, Chengwei Zhou, Yue Zhao, Rui Chen, Yidong Shi, Xiaodong School of Informatics Xiamen University China Key Laboratory of Digital Protection and Intelligent Processing of Intangible Cultural Heritage of Fujian and Taiwan Ministry of Culture and Tourism China Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University Xiamen China
Speech-to-text translation (ST) is a cross-modal task that involves converting spoken language into text in a different language. Previous research primarily focused on enhancing speech translation by facilitating kno...
来源: 评论
RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation  38
RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End...
收藏 引用
38th Conference on Neural Information Processing Systems, NeurIPS 2024
作者: Wu, Changli Chen, Qi Ji, Jiayi Wang, Haowei Ma, Yiwei Huang, You Luo, Gen Fei, Hao Sun, Xiaoshuai Ji, Rongrong Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University 361005 China Shanghai Innovation Institute Shanghai China Youtu Lab Tencent Shanghai China National University of Singapore Singapore
3D Referring Expression Segmentation (3D-RES) aims to segment 3D objects by correlating referring expressions with point clouds. However, traditional approaches frequently encounter issues like over-segmentation or mi...
来源: 评论