咨询与建议

限定检索结果

文献类型

  • 14 篇 期刊文献
  • 10 篇 会议

馆藏范围

  • 24 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 15 篇 工学
    • 12 篇 计算机科学与技术...
    • 10 篇 软件工程
    • 3 篇 建筑学
    • 3 篇 土木工程
    • 3 篇 测绘科学与技术
    • 2 篇 机械工程
    • 1 篇 仪器科学与技术
    • 1 篇 控制科学与工程
    • 1 篇 生物医学工程(可授...
    • 1 篇 安全科学与工程
  • 5 篇 理学
    • 4 篇 数学
    • 3 篇 统计学(可授理学、...
    • 1 篇 系统科学
  • 5 篇 管理学
    • 4 篇 管理科学与工程(可...
    • 2 篇 工商管理
    • 1 篇 图书情报与档案管...
  • 2 篇 经济学
    • 2 篇 应用经济学

主题

  • 3 篇 diffusion
  • 2 篇 reinforcement le...
  • 2 篇 cost effectivene...
  • 1 篇 object detection
  • 1 篇 data privacy
  • 1 篇 knowledge graph
  • 1 篇 inference algori...
  • 1 篇 risk perception
  • 1 篇 incremental lear...
  • 1 篇 annealing
  • 1 篇 tensors
  • 1 篇 costs
  • 1 篇 gaussian distrib...
  • 1 篇 simulated anneal...
  • 1 篇 computer vision
  • 1 篇 training
  • 1 篇 visual languages

机构

  • 10 篇 peng cheng labor...
  • 10 篇 key laboratory o...
  • 7 篇 institute of art...
  • 6 篇 bytedance inc.
  • 4 篇 key laboratory o...
  • 3 篇 fuxi ai lab nete...
  • 3 篇 fujian engineeri...
  • 3 篇 bytedance inc
  • 3 篇 shenzhen researc...
  • 2 篇 key laboratory o...
  • 2 篇 baidu inc
  • 2 篇 fuxi ai lab nete...
  • 1 篇 bytedance inc. a...
  • 1 篇 deep wisdom inc
  • 1 篇 key laboratory o...
  • 1 篇 key laboratory o...
  • 1 篇 key laboratory o...
  • 1 篇 samsara inc
  • 1 篇 fujian key lab o...
  • 1 篇 key laboratory o...

作者

  • 10 篇 ji rongrong
  • 7 篇 sun xiaoshuai
  • 6 篇 xiawu zheng
  • 6 篇 rongrong ji
  • 6 篇 fei chao
  • 5 篇 zheng xiawu
  • 5 篇 zhang rongsheng
  • 5 篇 chao fei
  • 5 篇 xiao xuefeng
  • 5 篇 li huixia
  • 5 篇 wang rui
  • 4 篇 wen shilei
  • 4 篇 ma yuexiao
  • 3 篇 rui wang
  • 3 篇 ji jiayi
  • 3 篇 tang jiji
  • 3 篇 yan wang
  • 3 篇 xuefeng xiao
  • 3 篇 ma yiwei
  • 3 篇 huixia li

语言

  • 19 篇 英文
  • 5 篇 其他
检索条件"机构=ByteDance Inc. and Key Laboratory of Multimedia Trusted Perception and Efficient Computing"
24 条 记 录,以下是1-10 订阅
排序:
Outlier-Aware Slicing for Post-Training Quantization in Vision Transformer  41
Outlier-Aware Slicing for Post-Training Quantization in Visi...
收藏 引用
41st International Conference on Machine Learning, ICML 2024
作者: Ma, Yuexiao Li, Huixia Zheng, Xiawu Ling, Feng Xiao, Xuefeng Wang, Rui Wen, Shilei Chao, Fei Ji, Rongrong Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China School of Informatics Xiamen University 361005 China ByteDance Inc. China Peng Cheng Laboratory Shenzhen China Institute of Artificial Intelligence Xiamen University China
Post-Training Quantization (PTQ) is a vital technique for network compression and acceleration, gaining prominence as model sizes inc.ease. This paper addresses a critical challenge in PTQ: the severe impact of outlie...
来源: 评论
AFFINEQUANT: AFFINE TRANSFORMATION QUANTIZATION FOR LARGE LANGUAGE MODELS  12
AFFINEQUANT: AFFINE TRANSFORMATION QUANTIZATION FOR LARGE LA...
收藏 引用
12th International Conference on Learning Representations, ICLR 2024
作者: Ma, Yuexiao Li, Huixia Zheng, Xiawu Ling, Feng Xiao, Xuefeng Wang, Rui Wen, Shilei Chao, Fei Ji, Rongrong Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China School of Informatics Xiamen University 361005 China ByteDance Inc. China Peng Cheng Laboratory Shenzhen China Institute of Artificial Intelligence Xiamen University China
The significant resource requirements associated with Large-scale Language Models (LLMs) have generated considerable interest in the development of techniques aimed at compressing and accelerating neural networks. Amo... 详细信息
来源: 评论
Outlier-aware slicing for post-training quantization in vision transformer  24
Outlier-aware slicing for post-training quantization in visi...
收藏 引用
Proceedings of the 41st International Conference on Machine Learning
作者: Yuexiao Ma Huixia Li Xiawu Zheng Feng Ling Xuefeng Xiao Rui Wang Shilei Wen Fei Chao Rongrong Ji ByteDance Inc. and Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China School of Informatics Xiamen University P.R. China. ByteDance Inc. Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China School of Informatics Xiamen University P.R. China and Peng Cheng Laboratory Shenzhen China and Institute of Artificial Intelligence Xiamen University Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China School of Informatics Xiamen University P.R. China. Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China School of Informatics Xiamen University P.R. China and and Institute of Artificial Intelligence Xiamen University
Post-Training Quantization (PTQ) is a vital technique for network compression and acceleration, gaining prominence as model sizes inc.ease. This paper addresses a critical challenge in PTQ: the severe impact of outlie...
来源: 评论
Clover: Towards A Unified Video-Language Alignment and Fusion Model
Clover: Towards A Unified Video-Language Alignment and Fusio...
收藏 引用
Conference on Computer Vision and Pattern Recognition (CVPR)
作者: Jingjia Huang Yinan Li Jiashi Feng Xinglong Wu Xiaoshuai Sun Rongrong Ji Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University China ByteDance Inc China
Building a universal Video-Language model for solving various video understanding tasks (e.g., text-video retrieval, video question answering) is an open challenge to the machine learning field. Towards this goal, mos...
来源: 评论
Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting
arXiv
收藏 引用
arXiv 2025年
作者: Qu, Yansong Chen, Dian Li, Xinyang Li, Xiaofan Zhang, Shengchuan Cao, Liujuan Ji, Rongrong Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University China Baidu Inc China
Recent advancements in 3D scene editing have been propelled by the rapid development of generative models. Existing methods typically utilize generative models to perform text-guided editing on 3D representations, suc... 详细信息
来源: 评论
ComfyGPT: A Self-Optimizing Multi-Agent System for Comprehensive ComfyUI Workflow Generation
arXiv
收藏 引用
arXiv 2025年
作者: Huang, Oucheng Ma, Yuhang Zhao, Zeng Wu, Mingrui Ji, Jiayi Zhang, Rongsheng Hu, Zhipeng Sun, Xiaoshuai Ji, Rongrong Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education Xiamen University China Fuxi AI Lab Netease Inc China
ComfyUI provides a widely-adopted, workflow-based interface that enables users to customize various image generation tasks through an intuitive node-based architecture. However, the intricate connections between nodes... 详细信息
来源: 评论
NeRF-DetS: Enhanced Adaptive Spatial-wise Sampling and View-wise Fusion Strategies for NeRF-based Indoor Multi-view 3D Object Detection
arXiv
收藏 引用
arXiv 2024年
作者: Huang, Chi Li, Xinyang Qu, Yansong Wu, Changli Li, Xiaofan Zhang, Shengchuan Cao, Liujuan Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University 361005 China Baidu Inc China
In indoor scenes, the diverse distribution of object locations and scales makes the visual 3D perception task a big challenge. Previous works (e.g., NeRF-Det) have demonstrated that implicit representation has the cap... 详细信息
来源: 评论
Towards efficient Diffusion-Based Image Editing with Instant Attention Masks
arXiv
收藏 引用
arXiv 2024年
作者: Zou, Siyu Tang, Jiji Zhou, Yiyi He, Jing Zhao, Chaoyi Zhang, Rongsheng Hu, Zhipeng Sun, Xiaoshuai Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University 361005 China Fuxi AI Lab NetEase Inc. Hangzhou China
Diffusion-based Image Editing (DIE) is an emerging research hot-spot, which often applies a semantic mask to control the target area for diffusion-based editing. However, most existing solutions obtain these masks via... 详细信息
来源: 评论
MLLM-Selector: Necessity and Diversity-driven High-Value Data Selection for Enhanced Visual Instruction Tuning
arXiv
收藏 引用
arXiv 2025年
作者: Ma, Yiwei Xu, Guohai Sun, Xiaoshuai Ji, Jiayi Lou, Jie Zhang, Debing Ji, Rongrong Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University Fujian361005 China Xiaohongshu Inc Shanghai200025 China
Visual instruction tuning (VIT) has emerged as a crucial technique for enabling multi-modal large language models (MLLMs) to follow user instructions adeptly. Yet, a significant gap persists in understanding the attri... 详细信息
来源: 评论
StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization
arXiv
收藏 引用
arXiv 2024年
作者: Zhang, Jinlu Tang, Jiji Zhang, Rongsheng Lv, Tangjie Sun, Xiaoshuai Key Laboratory of Multimedia Trusted Perception and Efficient Computing Ministry of Education of China Xiamen University 361005 China Fuxi AI Lab Netease Inc
Story visualization has gained inc.easing attention in artificial intelligence. However, existing methods still struggle with maintaining a balance between character identity preservation and text-semantics alignment,... 详细信息
来源: 评论