咨询与建议

限定检索结果

文献类型

  • 1 篇 会议

馆藏范围

  • 1 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 1 篇 工学
    • 1 篇 计算机科学与技术...

主题

  • 1 篇 visual large lan...
  • 1 篇 visual foundatio...
  • 1 篇 semantic segment...
  • 1 篇 zero shot classi...
  • 1 篇 distillation
  • 1 篇 knowledge distil...

机构

  • 1 篇 nvidia santa cla...

作者

  • 1 篇 heinrich greg
  • 1 篇 molchanov pavlo
  • 1 篇 ranzinger mike
  • 1 篇 kautz jan

语言

  • 1 篇 英文
检索条件"主题词=Visual Large Language Models"
1 条 记 录,以下是1-10 订阅
排序:
AM-RADIO: Agglomerative Vision Foundation Model Reduce All Domains Into One
AM-RADIO: Agglomerative Vision Foundation Model Reduce All D...
收藏 引用
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
作者: Ranzinger, Mike Heinrich, Greg Kautz, Jan Molchanov, Pavlo NVIDIA Santa Clara CA 95051 USA
A handful of visual foundation models (VFMs) have recently emerged as the backbones for numerous downstream tasks. VFMs like CLIP, DINOv2, SAM are trained with distinct objectives, exhibiting unique characteristics fo... 详细信息
来源: 评论