咨询与建议

限定检索结果

文献类型

  • 50,479 篇 会议
  • 1,421 册 图书
  • 1,041 篇 期刊文献
  • 1 篇 学位论文

馆藏范围

  • 52,940 篇 电子文献
  • 4 种 纸本馆藏

日期分布

学科分类号

  • 31,811 篇 工学
    • 24,804 篇 计算机科学与技术...
    • 12,568 篇 软件工程
    • 5,153 篇 光学工程
    • 4,756 篇 电气工程
    • 4,436 篇 信息与通信工程
    • 4,257 篇 机械工程
    • 3,956 篇 控制科学与工程
    • 2,474 篇 生物工程
    • 1,728 篇 生物医学工程(可授...
    • 1,584 篇 仪器科学与技术
    • 1,317 篇 电子科学与技术(可...
    • 793 篇 化学工程与技术
    • 698 篇 安全科学与工程
    • 542 篇 交通运输工程
    • 379 篇 建筑学
    • 331 篇 土木工程
  • 11,839 篇 理学
    • 6,434 篇 物理学
    • 5,405 篇 数学
    • 2,761 篇 生物学
    • 1,910 篇 统计学(可授理学、...
    • 801 篇 化学
    • 669 篇 系统科学
  • 5,305 篇 医学
    • 5,094 篇 临床医学
    • 729 篇 基础医学(可授医学...
    • 459 篇 药学(可授医学、理...
  • 3,350 篇 管理学
    • 1,953 篇 图书情报与档案管...
    • 1,535 篇 管理科学与工程(可...
    • 479 篇 工商管理
  • 720 篇 艺术学
    • 718 篇 设计学(可授艺术学...
  • 428 篇 法学
    • 401 篇 社会学
  • 297 篇 农学
  • 197 篇 教育学
  • 163 篇 经济学
  • 63 篇 文学
  • 49 篇 军事学

主题

  • 17,385 篇 computer vision
  • 9,017 篇 pattern recognit...
  • 4,196 篇 training
  • 3,815 篇 feature extracti...
  • 3,134 篇 cameras
  • 2,870 篇 computational mo...
  • 2,789 篇 image segmentati...
  • 2,622 篇 visualization
  • 2,573 篇 shape
  • 2,533 篇 face recognition
  • 2,171 篇 robustness
  • 2,123 篇 computer science
  • 1,973 篇 object detection
  • 1,959 篇 computer archite...
  • 1,878 篇 layout
  • 1,853 篇 object recogniti...
  • 1,802 篇 three-dimensiona...
  • 1,725 篇 neural networks
  • 1,708 篇 humans
  • 1,691 篇 image recognitio...

机构

  • 165 篇 univ chinese aca...
  • 144 篇 tsinghua univers...
  • 136 篇 national laborat...
  • 108 篇 univ sci & techn...
  • 104 篇 zhejiang univers...
  • 100 篇 shanghai jiao to...
  • 95 篇 microsoft resear...
  • 94 篇 university of sc...
  • 86 篇 zhejiang univ pe...
  • 84 篇 shanghai ai lab ...
  • 74 篇 school of comput...
  • 69 篇 computer vision ...
  • 68 篇 peking univ peop...
  • 68 篇 chinese acad sci...
  • 65 篇 chinese univ hon...
  • 63 篇 institute of inf...
  • 62 篇 google res mount...
  • 61 篇 univ oxford oxfo...
  • 59 篇 univ toronto on
  • 57 篇 swiss fed inst t...

作者

  • 91 篇 van gool luc
  • 87 篇 umapada pal
  • 76 篇 zhang lei
  • 64 篇 lee seong-whan
  • 49 篇 vittorio murino
  • 42 篇 yang yi
  • 34 篇 nassir navab
  • 33 篇 li xin
  • 33 篇 jie yang
  • 32 篇 liu yang
  • 31 篇 escalera sergio
  • 31 篇 loy chen change
  • 30 篇 ling haibin
  • 30 篇 h. bischof
  • 29 篇 zhou jie
  • 29 篇 vasconcelos nuno
  • 29 篇 jan-michael frah...
  • 29 篇 hanqing lu
  • 28 篇 blumenstein mich...
  • 27 篇 jia yunde

语言

  • 51,871 篇 英文
  • 835 篇 其他
  • 241 篇 中文
  • 22 篇 土耳其文
  • 5 篇 西班牙文
  • 2 篇 日文
  • 2 篇 葡萄牙文
  • 2 篇 俄文
检索条件"任意字段=IEEE Conference on Computer Vision and Pattern Recognition"
52943 条 记 录,以下是91-100 订阅
排序:
Adapters Strike Back
Adapters Strike Back
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Steitz, Jan-Martin Roth, Stefan Tech Univ Darmstadt Dept Comp Sci Darmstadt Germany Hessian AI Darmstadt Germany
Adapters provide an efficient and lightweight mechanism for adapting trained transformer models to a variety of different tasks. However, they have often been found to be outperformed by other adaptation mechanisms in... 详细信息
来源: 评论
PointInfinity: Resolution-Invariant Point Diffusion Models
PointInfinity: Resolution-Invariant Point Diffusion Models
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Huang, Zixuan Johnson, Justin Debnath, Shoubhik Rehg, James M. Wu, Chao-Yuan Meta FAIR Menlo Pk CA 94025 USA Univ Illinois Champaign IL 61820 USA
We present PointInfinity, an efficient family of point cloud diffusion models. Our core idea is to use a transformer-based architecture with a fixed-size, resolution-invariant latent representation. This enables effic... 详细信息
来源: 评论
AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings
AirPlanes: Accurate Plane Estimation via 3D-Consistent Embed...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Watson, Jamie Aleotti, Filippo Sayed, Mohamed Qureshi, Zawar Mac Aodha, Oisin Brostow, Gabriel Firman, Michael Vicente, Sara Niantic San Francisco CA 94111 USA Univ Edinburgh Edinburgh Midlothian Scotland UCL London England
Extracting planes from a 3D scene is useful for downstream tasks in robotics and augmented reality. In this paper we tackle the problem of estimating the planar surfaces in a scene from posed images. Our first finding... 详细信息
来源: 评论
You Only Need Less Attention at Each Stage in vision Transformers
You Only Need Less Attention at Each Stage in Vision Transfo...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Zhang, Shuoxi Liu, Hanpeng Lin, Stephen He, Kun Huazhong Univ Sci & Technol Wuhan Peoples R China Microsoft Res Asia Beijing Peoples R China
The advent of vision Transformers (ViTs) marks a substantial paradigm shift in the realm of computer vision. ViTs capture the global information of images through self-attention modules, which perform dot product comp... 详细信息
来源: 评论
3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surfaces
3DFIRES: Few Image 3D REconstruction for Scenes with Hidden ...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Jin, Linyi Kulkarni, Nilesh Fouhey, David F. Univ Michigan Ann Arbor MI 48109 USA NYU New York NY USA
This paper introduces 3DFIRES, a novel system for scene-level 3D reconstruction from posed images. Designed to work with as few as one view, 3DFIRES reconstructs the complete geometry of unseen scenes, including hidde... 详细信息
来源: 评论
Training vision Transformers for Semi-Supervised Semantic Segmentation
Training Vision Transformers for Semi-Supervised Semantic Se...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Hu, Xinting Jiang, Li Schiele, Bernt Max Planck Inst Informat Saarland Informat Campus Munich Germany
We present S(4)Former, a novel approach to training vision Transformers for Semi-Supervised Semantic Segmentation (S-4). At its core, S(4)Former employs a vision Transformer within a classic teacher-student framework,...
来源: 评论
V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs
V*: Guided Visual Search as a Core Mechanism in Multimodal L...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Wu, Penghao Xie, Saining Univ Calif San Diego La Jolla CA 92093 USA NYU New York NY USA
When we look around and perform complex tasks, how we see and selectively process what we see is crucial. However, the lack of this visual search mechanism in current multimodal LLMs (MLLMs) hinders their ability to f... 详细信息
来源: 评论
Token Transformation Matters: Towards Faithful Post-hoc Explanation for vision Transformer
Token Transformation Matters: Towards Faithful Post-hoc Expl...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Wu, Junyi Duan, Bin Kang, Weitai Tang, Hao Yan, Yan IIT Dept Comp Sci Chicago IL 60616 USA Carnegie Mellon Univ Robot Inst Pittsburgh PA 15213 USA
While Transformers have rapidly gained popularity in various computer vision applications, post-hoc explanations of their internal mechanisms remain largely unexplored. vision Transformers extract visual information b... 详细信息
来源: 评论
Enhancing vision-Language Pre-training with Rich Supervisions
Enhancing Vision-Language Pre-training with Rich Supervision...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Gao, Yuan Shi, Kunyu Zhu, Pengkai Belval, Edouard Nuriel, Oren Appalaraju, Srikar Ghadar, Shabnam Tu, Zhuowen Mahadevan, Vijay Soatto, Stefano Stanford Univ Stanford CA 94305 USA AWS AI Labs Seattle WA USA Amazon Seattle WA 98109 USA
We propose Strongly Supervised pre-training with ScreenShots (S4) - a novel pre-training paradigm for vision-Language Models using data from large-scale web screenshot rendering. Using web screenshots unlocks a treasu... 详细信息
来源: 评论
Contrasting intra-modal and ranking cross-modal hard negatives to enhance visio-linguistic compositional understanding
Contrasting intra-modal and ranking cross-modal hard negativ...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Zhang, Le Awal, Rabiul Agrawal, Aishwarya Mila Quebec AI Inst Montreal PQ Canada Univ Montreal Montreal PQ Canada Canada CIFAR AI Chair Montreal PQ Canada
vision-Language Models (VLMs), such as CLIP, exhibit strong image-text comprehension abilities, facilitating advances in several downstream tasks such as zero-shot image classification, image-text retrieval, and text-... 详细信息
来源: 评论