咨询与建议

限定检索结果

文献类型

  • 19,636 篇 会议
  • 49 篇 期刊文献
  • 3 册 图书

馆藏范围

  • 19,687 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 12,587 篇 工学
    • 10,355 篇 计算机科学与技术...
    • 2,449 篇 机械工程
    • 2,010 篇 软件工程
    • 815 篇 光学工程
    • 599 篇 电气工程
    • 433 篇 控制科学与工程
    • 329 篇 信息与通信工程
    • 211 篇 测绘科学与技术
    • 80 篇 生物医学工程(可授...
    • 75 篇 生物工程
    • 69 篇 电子科学与技术(可...
    • 67 篇 仪器科学与技术
    • 37 篇 建筑学
    • 36 篇 土木工程
    • 34 篇 力学(可授工学、理...
    • 31 篇 航空宇航科学与技...
    • 29 篇 安全科学与工程
    • 23 篇 交通运输工程
    • 21 篇 化学工程与技术
    • 20 篇 材料科学与工程(可...
  • 3,435 篇 医学
    • 3,434 篇 临床医学
  • 1,980 篇 理学
    • 1,001 篇 数学
    • 972 篇 物理学
    • 356 篇 统计学(可授理学、...
    • 340 篇 生物学
    • 235 篇 系统科学
    • 26 篇 化学
  • 262 篇 管理学
    • 141 篇 管理科学与工程(可...
    • 124 篇 图书情报与档案管...
    • 26 篇 工商管理
  • 19 篇 法学
  • 12 篇 农学
  • 8 篇 教育学
  • 6 篇 经济学
  • 4 篇 艺术学
  • 2 篇 军事学

主题

  • 7,949 篇 computer vision
  • 2,773 篇 training
  • 2,712 篇 pattern recognit...
  • 1,771 篇 computational mo...
  • 1,660 篇 visualization
  • 1,427 篇 cameras
  • 1,383 篇 three-dimensiona...
  • 1,345 篇 shape
  • 1,236 篇 face recognition
  • 1,222 篇 feature extracti...
  • 1,213 篇 image segmentati...
  • 1,117 篇 robustness
  • 1,094 篇 semantics
  • 977 篇 layout
  • 961 篇 object detection
  • 946 篇 benchmark testin...
  • 944 篇 computer archite...
  • 931 篇 codes
  • 897 篇 computer science
  • 861 篇 deep learning

机构

  • 174 篇 univ sci & techn...
  • 159 篇 carnegie mellon ...
  • 148 篇 univ chinese aca...
  • 144 篇 chinese univ hon...
  • 109 篇 microsoft resear...
  • 103 篇 zhejiang univ pe...
  • 103 篇 tsinghua univ pe...
  • 99 篇 swiss fed inst t...
  • 92 篇 tsinghua univers...
  • 89 篇 microsoft res as...
  • 88 篇 shanghai ai lab ...
  • 81 篇 zhejiang univers...
  • 76 篇 alibaba grp peop...
  • 74 篇 university of sc...
  • 73 篇 hong kong univ s...
  • 73 篇 university of ch...
  • 72 篇 peking univ peop...
  • 68 篇 shanghai jiao to...
  • 66 篇 univ oxford oxfo...
  • 65 篇 google res mount...

作者

  • 79 篇 van gool luc
  • 70 篇 zhang lei
  • 60 篇 timofte radu
  • 48 篇 yang yi
  • 48 篇 luc van gool
  • 46 篇 xiaoou tang
  • 43 篇 darrell trevor
  • 43 篇 tian qi
  • 42 篇 loy chen change
  • 42 篇 sun jian
  • 42 篇 li fei-fei
  • 40 篇 li stan z.
  • 39 篇 qi tian
  • 36 篇 chen xilin
  • 36 篇 torralba antonio
  • 35 篇 vasconcelos nuno
  • 35 篇 shan shiguang
  • 35 篇 liu yang
  • 34 篇 liu xiaoming
  • 34 篇 tao dacheng

语言

  • 19,682 篇 英文
  • 3 篇 中文
  • 2 篇 日文
  • 1 篇 其他
检索条件"任意字段=IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015"
19688 条 记 录,以下是451-460 订阅
排序:
3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos
3DGStream: On-the-Fly Training of 3D Gaussians for Efficient...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Sun, Jiakai Jiao, Han Li, Guangyuan Zhang, Zhanjie Zhao, Lei Xing, Wei Zhejiang Univ Hangzhou Peoples R China
Constructing photo-realistic Free-Viewpoint Videos ( FVVs) of dynamic scenes from multi-view videos remains a challenging endeavor. Despite the remarkable advance-ments achieved by current neural rendering techniques,... 详细信息
来源: 评论
Three Pillars improving vision Foundation Model Distillation for Lidar
Three Pillars improving Vision Foundation Model Distillation...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Puy, Gilles Gidaris, Spyros Boulch, Alexandre Simeoni, Oriane Sautier, Corentin Perez, Patrick Bursucl, Andrei Marlet, Renaud Valeo ai Paris France Kyutai Paris France Univ Gustave Eiffel CNRS LIGM Ecole Ponts Marne La Vallee France
Self-supervised image backbones can be used to address complex 2D tasks (e.g., semantic segmentation, object discovery) very efficiently and with little or no downstream supervision. Ideally, 3D backbones for lidar sh... 详细信息
来源: 评论
360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries
360Loc: A Dataset and Benchmark for Omnidirectional Visual L...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Huang, Huajian Liu, Changkun Zhu, Yipeng Cheng, Hui Braud, Tristan Yeung, Sai-Kit Hong Kong Univ Sci & Technol Hong Kong Peoples R China Sun Yat Sen Univ Guangzhou Peoples R China
Portable 360 degrees cameras are becoming a cheap and efficient tool to establish large visual databases. By capturing omnidirectional views of a scene, these cameras could expedite building environment models that ar... 详细信息
来源: 评论
OpenEQA: Embodied Question Answering in the Era of Foundation Models
OpenEQA: Embodied Question Answering in the Era of Foundatio...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Majumdar, Arjun Ajay, Anurag Zhang, Xi Aohan Punya, Pranav Yenamandra, Sriram Henaff, Mikael Silwal, Sneha Mcvay, Paul Maksymets, Oleksandr Arnaud, Sergio Yadav, Karmesh Li, Qiyang Newman, Ben Sharma, Mohit Berges, Vincent Zhang, Shiqi Agrawal, Pulkit Bisk, Yonatan Batra, Dhruv Kalakrishnan, Mrinal Meier, Franziska Paxton, Chris Sax, Alexander Rajeswaran, Aravind Georgia Tech Atlanta GA 30332 USA MIT 77 Massachusetts Ave Cambridge MA 02139 USA SUNY Binghamton Binghamton NY USA Meta AI Menlo Pk CA USA Univ Calif Berkeley Berkeley CA USA CMU Pittsburgh PA USA Meta Fundamental AI Res FAIR Menlo Pk CA USA
We present a modern formulation of Embodied Question Answering (EQA) as the task of understanding an environment well enough to answer questions about it in natural language. An agent can achieve such an understanding... 详细信息
来源: 评论
Egocentric Whole-Body Motion Capture with FisheyeViT and Diffusion-Based Motion Refinement
Egocentric Whole-Body Motion Capture with FisheyeViT and Dif...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Wang, Jian Cao, Zhe Luvizon, Diogo Liu, Lingjie Sarkar, Kripasindhu Tang, Danhang Beeler, Thabo Theobalt, Christian MPI Informat & Saarland Informat Campus Saarbrucken Germany Google Mountain View CA USA Univ Penn Philadelphia PA USA Saarbrucken Res Ctr Visual Com Interact & Artific Saarbrucken Germany
In this work, we explore egocentric whole-body motion capture using a single fisheye camera, which simultaneously estimates human body and hand motion. This task presents significant challenges due to three factors: t... 详细信息
来源: 评论
HRVDA: High-Resolution Visual Document Assistant
HRVDA: High-Resolution Visual Document Assistant
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Liu, Chaohu Yin, Kun Cao, Haoyu Jiang, Xinghua Li, Xin Liu, Yinsong Jiang, Deqiang Sun, Xing Xu, Linli Univ Sci & Technol China Sch Comp Sci & Technol Hefei Anhui Peoples R China State Key Lab Cognit Intelligence Hefei Anhui Peoples R China Tencent YouTu Lab Shanghai Peoples R China
Leveraging vast training data, multimodal large language models (MLLMs) have demonstrated formidable general visual comprehension capabilities and achieved remarkable performance across various tasks. However, their p... 详细信息
来源: 评论
Seeing the Unseen: Visual Common Sense for Semantic Placement
Seeing the Unseen: Visual Common Sense for Semantic Placemen...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Ramrakhya, Ram Kembhavi, Aniruddha Batra, Dhruv Kira, Zsolt Zeng, Kuo-Hao Weihs, Luca Georgia Inst Technol Atlanta GA 30332 USA PRIOR Allen Inst AI Seattle WA USA PRIOR AI2 Seattle WA USA
computer vision tasks typically involve describing what is present in an image (e.g. classification, detection, segmentation, and captioning). We study a visual common sense task that requires understanding 'what ... 详细信息
来源: 评论
PromptSync: Bridging Domain Gaps in vision-Language Models through Class-Aware Prototype Alignment and Discrimination
PromptSync: Bridging Domain Gaps in Vision-Language Models t...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Khandelwal, Anant Glance AI Bangalore Karnataka India
The potential for zero-shot generalization in vision-language (V-L) models such as CLIP has spurred their widespread adoption in addressing numerous downstream tasks. Previous methods have employed test-time prompt tu... 详细信息
来源: 评论
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Panda-70M: Captioning 70M Videos with Multiple Cross-Modalit...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Chen, Tsai-Shien Siarohin, Aliaksandr Menapace, Willi Deyneka, Ekaterina Chao, Hsiang-wei Jeon, Byung Eun Fang, Yuwei Lee, Hsin-Ying Ren, Jian Yang, Ming-Hsuan Tulyakov, Sergey Snap Inc Santa Monica CA 90405 USA Univ Calif Merced Merced CA 95343 USA Univ Trento Trento Italy Snap Santa Monica CA USA
The quality of the data and annotation upper-bounds the quality of a downstream model. While there exist large text corpora and image-text pairs, high-quality video-text data is much harder to collect. First of all, m... 详细信息
来源: 评论
Multi-criteria Token Fusion with One-step-ahead Attention for Efficient vision Transformers
Multi-criteria Token Fusion with One-step-ahead Attention fo...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Lee, Sanghyeok Choi, Joonmyung Kim, Hyunwoo J. Korea Univ Dept Comp Sci & Engn Seoul South Korea
vision Transformer (ViT) has emerged as a prominent backbone for computer vision. For more efficient ViTs, recent works lessen the quadratic cost of the self- attention layer by pruning or fusing the redundant tokens.... 详细信息
来源: 评论