咨询与建议

限定检索结果

文献类型

  • 20,798 篇 会议
  • 87 篇 期刊文献
  • 65 册 图书

馆藏范围

  • 20,949 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 13,274 篇 工学
    • 10,922 篇 计算机科学与技术...
    • 2,484 篇 机械工程
    • 2,307 篇 软件工程
    • 913 篇 光学工程
    • 770 篇 电气工程
    • 556 篇 控制科学与工程
    • 405 篇 信息与通信工程
    • 210 篇 测绘科学与技术
    • 131 篇 生物医学工程(可授...
    • 104 篇 电子科学与技术(可...
    • 100 篇 生物工程
    • 92 篇 仪器科学与技术
    • 56 篇 化学工程与技术
    • 52 篇 建筑学
    • 48 篇 土木工程
    • 44 篇 安全科学与工程
    • 38 篇 力学(可授工学、理...
    • 38 篇 航空宇航科学与技...
    • 35 篇 交通运输工程
  • 3,457 篇 医学
    • 3,449 篇 临床医学
    • 34 篇 基础医学(可授医学...
  • 2,315 篇 理学
    • 1,154 篇 数学
    • 1,132 篇 物理学
    • 417 篇 统计学(可授理学、...
    • 386 篇 生物学
    • 252 篇 系统科学
    • 57 篇 化学
  • 353 篇 管理学
    • 184 篇 图书情报与档案管...
    • 176 篇 管理科学与工程(可...
    • 32 篇 工商管理
  • 28 篇 法学
  • 20 篇 农学
  • 15 篇 教育学
  • 9 篇 经济学
  • 8 篇 艺术学
  • 5 篇 文学
  • 5 篇 军事学

主题

  • 8,202 篇 computer vision
  • 3,009 篇 pattern recognit...
  • 2,732 篇 training
  • 1,769 篇 computational mo...
  • 1,657 篇 visualization
  • 1,482 篇 cameras
  • 1,415 篇 shape
  • 1,369 篇 three-dimensiona...
  • 1,369 篇 face recognition
  • 1,285 篇 image segmentati...
  • 1,272 篇 feature extracti...
  • 1,178 篇 robustness
  • 1,090 篇 semantics
  • 1,040 篇 layout
  • 1,006 篇 object detection
  • 975 篇 object recogniti...
  • 968 篇 computer science
  • 946 篇 computer archite...
  • 946 篇 benchmark testin...
  • 931 篇 codes

机构

  • 174 篇 univ sci & techn...
  • 154 篇 carnegie mellon ...
  • 148 篇 univ chinese aca...
  • 144 篇 chinese univ hon...
  • 113 篇 microsoft resear...
  • 103 篇 zhejiang univ pe...
  • 99 篇 swiss fed inst t...
  • 97 篇 tsinghua univ pe...
  • 93 篇 tsinghua univers...
  • 91 篇 microsoft res as...
  • 88 篇 shanghai ai lab ...
  • 81 篇 zhejiang univers...
  • 76 篇 alibaba grp peop...
  • 74 篇 hong kong univ s...
  • 73 篇 university of sc...
  • 72 篇 peking univ peop...
  • 69 篇 university of ch...
  • 68 篇 shanghai jiao to...
  • 66 篇 google res mount...
  • 66 篇 univ oxford oxfo...

作者

  • 80 篇 van gool luc
  • 71 篇 zhang lei
  • 59 篇 timofte radu
  • 48 篇 yang yi
  • 47 篇 xiaoou tang
  • 44 篇 darrell trevor
  • 43 篇 tian qi
  • 43 篇 luc van gool
  • 42 篇 loy chen change
  • 42 篇 sun jian
  • 42 篇 li fei-fei
  • 40 篇 qi tian
  • 39 篇 li stan z.
  • 37 篇 liu yang
  • 37 篇 chen xilin
  • 36 篇 shan shiguang
  • 35 篇 liu xiaoming
  • 35 篇 vasconcelos nuno
  • 35 篇 torralba antonio
  • 32 篇 zhou jie

语言

  • 20,927 篇 英文
  • 14 篇 中文
  • 6 篇 其他
  • 2 篇 日文
  • 2 篇 土耳其文
检索条件"任意字段=2009 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009"
20950 条 记 录,以下是451-460 订阅
排序:
360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries
360Loc: A Dataset and Benchmark for Omnidirectional Visual L...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Huang, Huajian Liu, Changkun Zhu, Yipeng Cheng, Hui Braud, Tristan Yeung, Sai-Kit Hong Kong Univ Sci & Technol Hong Kong Peoples R China Sun Yat Sen Univ Guangzhou Peoples R China
Portable 360 degrees cameras are becoming a cheap and efficient tool to establish large visual databases. By capturing omnidirectional views of a scene, these cameras could expedite building environment models that ar... 详细信息
来源: 评论
OpenEQA: Embodied Question Answering in the Era of Foundation Models
OpenEQA: Embodied Question Answering in the Era of Foundatio...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Majumdar, Arjun Ajay, Anurag Zhang, Xi Aohan Punya, Pranav Yenamandra, Sriram Henaff, Mikael Silwal, Sneha Mcvay, Paul Maksymets, Oleksandr Arnaud, Sergio Yadav, Karmesh Li, Qiyang Newman, Ben Sharma, Mohit Berges, Vincent Zhang, Shiqi Agrawal, Pulkit Bisk, Yonatan Batra, Dhruv Kalakrishnan, Mrinal Meier, Franziska Paxton, Chris Sax, Alexander Rajeswaran, Aravind Georgia Tech Atlanta GA 30332 USA MIT 77 Massachusetts Ave Cambridge MA 02139 USA SUNY Binghamton Binghamton NY USA Meta AI Menlo Pk CA USA Univ Calif Berkeley Berkeley CA USA CMU Pittsburgh PA USA Meta Fundamental AI Res FAIR Menlo Pk CA USA
We present a modern formulation of Embodied Question Answering (EQA) as the task of understanding an environment well enough to answer questions about it in natural language. An agent can achieve such an understanding... 详细信息
来源: 评论
Egocentric Whole-Body Motion Capture with FisheyeViT and Diffusion-Based Motion Refinement
Egocentric Whole-Body Motion Capture with FisheyeViT and Dif...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Wang, Jian Cao, Zhe Luvizon, Diogo Liu, Lingjie Sarkar, Kripasindhu Tang, Danhang Beeler, Thabo Theobalt, Christian MPI Informat & Saarland Informat Campus Saarbrucken Germany Google Mountain View CA USA Univ Penn Philadelphia PA USA Saarbrucken Res Ctr Visual Com Interact & Artific Saarbrucken Germany
In this work, we explore egocentric whole-body motion capture using a single fisheye camera, which simultaneously estimates human body and hand motion. This task presents significant challenges due to three factors: t... 详细信息
来源: 评论
HRVDA: High-Resolution Visual Document Assistant
HRVDA: High-Resolution Visual Document Assistant
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Liu, Chaohu Yin, Kun Cao, Haoyu Jiang, Xinghua Li, Xin Liu, Yinsong Jiang, Deqiang Sun, Xing Xu, Linli Univ Sci & Technol China Sch Comp Sci & Technol Hefei Anhui Peoples R China State Key Lab Cognit Intelligence Hefei Anhui Peoples R China Tencent YouTu Lab Shanghai Peoples R China
Leveraging vast training data, multimodal large language models (MLLMs) have demonstrated formidable general visual comprehension capabilities and achieved remarkable performance across various tasks. However, their p... 详细信息
来源: 评论
Seeing the Unseen: Visual Common Sense for Semantic Placement
Seeing the Unseen: Visual Common Sense for Semantic Placemen...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Ramrakhya, Ram Kembhavi, Aniruddha Batra, Dhruv Kira, Zsolt Zeng, Kuo-Hao Weihs, Luca Georgia Inst Technol Atlanta GA 30332 USA PRIOR Allen Inst AI Seattle WA USA PRIOR AI2 Seattle WA USA
computer vision tasks typically involve describing what is present in an image (e.g. classification, detection, segmentation, and captioning). We study a visual common sense task that requires understanding 'what ... 详细信息
来源: 评论
PromptSync: Bridging Domain Gaps in vision-Language Models through Class-Aware Prototype Alignment and Discrimination
PromptSync: Bridging Domain Gaps in Vision-Language Models t...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Khandelwal, Anant Glance AI Bangalore Karnataka India
The potential for zero-shot generalization in vision-language (V-L) models such as CLIP has spurred their widespread adoption in addressing numerous downstream tasks. Previous methods have employed test-time prompt tu... 详细信息
来源: 评论
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Panda-70M: Captioning 70M Videos with Multiple Cross-Modalit...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Chen, Tsai-Shien Siarohin, Aliaksandr Menapace, Willi Deyneka, Ekaterina Chao, Hsiang-wei Jeon, Byung Eun Fang, Yuwei Lee, Hsin-Ying Ren, Jian Yang, Ming-Hsuan Tulyakov, Sergey Snap Inc Santa Monica CA 90405 USA Univ Calif Merced Merced CA 95343 USA Univ Trento Trento Italy Snap Santa Monica CA USA
The quality of the data and annotation upper-bounds the quality of a downstream model. While there exist large text corpora and image-text pairs, high-quality video-text data is much harder to collect. First of all, m... 详细信息
来源: 评论
Multi-criteria Token Fusion with One-step-ahead Attention for Efficient vision Transformers
Multi-criteria Token Fusion with One-step-ahead Attention fo...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Lee, Sanghyeok Choi, Joonmyung Kim, Hyunwoo J. Korea Univ Dept Comp Sci & Engn Seoul South Korea
vision Transformer (ViT) has emerged as a prominent backbone for computer vision. For more efficient ViTs, recent works lessen the quadratic cost of the self- attention layer by pruning or fusing the redundant tokens.... 详细信息
来源: 评论
Making Visual Sense of Oracle Bones for You and Me
Making Visual Sense of Oracle Bones for You and Me
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Qiao, Runqi Yang, Lan Pang, Kaiyue Zhang, Honggang Beijing Univ Posts & Telecommun Sch Artificial Intelligence Beijing Peoples R China Univ Surrey CVSSP SketchX Guildford Surrey England
Visual perception evolves over time. This is particularly the case of oracle bone scripts, where visual glyphs seem intuitive to people from distant past prove difficult to be understood in contemporary eyes. While se... 详细信息
来源: 评论
SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction
SIFU: Side-view Conditioned Implicit Function for Real-world...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Zhang, Zechuan Yang, Zongxin Yang, Yi Zhejiang Univ CCAL ReLER Hangzhou Peoples R China
Creating high-quality 3D models of clothed humans from single images for real-world applications is crucial. Despite recent advancements, accurately reconstructing humans in complex poses or with loose clothing from i... 详细信息
来源: 评论