咨询与建议

限定检索结果

文献类型

  • 20,860 篇 会议
  • 105 篇 期刊文献
  • 43 册 图书

馆藏范围

  • 21,007 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 13,620 篇 工学
    • 11,056 篇 计算机科学与技术...
    • 2,652 篇 机械工程
    • 2,252 篇 软件工程
    • 914 篇 光学工程
    • 885 篇 电气工程
    • 529 篇 控制科学与工程
    • 477 篇 信息与通信工程
    • 216 篇 测绘科学与技术
    • 135 篇 生物工程
    • 127 篇 生物医学工程(可授...
    • 98 篇 电子科学与技术(可...
    • 92 篇 仪器科学与技术
    • 46 篇 安全科学与工程
    • 40 篇 建筑学
    • 40 篇 化学工程与技术
    • 39 篇 土木工程
    • 37 篇 交通运输工程
    • 35 篇 力学(可授工学、理...
    • 33 篇 航空宇航科学与技...
  • 3,494 篇 医学
    • 3,489 篇 临床医学
    • 32 篇 基础医学(可授医学...
  • 2,247 篇 理学
    • 1,145 篇 物理学
    • 1,081 篇 数学
    • 401 篇 生物学
    • 384 篇 统计学(可授理学、...
    • 245 篇 系统科学
    • 46 篇 化学
  • 343 篇 管理学
    • 176 篇 管理科学与工程(可...
    • 168 篇 图书情报与档案管...
    • 34 篇 工商管理
  • 31 篇 法学
  • 19 篇 农学
  • 15 篇 教育学
  • 8 篇 经济学
  • 5 篇 艺术学
  • 2 篇 军事学
  • 1 篇 文学

主题

  • 8,141 篇 computer vision
  • 2,886 篇 training
  • 2,841 篇 pattern recognit...
  • 1,809 篇 computational mo...
  • 1,715 篇 visualization
  • 1,493 篇 cameras
  • 1,433 篇 three-dimensiona...
  • 1,433 篇 feature extracti...
  • 1,366 篇 shape
  • 1,360 篇 face recognition
  • 1,243 篇 image segmentati...
  • 1,135 篇 robustness
  • 1,124 篇 semantics
  • 992 篇 computer archite...
  • 985 篇 object detection
  • 982 篇 layout
  • 959 篇 benchmark testin...
  • 935 篇 codes
  • 900 篇 computer science
  • 898 篇 object recogniti...

机构

  • 174 篇 univ sci & techn...
  • 158 篇 univ chinese aca...
  • 153 篇 carnegie mellon ...
  • 145 篇 chinese univ hon...
  • 109 篇 microsoft resear...
  • 103 篇 zhejiang univ pe...
  • 99 篇 swiss fed inst t...
  • 95 篇 tsinghua univers...
  • 90 篇 microsoft res as...
  • 90 篇 tsinghua univ pe...
  • 88 篇 shanghai ai lab ...
  • 81 篇 zhejiang univers...
  • 77 篇 alibaba grp peop...
  • 74 篇 hong kong univ s...
  • 73 篇 university of sc...
  • 72 篇 peking univ peop...
  • 72 篇 university of ch...
  • 68 篇 shanghai jiao to...
  • 66 篇 univ oxford oxfo...
  • 65 篇 google res mount...

作者

  • 80 篇 van gool luc
  • 70 篇 zhang lei
  • 58 篇 timofte radu
  • 48 篇 yang yi
  • 47 篇 luc van gool
  • 46 篇 xiaoou tang
  • 44 篇 tian qi
  • 43 篇 darrell trevor
  • 42 篇 loy chen change
  • 42 篇 sun jian
  • 41 篇 qi tian
  • 40 篇 li stan z.
  • 38 篇 li fei-fei
  • 37 篇 chen xilin
  • 36 篇 shan shiguang
  • 35 篇 zhou jie
  • 35 篇 vasconcelos nuno
  • 35 篇 liu yang
  • 35 篇 torralba antonio
  • 34 篇 liu xiaoming

语言

  • 20,982 篇 英文
  • 10 篇 中文
  • 7 篇 其他
  • 5 篇 土耳其文
  • 2 篇 日文
  • 2 篇 葡萄牙文
检索条件"任意字段=2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016"
21008 条 记 录,以下是451-460 订阅
排序:
Teeth-SEG: An Efficient Instance Segmentation Framework for Orthodontic Treatment based on Multi-Scale Aggregation and Anthropic Prior Knowledge
Teeth-SEG: An Efficient Instance Segmentation Framework for ...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Zou, Bo Wang, Shaofeng Liu, Hao Sun, Gaoyue Wang, Yajie Zuo, FeiFei Quan, Chengbin Zhaot, Youjian Tsinghua Univ Beijing Peoples R China Capital Med Univ Beijing Peoples R China Imperial Coll London London England LargeV Inc Beijing Peoples R China Tsinghua Univ Zhongguancun Lab Beijing Peoples R China
Teeth localization, segmentation, and labeling in 2D images have great potential in modern dentistry to enhance dental diagnostics, treatment planning, and population-based studies on oral health. However, general ins... 详细信息
来源: 评论
ES3: Evolving Self-Supervised Learning of Robust Audio-Visual Speech Representations
ES<SUP>3</SUP>: Evolving Self-Supervised Learning of Robust ...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Zhang, Yuanhang Yang, Shuang Shan, Shiguang Chen, Xilin Chinese Acad Sci Inst Comp Technol Key Lab Intelligent Informat Proc Beijing 100190 Peoples R China Univ Chinese Acad Sci Beijing 100049 Peoples R China
We propose a novel strategy, ES3, for self-supervised learning of robust audio-visual speech representations from unlabeled talking face videos. While many recent approaches for this task primarily rely on guiding the... 详细信息
来源: 评论
Benchmarking Zero-Shot recognition with vision-Language Models: Challenges on Granularity and Specificity
Benchmarking Zero-Shot Recognition with Vision-Language Mode...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Xu, Zhenlin Zhu, Yi Deng, Siqi Mittal, Abhay Chen, Yanbei Wang, Manchen Favaro, Paolo Tighe, Joseph Modolo, Davide AWS AI Labs Seattle WA 98109 USA Boson AI Santa Clara CA 95054 USA Meta Menlo Pk CA USA
This paper presents novel benchmarks for evaluating vision-language models (VLMs) in zero-shot recognition, focusing on granularity and specificity. Although VLMs excel in tasks like image captioning, they face challe... 详细信息
来源: 评论
3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos
3DGStream: On-the-Fly Training of 3D Gaussians for Efficient...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Sun, Jiakai Jiao, Han Li, Guangyuan Zhang, Zhanjie Zhao, Lei Xing, Wei Zhejiang Univ Hangzhou Peoples R China
Constructing photo-realistic Free-Viewpoint Videos ( FVVs) of dynamic scenes from multi-view videos remains a challenging endeavor. Despite the remarkable advance-ments achieved by current neural rendering techniques,... 详细信息
来源: 评论
Three Pillars improving vision Foundation Model Distillation for Lidar
Three Pillars improving Vision Foundation Model Distillation...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Puy, Gilles Gidaris, Spyros Boulch, Alexandre Simeoni, Oriane Sautier, Corentin Perez, Patrick Bursucl, Andrei Marlet, Renaud Valeo ai Paris France Kyutai Paris France Univ Gustave Eiffel CNRS LIGM Ecole Ponts Marne La Vallee France
Self-supervised image backbones can be used to address complex 2D tasks (e.g., semantic segmentation, object discovery) very efficiently and with little or no downstream supervision. Ideally, 3D backbones for lidar sh... 详细信息
来源: 评论
360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries
360Loc: A Dataset and Benchmark for Omnidirectional Visual L...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Huang, Huajian Liu, Changkun Zhu, Yipeng Cheng, Hui Braud, Tristan Yeung, Sai-Kit Hong Kong Univ Sci & Technol Hong Kong Peoples R China Sun Yat Sen Univ Guangzhou Peoples R China
Portable 360 degrees cameras are becoming a cheap and efficient tool to establish large visual databases. By capturing omnidirectional views of a scene, these cameras could expedite building environment models that ar... 详细信息
来源: 评论
OpenEQA: Embodied Question Answering in the Era of Foundation Models
OpenEQA: Embodied Question Answering in the Era of Foundatio...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Majumdar, Arjun Ajay, Anurag Zhang, Xi Aohan Punya, Pranav Yenamandra, Sriram Henaff, Mikael Silwal, Sneha Mcvay, Paul Maksymets, Oleksandr Arnaud, Sergio Yadav, Karmesh Li, Qiyang Newman, Ben Sharma, Mohit Berges, Vincent Zhang, Shiqi Agrawal, Pulkit Bisk, Yonatan Batra, Dhruv Kalakrishnan, Mrinal Meier, Franziska Paxton, Chris Sax, Alexander Rajeswaran, Aravind Georgia Tech Atlanta GA 30332 USA MIT 77 Massachusetts Ave Cambridge MA 02139 USA SUNY Binghamton Binghamton NY USA Meta AI Menlo Pk CA USA Univ Calif Berkeley Berkeley CA USA CMU Pittsburgh PA USA Meta Fundamental AI Res FAIR Menlo Pk CA USA
We present a modern formulation of Embodied Question Answering (EQA) as the task of understanding an environment well enough to answer questions about it in natural language. An agent can achieve such an understanding... 详细信息
来源: 评论
Egocentric Whole-Body Motion Capture with FisheyeViT and Diffusion-Based Motion Refinement
Egocentric Whole-Body Motion Capture with FisheyeViT and Dif...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Wang, Jian Cao, Zhe Luvizon, Diogo Liu, Lingjie Sarkar, Kripasindhu Tang, Danhang Beeler, Thabo Theobalt, Christian MPI Informat & Saarland Informat Campus Saarbrucken Germany Google Mountain View CA USA Univ Penn Philadelphia PA USA Saarbrucken Res Ctr Visual Com Interact & Artific Saarbrucken Germany
In this work, we explore egocentric whole-body motion capture using a single fisheye camera, which simultaneously estimates human body and hand motion. This task presents significant challenges due to three factors: t... 详细信息
来源: 评论
HRVDA: High-Resolution Visual Document Assistant
HRVDA: High-Resolution Visual Document Assistant
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Liu, Chaohu Yin, Kun Cao, Haoyu Jiang, Xinghua Li, Xin Liu, Yinsong Jiang, Deqiang Sun, Xing Xu, Linli Univ Sci & Technol China Sch Comp Sci & Technol Hefei Anhui Peoples R China State Key Lab Cognit Intelligence Hefei Anhui Peoples R China Tencent YouTu Lab Shanghai Peoples R China
Leveraging vast training data, multimodal large language models (MLLMs) have demonstrated formidable general visual comprehension capabilities and achieved remarkable performance across various tasks. However, their p... 详细信息
来源: 评论
Seeing the Unseen: Visual Common Sense for Semantic Placement
Seeing the Unseen: Visual Common Sense for Semantic Placemen...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Ramrakhya, Ram Kembhavi, Aniruddha Batra, Dhruv Kira, Zsolt Zeng, Kuo-Hao Weihs, Luca Georgia Inst Technol Atlanta GA 30332 USA PRIOR Allen Inst AI Seattle WA USA PRIOR AI2 Seattle WA USA
computer vision tasks typically involve describing what is present in an image (e.g. classification, detection, segmentation, and captioning). We study a visual common sense task that requires understanding 'what ... 详细信息
来源: 评论