Search query: "Any field = 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020"
11,281 records; showing 131-140
RegionGPT: Towards Region Understanding Vision Language Model
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Guo, Qiushan; De Mello, Shalini; Yin, Hongxu; Byeon, Wonmin; Cheung, Ka Chun; Yu, Yizhou; Luo, Ping; Liu, Sifei. Affiliations: Univ Hong Kong Hong Kong Peoples R China; NVIDIA San Francisco CA USA
Vision language models (VLMs) have experienced rapid advancements through the integration of large language models (LLMs) with image-text pairs, yet they struggle with detailed regional visual understanding due to lim...
Action Scene Graphs for Long-Form Understanding of Egocentric Videos
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Rodin, Ivan; Furnari, Antonino; Min, Kyle; Tripathi, Subarna; Farinella, Giovanni Maria. Affiliations: Univ Catania Catania Italy; Intel Labs Hillsboro OR USA
We present Egocentric Action Scene Graphs (EASGs), a new representation for long-form understanding of egocentric videos. EASGs extend standard manually-annotated representations of egocentric videos, such as verb-nou...
Efficient Test-Time Adaptation of Vision-Language Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Karmanov, Adilbek; Guan, Dayan; Lu, Shijian; El Saddik, Abdulmotaleb; Xing, Eric. Affiliations: Mohamed bin Zayed Univ Artificial Intelligence Abu Dhabi U Arab Emirates; Nanyang Technol Univ Singapore Singapore; Univ Ottawa Ottawa ON Canada; Carnegie Mellon Univ Pittsburgh PA 15213 USA
Test-time adaptation with pre-trained vision-language models has attracted increasing attention for tackling distribution shifts during the test time. Though prior studies have achieved very promising performance, the...
Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Lee, Sanghyeok; Choi, Joonmyung; Kim, Hyunwoo J. Affiliations: Korea Univ Dept Comp Sci & Engn Seoul South Korea
Vision Transformer (ViT) has emerged as a prominent backbone for computer vision. For more efficient ViTs, recent works lessen the quadratic cost of the self-attention layer by pruning or fusing the redundant tokens....
Towards 3D Vision with Low-Cost Single-Photon Cameras
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Mu, Fangzhou; Sifferman, Carter; Jungerman, Sacha; Li, Yiquan; Han, Mark; Gleicher, Michael; Gupta, Mohit; Li, Yin. Affiliations: Univ Wisconsin Madison WI 53706 USA
We present a method for reconstructing 3D shape of arbitrary Lambertian objects based on measurements by miniature, energy-efficient, low-cost single-photon cameras. These cameras, operating as time resolved image sen...
SkipPLUS: Skip the First Few Layers to Better Explain Vision Transformers
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Mehri, Faridoun; Fayyaz, Mohsen; Baghshah, Mahdieh Soleymani; Pilehvar, Mohammad Taher. Affiliations: Sharif Univ Technol Tehran Iran; Univ Tehran Tehran Iran; Cardiff Univ Cardiff Wales
Despite their remarkable performance, the explainability of Vision Transformers (ViTs) remains a challenge. While forward attention-based token attribution techniques have become popular in text processing, their suit...
Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Khan, Zaid; Fu, Yun. Affiliations: Northeastern Univ Boston MA 02115 USA
The goal of selective prediction is to allow a model to abstain when it may not be able to deliver a reliable prediction, which is important in safety-critical contexts. Existing approaches to selective prediction ...
Sequential Modeling Enables Scalable Learning for Large Vision Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Bai, Yutong; Geng, Xinyang; Mangalam, Karttikeya; Bar, Amir; Yuille, Alan L.; Darrell, Trevor; Malik, Jitendra; Efros, Alexei A. Affiliations: UC Berkeley BAIR Berkeley CA 94720 USA; Johns Hopkins Univ Baltimore MD 21218 USA
We introduce a novel sequential modeling approach which enables learning a Large Vision Model (LVM) without making use of any linguistic data. To do this, we define a common format, "visual sentences", in wh...
GRAM: Global Reasoning for Multi-Page VQA
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Blau, Tsachi; Fogel, Sharon; Ronen, Roi; Golts, Alona; Tsiper, Shahar; Ben Avraham, Elad; Aberdam, Aviad; Ganz, Roy; Litman, Ron. Affiliations: Technion Haifa Israel; AWS AI Labs Shanghai Peoples R China
The increasing use of transformer-based large language models brings forward the challenge of processing long sequences. In document visual question answering (DocVQA), leading methods focus on the single-page setting...
Emu Edit: Precise Image Editing via Recognition and Generation Tasks
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Sheynin, Shelly; Polyak, Adam; Singer, Uriel; Kirstain, Yuval; Zohar, Amit; Ashual, Oron; Parikh, Devi; Taigman, Yaniv. Affiliations: Meta GenAI Menlo Pk CA 94025 USA
Instruction-based image editing holds immense potential for a variety of applications, as it enables users to perform any editing operation using a natural language instruction. However, current models in this domain ...