咨询与建议

限定检索结果

文献类型

  • 23,136 篇 会议
  • 90 篇 期刊文献
  • 15 册 图书

馆藏范围

  • 23,240 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 13,631 篇 工学
    • 11,162 篇 计算机科学与技术...
    • 3,338 篇 软件工程
    • 2,414 篇 机械工程
    • 1,663 篇 光学工程
    • 1,203 篇 电气工程
    • 973 篇 控制科学与工程
    • 738 篇 信息与通信工程
    • 381 篇 仪器科学与技术
    • 322 篇 生物工程
    • 239 篇 生物医学工程(可授...
    • 188 篇 电子科学与技术(可...
    • 109 篇 化学工程与技术
    • 104 篇 安全科学与工程
    • 99 篇 测绘科学与技术
    • 85 篇 建筑学
    • 83 篇 交通运输工程
    • 82 篇 土木工程
    • 56 篇 力学(可授工学、理...
  • 3,696 篇 医学
    • 3,684 篇 临床医学
    • 76 篇 基础医学(可授医学...
  • 3,138 篇 理学
    • 1,880 篇 物理学
    • 1,605 篇 数学
    • 547 篇 统计学(可授理学、...
    • 466 篇 生物学
    • 243 篇 系统科学
    • 107 篇 化学
  • 491 篇 管理学
    • 290 篇 图书情报与档案管...
    • 212 篇 管理科学与工程(可...
    • 74 篇 工商管理
  • 252 篇 艺术学
    • 251 篇 设计学(可授艺术学...
  • 58 篇 法学
  • 38 篇 农学
  • 25 篇 教育学
  • 19 篇 经济学
  • 10 篇 军事学
  • 3 篇 文学

主题

  • 10,395 篇 computer vision
  • 3,892 篇 pattern recognit...
  • 3,101 篇 training
  • 2,104 篇 computational mo...
  • 1,898 篇 visualization
  • 1,799 篇 cameras
  • 1,487 篇 feature extracti...
  • 1,475 篇 three-dimensiona...
  • 1,464 篇 shape
  • 1,447 篇 image segmentati...
  • 1,287 篇 robustness
  • 1,234 篇 computer archite...
  • 1,213 篇 semantics
  • 1,112 篇 benchmark testin...
  • 1,111 篇 conferences
  • 1,104 篇 layout
  • 1,092 篇 object detection
  • 1,084 篇 computer science
  • 1,026 篇 codes
  • 907 篇 face recognition

机构

  • 137 篇 univ sci & techn...
  • 124 篇 univ chinese aca...
  • 121 篇 chinese univ hon...
  • 108 篇 tsinghua univers...
  • 108 篇 carnegie mellon ...
  • 105 篇 microsoft resear...
  • 97 篇 zhejiang univ pe...
  • 91 篇 swiss fed inst t...
  • 85 篇 university of sc...
  • 84 篇 zhejiang univers...
  • 81 篇 shanghai ai lab ...
  • 79 篇 university of ch...
  • 75 篇 shanghai jiao to...
  • 69 篇 microsoft res as...
  • 68 篇 alibaba grp peop...
  • 66 篇 adobe research
  • 65 篇 national laborat...
  • 64 篇 peking univ peop...
  • 61 篇 univ oxford oxfo...
  • 59 篇 peng cheng labor...

作者

  • 80 篇 van gool luc
  • 71 篇 timofte radu
  • 65 篇 zhang lei
  • 43 篇 luc van gool
  • 40 篇 yang yi
  • 37 篇 loy chen change
  • 34 篇 li stan z.
  • 33 篇 liu yang
  • 33 篇 xiaoou tang
  • 33 篇 murino vittorio
  • 33 篇 chen chen
  • 33 篇 qi tian
  • 33 篇 li fei-fei
  • 32 篇 tian qi
  • 32 篇 sun jian
  • 30 篇 ying shan
  • 30 篇 pascal fua
  • 29 篇 darrell trevor
  • 28 篇 li xin
  • 28 篇 hanqing lu

语言

  • 23,148 篇 英文
  • 66 篇 其他
  • 20 篇 中文
  • 5 篇 土耳其文
  • 2 篇 日文
检索条件"任意字段=IEEE/CVF Conference on Computer Vision and Pattern Recognition"
23241 条 记 录,以下是61-70 订阅
Contrasting intra-modal and ranking cross-modal hard negatives to enhance visio-linguistic compositional understanding
Contrasting intra-modal and ranking cross-modal hard negativ...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Zhang, Le Awal, Rabiul Agrawal, Aishwarya Mila Quebec AI Inst Montreal PQ Canada Univ Montreal Montreal PQ Canada Canada CIFAR AI Chair Montreal PQ Canada
vision-Language Models (VLMs), such as CLIP, exhibit strong image-text comprehension abilities, facilitating advances in several downstream tasks such as zero-shot image classification, image-text retrieval, and text-... 详细信息
来源: 评论
Generating Diverse Agricultural Data for vision-Based Farming Applications
Generating Diverse Agricultural Data for Vision-Based Farmin...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Cieslak, Mikolaj Govindarajan, Umabharathi Garcia, Alejandro Chandrashekar, Anuradha Haedrich, Torsten Mendoza-Drosik, Aleksander Michels, Dominik L. Pirk, Soeren Fu, Chia-Chun Palubicki, Wojciech GreenMatterAI Berlin Germany Blue River Technol Santa Clara CA USA King Abdullah Univ Sci & Technol Thuwal Saudi Arabia Tech Univ Darmstadt Darmstadt Germany Christian Albrecht Univ Kiel Kiel Germany Adam Mickiewicz Univ Poznan Poland
We present a specialized procedural model for generating synthetic agricultural scenes, focusing on soybean crops, along with various weeds. The model simulates distinct growth stages of these plants, diverse soil con...
来源: 评论
Learning Correlation Structures for vision Transformers
Learning Correlation Structures for Vision Transformers
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Kim, Manjin Seo, Paul Hongsuck Schmid, Cordelia Cho, Minsu POSTECH Pohang South Korea Korea Univ Seoul South Korea Google Res Mountain View CA USA
We introduce a new attention mechanism, dubbed structural self-attention (StructSA), that leverages rich correlation patterns naturally emerging in key-query interactions of attention. StructSA generates attention map... 详细信息
来源: 评论
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
RoDLA: Benchmarking the Robustness of Document Layout Analys...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Chen, Yufan Zhang, Jiaming Peng, Kunyu Zheng, Junwei Liu, Ruiping Torre, Philip Stiefelhagen, Rainer Karlsruhe Inst Technol Karlsruhe Germany Univ Oxford Oxford England
Before developing a Document Layout Analysis (DLA) model in real-world applications, conducting comprehensive robustness testing is essential. However, the robustness of DLA models remains underexplored in the literat... 详细信息
来源: 评论
PEEKABOO: Interactive Video Generation via Masked-Diffusion
PEEKABOO: Interactive Video Generation via Masked-Diffusion
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Jain, Yash Nasery, Anshul Vineet, Vibhav Behl, Harkirat Microsoft Redmond WA 98052 USA Univ Washington Seattle WA USA
Modern video generation models like Sora have achieved remarkable success in producing high-quality videos. However, a significant limitation is their inability to offer interactive control to users, a feature that pr... 详细信息
来源: 评论
Large-Scale Bidirectional Training for Zero-Shot Image Captioning
Large-Scale Bidirectional Training for Zero-Shot Image Capti...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Kim, Taehoon Marsden, Mark Ahn, Pyunghwan Kim, Sangyun Lee, Sihaeng Sala, Alessandra Kim, Seung Hwan LG AI Res Seoul South Korea Shutterstock New York NY USA
When trained on large-scale datasets, image captioning models can understand the content of images from a general domain but often fail to generate accurate, detailed captions. To improve performance, pretraining-and-... 详细信息
来源: 评论
HaLViT: Half of the Weights are Enough
HaLViT: Half of the Weights are Enough
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Koyun, Onur Can Toreyin, Behcet Ugur Istanbul Tech Univ Dept Artificial Intelligence & Data Engn Signal Proc Computat Intelligence Res Grp SP4CING Inst Informat Istanbul Turkiye
Deep learning architectures like Transformers and Convolutional Neural Networks (CNNs) have led to ground-breaking advances across numerous fields. However, their extensive need for parameters poses challenges for imp... 详细信息
来源: 评论
Token Transformation Matters: Towards Faithful Post-hoc Explanation for vision Transformer
Token Transformation Matters: Towards Faithful Post-hoc Expl...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Wu, Junyi Duan, Bin Kang, Weitai Tang, Hao Yan, Yan IIT Dept Comp Sci Chicago IL 60616 USA Carnegie Mellon Univ Robot Inst Pittsburgh PA 15213 USA
While Transformers have rapidly gained popularity in various computer vision applications, post-hoc explanations of their internal mechanisms remain largely unexplored. vision Transformers extract visual information b... 详细信息
来源: 评论
Honeybee: Locality-enhanced Projector for Multimodal LLM
Honeybee: Locality-enhanced Projector for Multimodal LLM
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Cha, Junbum Kang, Wooyoung Mun, Jonghwan Roh, Byungseok Kakao Brain Seongnam South Korea
In Multimodal Large Language Models (MLLMs), a visual projector plays a crucial role in bridging pre-trained vision encoders with LLMs, enabling profound visual understanding while harnessing the LLMs' robust capa... 详细信息
来源: 评论
Making vision Transformers Truly Shift-Equivariant
Making Vision Transformers Truly Shift-Equivariant
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Rojas-Gomez, Renan A. Lim, Teck-Yian Do, Minh N. Yeh, Raymond A. UIUC Dept Elect Engn Urbana IL 61801 USA UIUC VinUni Illinois Smart Hlth Ctr Urbana IL USA Purdue Univ Dept Comp Sci W Lafayette IN 47907 USA
In the field of computer vision, vision Transformers (ViTs) have emerged as a prominent deep learning architecture. Despite being inspired by Convolutional Neural Networks (CNNs), ViTs are susceptible to small spatial... 详细信息
来源: 评论