Search query: "Any field = 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020"
11,281 records; showing results 301-310

OTE: Exploring Accurate Scene Text Recognition Using One Token
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Xu, Jianjun; Wang, Yuxin; Xie, Hongtao; Zhang, Yongdong
Affiliations: Univ Sci & Technol China, Hefei, Peoples R China
In this paper, we propose a novel framework to fully exploit the potential of a single vector for scene text recognition (STR). Different from previous sequence-to-sequence methods that rely on a sequence of visual to...

Leveraging Vision-Language Models for Improving Domain Generalization in Image Classification
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Addepalli, Sravanti; Asokan, Ashish Ramayee; Sharma, Lakshay; Babu, R. Venkatesh
Affiliations: Indian Inst Sci, Vision & AI Lab, Bangalore, Karnataka, India
Vision-Language Models (VLMs) such as CLIP are trained on large amounts of image-text pairs, resulting in remarkable generalization across several data distributions. However, in several cases, their expensive trainin...

Lacunarity Pooling Layers for Plant Image Classification using Texture Analysis
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Mohan, Akshatha; Peeples, Joshua
Affiliations: Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77840, USA
Pooling layers (e.g., max and average) may overlook important information encoded in the spatial arrangement of pixel intensity and/or feature values. We propose a novel lacunarity pooling layer that aims to capture t...

IrrNet: Spatio-Temporal Segmentation guided Classification for Irrigation Mapping
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Hoque, Oishee Bintey
Affiliations: Univ Virginia, Dept Comp Sci, Charlottesville, VA 22903, USA
Irrigation systems can vary widely in scale, from small-scale subsistence farming to large commercial agriculture (see Fig. 1). The heterogeneity in irrigation practices and systems across different regions adds to th...

VCoder: Versatile Vision Encoders for Multimodal Large Language Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Jain, Jitesh; Yang, Jianwei; Shi, Humphrey
Affiliations: Georgia Tech, SHI Labs, Atlanta, GA 30332, USA; Microsoft Res, Redmond, WA, USA; Picsart AI Res PAIR, Atlanta, GA, USA
Humans possess the remarkable skill of Visual Perception, the ability to see and understand the seen, helping them make sense of the visual world and, in turn, reason. Multimodal Large Language Models (MLLM) have rece...

MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene Understanding
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Cao, Xu; Zhou, Tong; Ma, Yunsheng; Ye, Wenqian; Cui, Can; Tang, Kun; Cao, Zhipeng; Liang, Kaizhao; Wang, Ziran; Rehg, James M.; Zheng, Chao
Affiliations: Tencent T Lab, Palo Alto, CA 94306, USA; Univ Illinois, Champaign, IL, USA; Purdue Univ, W Lafayette, IN, USA; Univ Virginia, Charlottesville, VA, USA; SambaNova Syst Inc, Palo Alto, CA, USA
Vision-language generative AI has demonstrated remarkable promise for empowering cross-modal scene understanding of autonomous driving and high-definition (HD) map systems. However, current benchmark datasets lack mul...

TIM: A Time Interval Machine for Audio-Visual Action Recognition
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Chalk, Jacob; Huh, Jaesung; Kazakos, Evangelos; Zisserman, Andrew; Damen, Dima
Affiliations: Univ Bristol, Bristol, Avon, England; Univ Oxford, VGG, Oxford, England; Czech Tech Univ, Prague, Czech Republic
Diverse actions give rise to rich audio-visual signals in long videos. Recent works showcase that the two modalities of audio and video exhibit different temporal extents of events and distinct labels. We address the ...

DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Chen, Yangyi; Sikka, Karan; Cogswell, Michael; Ji, Heng; Divakaran, Ajay
Affiliations: SRI Int, Menlo Pk, CA 94025, USA; Univ Illinois, Champaign, IL 61820, USA
We present DRESS, a large vision language model (LVLM) that innovatively exploits Natural Language Feedback (NLF) from Large Language Models to enhance its alignment and interactions by addressing two key limitations ...

Prompting Vision Foundation Models for Pathology Image Analysis
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Yin, Chong; Liu, Siqi; Zhou, Kaiyang; Wong, Vincent Wai-Sun; Yuen, Pong C.
Affiliations: Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China; Chinese Univ Hong Kong, Shenzhen Res Inst Big Data, Shenzhen, Peoples R China; Chinese Univ Hong Kong, Dept Med & Therapeut, Hong Kong, Peoples R China
The rapid increase in cases of non-alcoholic fatty liver disease (NAFLD) in recent years has raised significant public concern. Accurately identifying tissue alteration regions is crucial for the diagnosis of NAFLD, b...

MaskCLR: Attention-Guided Contrastive Learning for Robust Action Representation Learning
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Abdelfattah, Mohamed; Hassan, Mariam; Alahi, Alexandre
Affiliations: Ecole Polytech Fed Lausanne EPFL, Lausanne, Switzerland
Current transformer-based skeletal action recognition models tend to focus on a limited set of joints and low-level motion patterns to predict action classes. This results in significant performance degradation under ...