咨询与建议

限定检索结果

文献类型

  • 50,479 篇 会议
  • 1,421 册 图书
  • 1,041 篇 期刊文献
  • 1 篇 学位论文

馆藏范围

  • 52,940 篇 电子文献
  • 4 种 纸本馆藏

日期分布

学科分类号

  • 31,811 篇 工学
    • 24,804 篇 计算机科学与技术...
    • 12,568 篇 软件工程
    • 5,153 篇 光学工程
    • 4,756 篇 电气工程
    • 4,436 篇 信息与通信工程
    • 4,257 篇 机械工程
    • 3,956 篇 控制科学与工程
    • 2,474 篇 生物工程
    • 1,728 篇 生物医学工程(可授...
    • 1,584 篇 仪器科学与技术
    • 1,317 篇 电子科学与技术(可...
    • 793 篇 化学工程与技术
    • 698 篇 安全科学与工程
    • 542 篇 交通运输工程
    • 379 篇 建筑学
    • 331 篇 土木工程
  • 11,839 篇 理学
    • 6,434 篇 物理学
    • 5,405 篇 数学
    • 2,761 篇 生物学
    • 1,910 篇 统计学(可授理学、...
    • 801 篇 化学
    • 669 篇 系统科学
  • 5,305 篇 医学
    • 5,094 篇 临床医学
    • 729 篇 基础医学(可授医学...
    • 459 篇 药学(可授医学、理...
  • 3,350 篇 管理学
    • 1,953 篇 图书情报与档案管...
    • 1,535 篇 管理科学与工程(可...
    • 479 篇 工商管理
  • 720 篇 艺术学
    • 718 篇 设计学(可授艺术学...
  • 428 篇 法学
    • 401 篇 社会学
  • 297 篇 农学
  • 197 篇 教育学
  • 163 篇 经济学
  • 63 篇 文学
  • 49 篇 军事学

主题

  • 17,385 篇 computer vision
  • 9,017 篇 pattern recognit...
  • 4,196 篇 training
  • 3,815 篇 feature extracti...
  • 3,134 篇 cameras
  • 2,870 篇 computational mo...
  • 2,789 篇 image segmentati...
  • 2,622 篇 visualization
  • 2,573 篇 shape
  • 2,533 篇 face recognition
  • 2,171 篇 robustness
  • 2,123 篇 computer science
  • 1,973 篇 object detection
  • 1,959 篇 computer archite...
  • 1,878 篇 layout
  • 1,853 篇 object recogniti...
  • 1,802 篇 three-dimensiona...
  • 1,725 篇 neural networks
  • 1,708 篇 humans
  • 1,691 篇 image recognitio...

机构

  • 165 篇 univ chinese aca...
  • 144 篇 tsinghua univers...
  • 136 篇 national laborat...
  • 108 篇 univ sci & techn...
  • 104 篇 zhejiang univers...
  • 100 篇 shanghai jiao to...
  • 95 篇 microsoft resear...
  • 94 篇 university of sc...
  • 86 篇 zhejiang univ pe...
  • 84 篇 shanghai ai lab ...
  • 74 篇 school of comput...
  • 69 篇 computer vision ...
  • 68 篇 peking univ peop...
  • 68 篇 chinese acad sci...
  • 65 篇 chinese univ hon...
  • 63 篇 institute of inf...
  • 62 篇 google res mount...
  • 61 篇 univ oxford oxfo...
  • 59 篇 univ toronto on
  • 57 篇 swiss fed inst t...

作者

  • 91 篇 van gool luc
  • 87 篇 umapada pal
  • 76 篇 zhang lei
  • 64 篇 lee seong-whan
  • 49 篇 vittorio murino
  • 42 篇 yang yi
  • 34 篇 nassir navab
  • 33 篇 li xin
  • 33 篇 jie yang
  • 32 篇 liu yang
  • 31 篇 escalera sergio
  • 31 篇 loy chen change
  • 30 篇 ling haibin
  • 30 篇 h. bischof
  • 29 篇 zhou jie
  • 29 篇 vasconcelos nuno
  • 29 篇 jan-michael frah...
  • 29 篇 hanqing lu
  • 28 篇 blumenstein mich...
  • 27 篇 jia yunde

语言

  • 51,871 篇 英文
  • 835 篇 其他
  • 241 篇 中文
  • 22 篇 土耳其文
  • 5 篇 西班牙文
  • 2 篇 日文
  • 2 篇 葡萄牙文
  • 2 篇 俄文
检索条件"任意字段=IEEE Conference on Computer Vision and Pattern Recognition"
52943 条 记 录,以下是71-80 订阅
排序:
One-Shot Open Affordance Learning with Foundation Models
One-Shot Open Affordance Learning with Foundation Models
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Li, Gen Sun, Deqing Sevilla-Lara, Laura Jampani, Varun Univ Edinburgh Edinburgh Midlothian Scotland Google Res Mountain View CA USA Stabil AI London England
We introduce One-shot Open Affordance Learning (OOAL), where a model is trained with just one example per base object category, but is expected to identify novel objects and affordances. While vision-language models e... 详细信息
来源: 评论
Deep Video Codec Control for vision Models
Deep Video Codec Control for Vision Models
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Reich, Christoph Debnath, Biplob Patel, Deep Prangemeier, Tim Cremers, Daniel Chakradhar, Srimat NEC Labs Amer Inc San Jose CA 95110 USA Tech Univ Munich Munich Germany Tech Univ Darmstadt Darmstadt Germany Munich Ctr Machine Learning MCML Munich Germany
Standardized lossy video coding is at the core of almost all real-world video processing pipelines. Rate control is used to enable standard codecs to adapt to different network bandwidth conditions or storage constrai... 详细信息
来源: 评论
Label Propagation for Zero-shot Classification with vision-Language Models
Label Propagation for Zero-shot Classification with Vision-L...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Stojnic, Vladan Kalantidis, Yannis Tolias, Giorgos Czech Tech Univ FEE VRG Prague Czech Republic NAVER LABS Europe Meylan France
vision-Language Models (VLMs) have demonstrated impressive performance on zero-shot classification, i.e. classification when provided merely with a list of class names. In this paper, we tackle the case of zero-shot c... 详细信息
来源: 评论
PromptSync: Bridging Domain Gaps in vision-Language Models through Class-Aware Prototype Alignment and Discrimination
PromptSync: Bridging Domain Gaps in Vision-Language Models t...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Khandelwal, Anant Glance AI Bangalore Karnataka India
The potential for zero-shot generalization in vision-language (V-L) models such as CLIP has spurred their widespread adoption in addressing numerous downstream tasks. Previous methods have employed test-time prompt tu... 详细信息
来源: 评论
Blur2Blur: Blur Conversion for Unsupervised Image Deblurring on Unknown Domains
Blur2Blur: Blur Conversion for Unsupervised Image Deblurring...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Bang-Dang Pham Phong Tran Anh Tran Cuong Pham Rang Nguyen Minh Hoai VinAI Res Hanoi Vietnam MBZUAI Abu Dhabi U Arab Emirates Posts & Telecommun Inst Tech Hanoi Vietnam Univ Adelaide Adelaide SA Australia
This paper presents an innovative framework designed to train an image deblurring algorithm tailored to a specific camera device. This algorithm works by transforming a blurry input image, which is challenging to debl... 详细信息
来源: 评论
PixelRNN: In-pixel Recurrent Neural Networks for End-to-end-optimized Perception with Neural Sensors
PixelRNN: In-pixel Recurrent Neural Networks for End-to-end-...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: So, Haley M. Bose, Laurie Dudek, Piotr Wetzstein, Gordon Stanford Univ Stanford CA 94305 USA Univ Manchester Manchester Lancs England
Conventional image sensors digitize high-resolution images at fast frame rates, producing a large amount of data that needs to be transmitted off the sensor for further processing. This is challenging for perception s... 详细信息
来源: 评论
Learning Optimized Low-Light Image Enhancement for Edge vision Tasks
Learning Optimized Low-Light Image Enhancement for Edge Visi...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Sharif, S. M. A. Myrzabekov, Azamat Khujaev, Nodirkhuja Tsoy, Roman Kim, Seongwan Lee, Jaeho LG Sciencepk Seoul South Korea
Low-light image enhancement (LLIE) has a significant role in edge vision applications (EVA). Despite its widespread practicability, the existing LLIE methods are impractical due to their high computational costs. This... 详细信息
来源: 评论
Frozen Feature Augmentation for Few-Shot Image Classification
Frozen Feature Augmentation for Few-Shot Image Classificatio...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Bar, Andreas Houlsby, Neil Dehghani, Mostafa Kumar, Manoj Google DeepMind London England Tech Univ Carolo Wilhelmina Braunschweig Braunschweig Germany
Training a linear classifier or lightweight model on top of pretrained vision model outputs, so-called 'frozen features', leads to impressive performance on a number of downstream few-shot tasks. Currently, fr... 详细信息
来源: 评论
Question Aware vision Transformer for Multimodal Reasoning
Question Aware Vision Transformer for Multimodal Reasoning
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Ganz, Roy Kittenplont, Yair Aberdam, Aviad Ben Avraham, Elad Nuriel, Oren Mazor, Shai Litmant, Ron Technion Haifa Israel AWS AI Labs Seattle WA 98019 USA
vision-Language (VL) models have gained significant research focus, enabling remarkable advances in multimodal reasoning. These architectures typically comprise a vision encoder, a Large Language Model (LLM), and a pr...
来源: 评论
A vision Check-up for Language Models
A Vision Check-up for Language Models
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Sharma, Pratyusha Shaham, Tamar Rott Baradad, Manel Fu, Stephanie Rodriguez-Munoz, Adrian Duggal, Shivam Isola, Phillip Torralba, Antonio MIT CSAIL Cambridge MA 02139 USA Univ Calif Berkeley Berkeley CA USA
What does learning to model relationships between strings teach Large Language Models (LLMs) about the visual world? We systematically evaluate LLMs' abilities to generate and recognize an assortment of visual con... 详细信息
来源: 评论