咨询与建议

限定检索结果

文献类型

  • 20,860 篇 会议
  • 105 篇 期刊文献
  • 43 册 图书

馆藏范围

  • 21,007 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 13,620 篇 工学
    • 11,056 篇 计算机科学与技术...
    • 2,652 篇 机械工程
    • 2,252 篇 软件工程
    • 914 篇 光学工程
    • 885 篇 电气工程
    • 529 篇 控制科学与工程
    • 477 篇 信息与通信工程
    • 216 篇 测绘科学与技术
    • 135 篇 生物工程
    • 127 篇 生物医学工程(可授...
    • 98 篇 电子科学与技术(可...
    • 92 篇 仪器科学与技术
    • 46 篇 安全科学与工程
    • 40 篇 建筑学
    • 40 篇 化学工程与技术
    • 39 篇 土木工程
    • 37 篇 交通运输工程
    • 35 篇 力学(可授工学、理...
    • 33 篇 航空宇航科学与技...
  • 3,494 篇 医学
    • 3,489 篇 临床医学
    • 32 篇 基础医学(可授医学...
  • 2,247 篇 理学
    • 1,145 篇 物理学
    • 1,081 篇 数学
    • 401 篇 生物学
    • 384 篇 统计学(可授理学、...
    • 245 篇 系统科学
    • 46 篇 化学
  • 343 篇 管理学
    • 176 篇 管理科学与工程(可...
    • 168 篇 图书情报与档案管...
    • 34 篇 工商管理
  • 31 篇 法学
  • 19 篇 农学
  • 15 篇 教育学
  • 8 篇 经济学
  • 5 篇 艺术学
  • 2 篇 军事学
  • 1 篇 文学

主题

  • 8,141 篇 computer vision
  • 2,886 篇 training
  • 2,841 篇 pattern recognit...
  • 1,809 篇 computational mo...
  • 1,715 篇 visualization
  • 1,493 篇 cameras
  • 1,433 篇 three-dimensiona...
  • 1,433 篇 feature extracti...
  • 1,366 篇 shape
  • 1,360 篇 face recognition
  • 1,243 篇 image segmentati...
  • 1,135 篇 robustness
  • 1,124 篇 semantics
  • 992 篇 computer archite...
  • 985 篇 object detection
  • 982 篇 layout
  • 959 篇 benchmark testin...
  • 935 篇 codes
  • 900 篇 computer science
  • 898 篇 object recogniti...

机构

  • 174 篇 univ sci & techn...
  • 158 篇 univ chinese aca...
  • 153 篇 carnegie mellon ...
  • 145 篇 chinese univ hon...
  • 109 篇 microsoft resear...
  • 103 篇 zhejiang univ pe...
  • 99 篇 swiss fed inst t...
  • 95 篇 tsinghua univers...
  • 90 篇 microsoft res as...
  • 90 篇 tsinghua univ pe...
  • 88 篇 shanghai ai lab ...
  • 81 篇 zhejiang univers...
  • 77 篇 alibaba grp peop...
  • 74 篇 hong kong univ s...
  • 73 篇 university of sc...
  • 72 篇 peking univ peop...
  • 72 篇 university of ch...
  • 68 篇 shanghai jiao to...
  • 66 篇 univ oxford oxfo...
  • 65 篇 google res mount...

作者

  • 80 篇 van gool luc
  • 70 篇 zhang lei
  • 58 篇 timofte radu
  • 48 篇 yang yi
  • 47 篇 luc van gool
  • 46 篇 xiaoou tang
  • 44 篇 tian qi
  • 43 篇 darrell trevor
  • 42 篇 loy chen change
  • 42 篇 sun jian
  • 41 篇 qi tian
  • 40 篇 li stan z.
  • 38 篇 li fei-fei
  • 37 篇 chen xilin
  • 36 篇 shan shiguang
  • 35 篇 zhou jie
  • 35 篇 vasconcelos nuno
  • 35 篇 liu yang
  • 35 篇 torralba antonio
  • 34 篇 liu xiaoming

语言

  • 20,982 篇 英文
  • 10 篇 中文
  • 7 篇 其他
  • 5 篇 土耳其文
  • 2 篇 日文
  • 2 篇 葡萄牙文
检索条件"任意字段=2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016"
21008 条 记 录,以下是461-470 订阅
排序:
PromptSync: Bridging Domain Gaps in vision-Language Models through Class-Aware Prototype Alignment and Discrimination
PromptSync: Bridging Domain Gaps in Vision-Language Models t...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Khandelwal, Anant Glance AI Bangalore Karnataka India
The potential for zero-shot generalization in vision-language (V-L) models such as CLIP has spurred their widespread adoption in addressing numerous downstream tasks. Previous methods have employed test-time prompt tu... 详细信息
来源: 评论
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Panda-70M: Captioning 70M Videos with Multiple Cross-Modalit...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Chen, Tsai-Shien Siarohin, Aliaksandr Menapace, Willi Deyneka, Ekaterina Chao, Hsiang-wei Jeon, Byung Eun Fang, Yuwei Lee, Hsin-Ying Ren, Jian Yang, Ming-Hsuan Tulyakov, Sergey Snap Inc Santa Monica CA 90405 USA Univ Calif Merced Merced CA 95343 USA Univ Trento Trento Italy Snap Santa Monica CA USA
The quality of the data and annotation upper-bounds the quality of a downstream model. While there exist large text corpora and image-text pairs, high-quality video-text data is much harder to collect. First of all, m... 详细信息
来源: 评论
Multi-criteria Token Fusion with One-step-ahead Attention for Efficient vision Transformers
Multi-criteria Token Fusion with One-step-ahead Attention fo...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Lee, Sanghyeok Choi, Joonmyung Kim, Hyunwoo J. Korea Univ Dept Comp Sci & Engn Seoul South Korea
vision Transformer (ViT) has emerged as a prominent backbone for computer vision. For more efficient ViTs, recent works lessen the quadratic cost of the self- attention layer by pruning or fusing the redundant tokens.... 详细信息
来源: 评论
Doubly Right Object recognition: A Why Prompt for Visual Rationales
Doubly Right Object Recognition: A Why Prompt for Visual Rat...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Mao, Chengzhi Teotial, Revant Sundari, Amrutha Menon, Sachit Yang, Junfeng Wang, Xin Vondrick, Carl Columbia Univ New York NY 10027 USA Microsoft Res Redmond WA USA
Many visual recognition models are evaluated only on their classification accuracy, a metric for which they obtain strong performance. In this paper, we investigate whether computer vision models can also provide corr... 详细信息
来源: 评论
SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction
SIFU: Side-view Conditioned Implicit Function for Real-world...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Zhang, Zechuan Yang, Zongxin Yang, Yi Zhejiang Univ CCAL ReLER Hangzhou Peoples R China
Creating high-quality 3D models of clothed humans from single images for real-world applications is crucial. Despite recent advancements, accurately reconstructing humans in complex poses or with loose clothing from i... 详细信息
来源: 评论
Making Visual Sense of Oracle Bones for You and Me
Making Visual Sense of Oracle Bones for You and Me
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Qiao, Runqi Yang, Lan Pang, Kaiyue Zhang, Honggang Beijing Univ Posts & Telecommun Sch Artificial Intelligence Beijing Peoples R China Univ Surrey CVSSP SketchX Guildford Surrey England
Visual perception evolves over time. This is particularly the case of oracle bone scripts, where visual glyphs seem intuitive to people from distant past prove difficult to be understood in contemporary eyes. While se... 详细信息
来源: 评论
What, when, and where? Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions
What, when, and where? Self-Supervised Spatio-Temporal Groun...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Chen, Brian Shvetsova, Nina Rouditchenko, Andrew Kondermann, Daniel Thomas, Samuel Chang, Shih-Fu Feris, Rogerio Glass, James Kuehne, Hilde Columbia Univ New York NY 10027 USA Goethe Univ Frankfurt Frankfurt Germany Univ Bonn Bonn Germany MIT CSAIL Cambridge MA USA Qual Match GmbH Heidelberg Germany IBM Res AI Yorktown Hts NY USA MIT IBM Watson Lab Cambridge MA USA
Spatio-temporal grounding describes the task of localizing events in space and time, e.g., in video data, based on verbal descriptions only. Models for this task are usually trained with human-annotated sentences and ... 详细信息
来源: 评论
Single-Model and Any-Modality for Video Object Tracking
Single-Model and Any-Modality for Video Object Tracking
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Wu, Zongwei Zheng, Jilai Ren, Xiangxuan Vasluianu, Florin-Alexandru Ma, Chao Paudel, Danda Pani Luc Van Gool Timofte, Radu Univ Wurzburg Comp Vision Lab CAIDAS & IFI Wurzburg Germany Shanghai Jiao Tong Univ AI Inst Shanghai Peoples R China Sofia Univ INSAIT Sofia Bulgaria Swiss Fed Inst Technol CVL Zurich Switzerland
In the realm of video object tracking, auxiliary modalities such as depth, thermal, or event data have emerged as valuable assets to complement the RGB trackers. In practice, most existing RGB trackers learn a single ...
来源: 评论
Behavioral Analysis of vision-and-Language Navigation Agents
Behavioral Analysis of Vision-and-Language Navigation Agents
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Yang, Zijiao Majumdar, Arjun Lee, Stefan Oregon State Univ Corvallis OR 97331 USA Georgia Inst Technol Atlanta GA USA
To be successful, vision-and-Language Navigation (VLN) agents must be able to ground instructions to actions based on their surroundings. In this work, we develop a methodology to study agent behavior on a skill-speci... 详细信息
来源: 评论
Deep Video Codec Control for vision Models
Deep Video Codec Control for Vision Models
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Reich, Christoph Debnath, Biplob Patel, Deep Prangemeier, Tim Cremers, Daniel Chakradhar, Srimat NEC Labs Amer Inc San Jose CA 95110 USA Tech Univ Munich Munich Germany Tech Univ Darmstadt Darmstadt Germany Munich Ctr Machine Learning MCML Munich Germany
Standardized lossy video coding is at the core of almost all real-world video processing pipelines. Rate control is used to enable standard codecs to adapt to different network bandwidth conditions or storage constrai... 详细信息
来源: 评论