Search criteria: "Any field = IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops"
12,859 records; showing 401-410
CityDreamer: Compositional Generative Model of Unbounded 3D Cities
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Xie, Haozhe; Chen, Zhaoxi; Hong, Fangzhou; Liu, Ziwei
Affiliations: Nanyang Technol Univ S Lab Singapore Singapore
3D city generation is a desirable yet challenging task, since humans are more sensitive to structural distortions in urban environments. Additionally, generating 3D cities is more complex than 3D natural scenes since ...
Language-driven Grasp Detection
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: An Dinh Vuong; Minh Nhat Vu; Baoru Huang; Nghia Nguyen; Hieu Le; Thieu Vo; Anh Nguyen
Affiliations: FPT Software AI Ctr Hanoi Vietnam; TU Wien Automat Control Inst Vienna Austria; Imperial Coll London London England; Ton Duc Thang Univ Ho Chi Minh City Vietnam; Univ Liverpool Liverpool Merseyside England
Grasp detection is a persistent and intricate challenge with various industrial applications. Recently, many methods and datasets have been proposed to tackle the grasp detection problem. However, most of them do not ...
DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Zhao, Chen; Zhang, Tong; Dang, Zheng; Salzmann, Mathieu
Affiliations: Ecole Polytech Fed Lausanne Lausanne Switzerland; ClearSpace SA Renens Switzerland
Determining the relative pose of an object between two images is pivotal to the success of generalizable object pose estimation. Existing approaches typically approximate the continuous pose representation with a larg...
A case for using rotation invariant features in state of the art feature matchers
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Bokman, Georg; Kahl, Fredrik
Affiliations: Chalmers Univ Technol Gothenburg Sweden
The aim of this paper is to demonstrate that a state of the art feature matcher (LoFTR) can be made more robust to rotations by simply replacing the backbone CNN with a steerable CNN which is equivariant to translatio...
Adapting Short-Term Transformers for Action Detection in Untrimmed Videos
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Yang, Min; Gao, Huan; Guo, Ping; Wang, Limin
Affiliations: Nanjing Univ State Key Lab Novel Software Technol Nanjing Peoples R China; Inchitech Beijing Peoples R China; Intel Labs China Hillsboro OR USA; Shanghai AI Lab Shanghai Peoples R China
Vision Transformer (ViT) has shown high potential in video recognition, owing to its flexible design, adaptable self-attention mechanisms, and the efficacy of masked pretraining. Yet, it remains unclear how to adapt t...
Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Luo, Jiayun; Khandelwal, Siddhesh; Sigal, Leonid; Li, Boyang
Affiliations: Nanyang Technol Univ Singapore Singapore; Univ British Columbia Vector Inst AI Vancouver BC Canada
From image-text pairs, large-scale vision-language models (VLMs) learn to implicitly associate image regions with words, which prove effective for tasks like visual question answering. However, leveraging the learned ...
PracticalDG: Perturbation Distillation on Vision-Language Models for Hybrid Domain Generalization
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Chen, Zining; Wang, Weiqiu; Zhao, Zhicheng; Su, Fei; Men, Aidong; Meng, Hongying
Affiliations: Beijing Univ Posts & Telecommun Sch Artificial Intelligence Beijing Peoples R China; Beijing Key Lab Network Syst & Network Culture Beijing Peoples R China; Minist Culture & Tourism Key Lab Interact Technol & Experience Syst Beijing Peoples R China; Brunel Univ Uxbridge Uxbridge Middx England
Domain Generalization (DG) aims to resolve distribution shifts between source and target domains, and current DG methods are default to the setting that data from source and target domains share identical categories. ...
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Tong, Shengbang; Liu, Zhuang; Zhai, Yuexiang; Ma, Yi; Lecun, Yann; Xie, Saining
Affiliations: NYU New York NY 10003 USA; Meta FAIR Menlo Pk CA 94025 USA; Univ Calif Berkeley Berkeley CA USA
Is vision good enough for language? Recent advancements in multimodal models primarily stem from the powerful reasoning abilities of large language models (LLMs). However, the visual component typically depends only o...
Explaining CLIP's performance disparities on data from blind/low vision users
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Massiceti, Daniela; Longden, Camilla; Slowik, Agnieszka; Wills, Samuel; Grayson, Martin; Morrison, Cecily
Affiliations: Microsoft Res Redmond WA 98052 USA; World Bank 1818 H St NW Washington DC 20433 USA
Large multi-modal models (LMMs) hold the potential to usher in a new era of automated visual assistance for people who are blind or low vision (BLV). Yet, these models have not been systematically evaluated on data ca...
EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Language Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Cheng, Sijie; Guo, Zhicheng; Wu, Jingwen; Fang, Kechen; Li, Peng; Liu, Huaping; Liu, Yang
Affiliations: Tsinghua Univ Dept Comp Sci & Technol Beijing Peoples R China; Tsinghua Univ Inst AI Ind Res AIR Beijing Peoples R China; Univ Toronto Dept Elect & Comp Engn Toronto ON Canada; Tsinghua Univ Zhili Coll Beijing Peoples R China; 01 Ai Beijing Peoples R China
Vision-language models (VLMs) have recently shown promising results in traditional downstream tasks. Evaluation studies have emerged to assess their abilities, with the majority focusing on the third-person perspectiv...