咨询与建议

限定检索结果

文献类型

  • 11,883 篇 会议
  • 5 篇 期刊文献

馆藏范围

  • 11,888 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 8,055 篇 工学
    • 7,613 篇 计算机科学与技术...
    • 796 篇 机械工程
    • 688 篇 电气工程
    • 356 篇 软件工程
    • 225 篇 控制科学与工程
    • 40 篇 光学工程
    • 19 篇 生物工程
    • 17 篇 信息与通信工程
    • 12 篇 生物医学工程(可授...
    • 6 篇 电子科学与技术(可...
    • 6 篇 建筑学
    • 6 篇 交通运输工程
    • 5 篇 仪器科学与技术
    • 5 篇 化学工程与技术
    • 5 篇 安全科学与工程
    • 4 篇 土木工程
  • 3,344 篇 医学
    • 3,343 篇 临床医学
    • 4 篇 基础医学(可授医学...
    • 4 篇 公共卫生与预防医...
  • 250 篇 理学
    • 198 篇 系统科学
    • 29 篇 物理学
    • 21 篇 生物学
    • 15 篇 数学
    • 9 篇 统计学(可授理学、...
    • 4 篇 化学
  • 17 篇 管理学
    • 12 篇 管理科学与工程(可...
    • 7 篇 图书情报与档案管...
    • 5 篇 工商管理
  • 3 篇 法学
    • 3 篇 社会学
  • 3 篇 教育学
    • 3 篇 教育学
  • 2 篇 农学
  • 1 篇 经济学
  • 1 篇 军事学

主题

  • 5,632 篇 computer vision
  • 2,668 篇 training
  • 2,203 篇 pattern recognit...
  • 1,746 篇 computational mo...
  • 1,502 篇 visualization
  • 1,360 篇 three-dimensiona...
  • 1,074 篇 semantics
  • 999 篇 benchmark testin...
  • 986 篇 codes
  • 959 篇 computer archite...
  • 891 篇 deep learning
  • 777 篇 conferences
  • 754 篇 task analysis
  • 699 篇 feature extracti...
  • 561 篇 transformers
  • 533 篇 face recognition
  • 527 篇 neural networks
  • 495 篇 object detection
  • 490 篇 image segmentati...
  • 468 篇 cameras

机构

  • 174 篇 univ sci & techn...
  • 145 篇 carnegie mellon ...
  • 144 篇 univ chinese aca...
  • 144 篇 tsinghua univ pe...
  • 134 篇 chinese univ hon...
  • 110 篇 zhejiang univ pe...
  • 109 篇 peng cheng lab p...
  • 99 篇 swiss fed inst t...
  • 91 篇 tsinghua univers...
  • 90 篇 shanghai ai lab ...
  • 87 篇 sensetime res pe...
  • 86 篇 shanghai jiao to...
  • 83 篇 zhejiang univers...
  • 82 篇 tech univ munich...
  • 79 篇 university of sc...
  • 79 篇 stanford univ st...
  • 78 篇 univ hong kong p...
  • 77 篇 australian natl ...
  • 76 篇 alibaba grp peop...
  • 75 篇 peng cheng labor...

作者

  • 75 篇 timofte radu
  • 64 篇 van gool luc
  • 50 篇 zhang lei
  • 43 篇 yang yi
  • 37 篇 loy chen change
  • 36 篇 tao dacheng
  • 32 篇 zhou jie
  • 31 篇 chen chen
  • 30 篇 liu yang
  • 30 篇 tian qi
  • 29 篇 sun jian
  • 29 篇 zha zheng-jun
  • 28 篇 li xin
  • 27 篇 qi tian
  • 26 篇 vasconcelos nuno
  • 25 篇 liu xiaoming
  • 25 篇 darrell trevor
  • 24 篇 zheng wei-shi
  • 24 篇 luo ping
  • 24 篇 ying shan

语言

  • 11,862 篇 英文
  • 25 篇 其他
  • 1 篇 中文
检索条件"任意字段=2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024"
11888 条 记 录,以下是71-80 订阅
排序:
Discriminative pattern Calibration Mechanism for Source-Free Domain Adaptation
Discriminative Pattern Calibration Mechanism for Source-Free...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Xia, Haifeng Xia, Siyu Ding, Zhengming Southeast Univ Sch Automat Dhaka Bangladesh Tulane Univ Dept Comp Sci New Orleans LA 70118 USA
Source-free domain adaptation (SFDA) assumes that model adaptation only accesses the well-learned source model and unlabeled target instances for knowledge transfer. However, cross-domain distribution shift easily tri...
来源: 评论
ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain vision Transformers
ALGM: Adaptive Local-then-Global Token Merging for Efficient...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Norouzi, Narges Orlova, Svetlana de Geus, Daan Dubbelman, Gijs Eindhoven Univ Technol Eindhoven Netherlands
This work presents Adaptive Local-then-Global Merging (ALGM), a token reduction method for semantic segmentation networks that use plain vision Transformers. ALGM merges tokens in two stages: (1) In the first network ... 详细信息
来源: 评论
Enhancing vision-Language Pre-training with Rich Supervisions
Enhancing Vision-Language Pre-training with Rich Supervision...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Gao, Yuan Shi, Kunyu Zhu, Pengkai Belval, Edouard Nuriel, Oren Appalaraju, Srikar Ghadar, Shabnam Tu, Zhuowen Mahadevan, Vijay Soatto, Stefano Stanford Univ Stanford CA 94305 USA AWS AI Labs Seattle WA USA Amazon Seattle WA 98109 USA
We propose Strongly Supervised pre-training with ScreenShots (S4) - a novel pre-training paradigm for vision-Language Models using data from large-scale web screenshot rendering. Using web screenshots unlocks a treasu... 详细信息
来源: 评论
Semantics-aware Motion Retargeting with vision-Language Models
Semantics-aware Motion Retargeting with Vision-Language Mode...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Zhang, Haodong Chen, Zhike Xu, Haocheng Hao, Lei Wu, Xiaofei Xu, Songcen Zhang, Zhensong Wang, Yue Xiong, Rong Zhejiang Univ Hangzhou Peoples R China Huawei Noahs Ark Lab Montreal PQ Canada
Capturing and preserving motion semantics is essential to motion retargeting between animation characters. However, most of the previous works neglect the semantic information or rely on human-designed joint-level rep... 详细信息
来源: 评论
Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations
Task-aligned Part-aware Panoptic Segmentation through Joint ...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: de Geus, Daan Dubbelman, Gijs Eindhoven Univ Technol Eindhoven Netherlands
Part-aware panoptic segmentation (PPS) requires (a) that each foreground object and background region in an image is segmented and classified, and (b) that all parts within foreground objects are segmented, classified... 详细信息
来源: 评论
Transcending the Limit of Local Window: Advanced Super-Resolution Transformer with Adaptive Token Dictionary
Transcending the Limit of Local Window: Advanced Super-Resol...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Zhang, Leheng Li, Yawei Zhou, Xingyu Zhao, Xiaorui Gu, Shuhang Univ Elect Sci & Technol China Chengdu Peoples R China Swiss Fed Inst Technol Comp Vis Lab Zurich Switzerland Swiss Fed Inst Technol Integrated Syst Lab Zurich Switzerland
Single Image Super-Resolution is a classic computer vision problem that involves estimating high-resolution (HR) images from low-resolution (LR) ones. Although deep neural networks (DNNs), especially Transformers for ... 详细信息
来源: 评论
ArGue: Attribute-Guided Prompt Tuning for vision-Language Models
ArGue: Attribute-Guided Prompt Tuning for Vision-Language Mo...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Tian, Xinyu Zou, Shu Yang, Zhaoyuan Zhang, Jing Australian Natl Univ Canberra ACT Australia GE Res Niskayuna NY USA
Although soft prompt tuning is effective in efficiently adapting vision-Language (V&L) models for downstream tasks, it shows limitations in dealing with distribution shifts. We address this issue with Attribute-Gu... 详细信息
来源: 评论
LLaFS: When Large Language Models Meet Few-Shot Segmentation
LLaFS: When Large Language Models Meet Few-Shot Segmentation
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Zhu, Lanyun Chen, Tianrun Ji, Deyi Ye, Jieping Liu, Jun Singapore Univ Technol & Design Singapore Singapore Zhejiang Univ Hangzhou Peoples R China Alibaba Grp Hangzhou Peoples R China
This paper proposes LLaFS, the first attempt to leverage large language models (LLMs) in few-shot segmentation. In contrast to the conventional few-shot segmentation methods that only rely on the limited and biased in... 详细信息
来源: 评论
3DInAction: Understanding Human Actions in 3D Point Clouds
3DInAction: Understanding Human Actions in 3D Point Clouds
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Ben-Shabat, Yizhak Shrout, Oren Gould, Stephen Australian Natl Univ Canberra ACT Australia Technion Israel Inst Technol Haifa Israel
We propose a novel method for 3D point cloud action recognition. Understanding human actions in RGB videos has been widely studied in recent years, however, its 3D point cloud counterpart remains under-explored despit... 详细信息
来源: 评论
You Only Need Less Attention at Each Stage in vision Transformers
You Only Need Less Attention at Each Stage in Vision Transfo...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Zhang, Shuoxi Liu, Hanpeng Lin, Stephen He, Kun Huazhong Univ Sci & Technol Wuhan Peoples R China Microsoft Res Asia Beijing Peoples R China
The advent of vision Transformers (ViTs) marks a substantial paradigm shift in the realm of computer vision. ViTs capture the global information of images through self-attention modules, which perform dot product comp... 详细信息
来源: 评论