咨询与建议

限定检索结果

文献类型

  • 11,884 篇 会议
  • 5 篇 期刊文献

馆藏范围

  • 11,889 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 8,055 篇 工学
    • 7,613 篇 计算机科学与技术...
    • 796 篇 机械工程
    • 688 篇 电气工程
    • 356 篇 软件工程
    • 225 篇 控制科学与工程
    • 40 篇 光学工程
    • 19 篇 生物工程
    • 17 篇 信息与通信工程
    • 12 篇 生物医学工程(可授...
    • 6 篇 电子科学与技术(可...
    • 6 篇 建筑学
    • 6 篇 交通运输工程
    • 5 篇 仪器科学与技术
    • 5 篇 化学工程与技术
    • 5 篇 安全科学与工程
    • 4 篇 土木工程
  • 3,344 篇 医学
    • 3,343 篇 临床医学
    • 4 篇 基础医学(可授医学...
    • 4 篇 公共卫生与预防医...
  • 250 篇 理学
    • 198 篇 系统科学
    • 29 篇 物理学
    • 21 篇 生物学
    • 15 篇 数学
    • 9 篇 统计学(可授理学、...
    • 4 篇 化学
  • 17 篇 管理学
    • 12 篇 管理科学与工程(可...
    • 7 篇 图书情报与档案管...
    • 5 篇 工商管理
  • 3 篇 法学
    • 3 篇 社会学
  • 3 篇 教育学
    • 3 篇 教育学
  • 2 篇 农学
  • 1 篇 经济学
  • 1 篇 军事学

主题

  • 5,633 篇 computer vision
  • 2,668 篇 training
  • 2,203 篇 pattern recognit...
  • 1,747 篇 computational mo...
  • 1,502 篇 visualization
  • 1,360 篇 three-dimensiona...
  • 1,074 篇 semantics
  • 999 篇 benchmark testin...
  • 986 篇 codes
  • 959 篇 computer archite...
  • 891 篇 deep learning
  • 777 篇 conferences
  • 754 篇 task analysis
  • 700 篇 feature extracti...
  • 561 篇 transformers
  • 533 篇 face recognition
  • 527 篇 neural networks
  • 495 篇 object detection
  • 490 篇 image segmentati...
  • 468 篇 cameras

机构

  • 174 篇 univ sci & techn...
  • 145 篇 carnegie mellon ...
  • 144 篇 univ chinese aca...
  • 144 篇 tsinghua univ pe...
  • 134 篇 chinese univ hon...
  • 110 篇 zhejiang univ pe...
  • 109 篇 peng cheng lab p...
  • 99 篇 swiss fed inst t...
  • 91 篇 tsinghua univers...
  • 90 篇 shanghai ai lab ...
  • 87 篇 sensetime res pe...
  • 86 篇 shanghai jiao to...
  • 83 篇 zhejiang univers...
  • 82 篇 tech univ munich...
  • 79 篇 university of sc...
  • 79 篇 stanford univ st...
  • 78 篇 univ hong kong p...
  • 77 篇 australian natl ...
  • 76 篇 alibaba grp peop...
  • 75 篇 peng cheng labor...

作者

  • 75 篇 timofte radu
  • 64 篇 van gool luc
  • 50 篇 zhang lei
  • 43 篇 yang yi
  • 37 篇 loy chen change
  • 36 篇 tao dacheng
  • 32 篇 zhou jie
  • 31 篇 chen chen
  • 30 篇 liu yang
  • 30 篇 tian qi
  • 29 篇 sun jian
  • 29 篇 zha zheng-jun
  • 28 篇 li xin
  • 27 篇 qi tian
  • 26 篇 vasconcelos nuno
  • 25 篇 liu xiaoming
  • 25 篇 darrell trevor
  • 24 篇 zheng wei-shi
  • 24 篇 luo ping
  • 24 篇 ying shan

语言

  • 11,863 篇 英文
  • 25 篇 其他
  • 1 篇 中文
检索条件"任意字段=2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024"
11889 条 记 录,以下是151-160 订阅
排序:
Improving Image Restoration through Removing Degradations in Textual Representations
Improving Image Restoration through Removing Degradations in...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Lin, Jingbo Zhang, Zhilu Wei, Yuxiang Ren, Dongwei Jiang, Dongsheng Tian, Qi Zuo, Wangmeng Harbin Inst Technol Harbin Peoples R China Huawei Cloud Comp Co Ltd Shenzhen Peoples R China
In this paper, we introduce a new perspective for improving image restoration by removing degradation in the textual representations of a given degraded image. Intuitively, restoration is much easier on text modality ... 详细信息
来源: 评论
Unsupervised Video Domain Adaptation with Masked Pre-Training and Collaborative Self-Training
Unsupervised Video Domain Adaptation with Masked Pre-Trainin...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Reddy, Arun Paul, William Rivera, Corban Shah, Ketul de Melo, Celso M. Chellappa, Rama Johns Hopkins Univ Baltimore MD 21218 USA Johns Hopkins Univ Dept Elect & Comp Engn Baltimore MD USA DEVCOM US Army Res Lab Aberdeen Proving Ground MD USA
In this work, we tackle the problem of unsupervised domain adaptation (UDA) for video action recognition. Our approach, which we call UNITE, uses an image teacher model to adapt a video student model to the target dom... 详细信息
来源: 评论
Joint Physical-Digital Facial Attack Detection Via Simulating Spoofing Clues
Joint Physical-Digital Facial Attack Detection Via Simulatin...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: He, Xianhua Liang, Dashuang Yang, Song Hao, Zhanlong Ma, Hui Mao, Binjie Li, Xi Wang, Yao Yan, Pengfei Liu, Ajian Meituan Vis AI Dept Beijing Peoples R China MUST Taipa Macao Peoples R China CASIA MAIS Beijing Peoples R China
Face recognition systems are frequently subjected to a variety of physical and digital attacks of different types. Previous methods have achieved satisfactory performance in scenarios that address physical attacks and... 详细信息
来源: 评论
SleepVST: Sleep Staging from Near-Infrared Video Signals using Pre-Trained Transformers
SleepVST: Sleep Staging from Near-Infrared Video Signals usi...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Carter, Jonathan F. Jorge, Joao Gibson, Oliver Tarassenkol, Lionel Univ Oxford Inst Biomed Engn Oxford England Oxehealth Ltd Oxford England
Advances in camera-based physiological monitoring have enabled the robust, non-contact measurement of respiration and the cardiac pulse, which are known to be indicative of the sleep stage. This has led to research in... 详细信息
来源: 评论
Do vision and Language Encoders Represent the World Similarly?
Do Vision and Language Encoders Represent the World Similarl...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Maniparambil, Mayug Akshulakov, Raiymbek Djilali, Yasser Abdelaziz Dahou Seddik, Mohamed El Amine Narayan, Sanath Mangalam, Karttikeya O'Connor, Noel E. Dublin City Univ ML Labs Dublin Ireland Univ Calif Berkeley Berkeley CA 94720 USA Technol Innovat Inst Dublin Ireland
Aligned text-image encoders such as CLIP have become the de-facto model for vision-language tasks. Furthermore, modality-specific encoders achieve impressive performances in their respective domains. This raises a cen... 详细信息
来源: 评论
WALT3D: Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects under Occlusion
WALT3D: Generating Realistic Training Data from Time-Lapse I...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Khiem Vuong Reddy, N. Dinesh Tamburo, Robert Narasimhan, Srinivasa G. Carnegie Mellon Univ Pittsburgh PA 15213 USA Amazon Seattle WA USA
Current methods for 2D and 3D object understanding struggle with severe occlusions in busy urban environments, partly due to the lack of large-scale labeled groundtruth annotations for learning occlusion. In this work... 详细信息
来源: 评论
Cross-view and Cross-pose Completion for 3D Human Understanding
Cross-view and Cross-pose Completion for 3D Human Understand...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Armando, Matthieu Galaaoui, Salma Baradel, Fabien Lucas, Thomas Leroy, Vincent Bregier, Romain Weinzaepfel, Philippe Rogez, Gregory NAVER LABS Europe Meylan France
Human perception and understanding is a major domain of computer vision which, like many other vision subdomains recently, stands to gain from the use of large models pre-trained on large datasets. We hypothesize that... 详细信息
来源: 评论
Probing the 3D Awareness of Visual Foundation Models
Probing the 3D Awareness of Visual Foundation Models
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: El Banani, Mohamed Raj, Amit Maninis, Kevis-Kokitsi Kar, Abhishek Li, Yuanzhen Rubinstein, Michael Sun, Deqing Guibas, Leonidas Johnson, Justin Jampani, Varun Univ Michigan Ann Arbor MI 48109 USA Google Mountain View CA 94043 USA Stability AI London ON Canada
Recent advances in large-scale pretraining have yielded visual foundation models with strong capabilities. Not only can recent models generalize to arbitrary images for their training task, their intermediate represen... 详细信息
来源: 评论
Grounding Everything: Emerging Localization Properties in vision-Language Transformers
Grounding Everything: Emerging Localization Properties in Vi...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Bousselham, Walid Petersen, Felix Ferrari, Vittorio Kuehne, Hilde Univ Bonn Bonn Germany Goethe Univ Frankfurt Frankfurt Germany Stanford Univ Stanford CA 94305 USA Synthesia Io London England MIT IBM Watson AI Lab Cambridge MA USA
vision-language foundation models have shown remarkable performance in various zero-shot settings such as image retrieval, classification, or captioning. But so far, those models seem to fall behind when it comes to z... 详细信息
来源: 评论
A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives
A Backpack Full of Skills: Egocentric Video Understanding wi...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Peirone, Simone Alberto Pistilli, Francesca Alliegro, Antonio Averta, Giuseppe Politecnico Torino Turin Italy Ist Italiano Tecnol Genoa Italy
Human comprehension of a video stream is naturally broad: in a few instants, we are able to understand what is happening, the relevance and relationship of objects, and forecast what will follow in the near future, ev... 详细信息
来源: 评论