咨询与建议

限定检索结果

文献类型

  • 11,886 篇 会议
  • 5 篇 期刊文献

馆藏范围

  • 11,891 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 8,060 篇 工学
    • 7,618 篇 计算机科学与技术...
    • 796 篇 机械工程
    • 688 篇 电气工程
    • 361 篇 软件工程
    • 228 篇 控制科学与工程
    • 41 篇 光学工程
    • 19 篇 生物工程
    • 17 篇 信息与通信工程
    • 12 篇 生物医学工程(可授...
    • 7 篇 交通运输工程
    • 6 篇 电子科学与技术(可...
    • 6 篇 建筑学
    • 5 篇 仪器科学与技术
    • 5 篇 化学工程与技术
    • 5 篇 安全科学与工程
    • 4 篇 土木工程
  • 3,347 篇 医学
    • 3,346 篇 临床医学
    • 4 篇 基础医学(可授医学...
    • 4 篇 公共卫生与预防医...
  • 254 篇 理学
    • 198 篇 系统科学
    • 32 篇 物理学
    • 21 篇 生物学
    • 19 篇 数学
    • 9 篇 统计学(可授理学、...
    • 7 篇 化学
  • 17 篇 管理学
    • 12 篇 管理科学与工程(可...
    • 7 篇 图书情报与档案管...
    • 5 篇 工商管理
  • 3 篇 法学
    • 3 篇 社会学
  • 3 篇 教育学
    • 3 篇 教育学
  • 2 篇 农学
  • 1 篇 经济学
  • 1 篇 军事学

主题

  • 5,633 篇 computer vision
  • 2,668 篇 training
  • 2,203 篇 pattern recognit...
  • 1,747 篇 computational mo...
  • 1,502 篇 visualization
  • 1,360 篇 three-dimensiona...
  • 1,074 篇 semantics
  • 999 篇 benchmark testin...
  • 986 篇 codes
  • 959 篇 computer archite...
  • 892 篇 deep learning
  • 777 篇 conferences
  • 754 篇 task analysis
  • 700 篇 feature extracti...
  • 561 篇 transformers
  • 533 篇 face recognition
  • 527 篇 neural networks
  • 495 篇 object detection
  • 490 篇 image segmentati...
  • 468 篇 cameras

机构

  • 174 篇 univ sci & techn...
  • 145 篇 carnegie mellon ...
  • 144 篇 univ chinese aca...
  • 144 篇 tsinghua univ pe...
  • 134 篇 chinese univ hon...
  • 110 篇 zhejiang univ pe...
  • 109 篇 peng cheng lab p...
  • 99 篇 swiss fed inst t...
  • 91 篇 tsinghua univers...
  • 90 篇 shanghai ai lab ...
  • 87 篇 sensetime res pe...
  • 86 篇 shanghai jiao to...
  • 83 篇 zhejiang univers...
  • 82 篇 tech univ munich...
  • 79 篇 university of sc...
  • 79 篇 stanford univ st...
  • 78 篇 univ hong kong p...
  • 77 篇 australian natl ...
  • 76 篇 alibaba grp peop...
  • 75 篇 peng cheng labor...

作者

  • 75 篇 timofte radu
  • 64 篇 van gool luc
  • 50 篇 zhang lei
  • 43 篇 yang yi
  • 37 篇 loy chen change
  • 36 篇 tao dacheng
  • 32 篇 zhou jie
  • 31 篇 chen chen
  • 30 篇 liu yang
  • 30 篇 tian qi
  • 29 篇 sun jian
  • 29 篇 zha zheng-jun
  • 28 篇 li xin
  • 27 篇 qi tian
  • 26 篇 vasconcelos nuno
  • 25 篇 liu xiaoming
  • 25 篇 darrell trevor
  • 24 篇 zheng wei-shi
  • 24 篇 luo ping
  • 24 篇 ying shan

语言

  • 11,850 篇 英文
  • 40 篇 其他
  • 1 篇 中文
检索条件"任意字段=2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024"
11891 条 记 录,以下是1401-1410 订阅
排序:
Event-guided Person Re-Identification via Sparse-Dense Complementary Learning
Event-guided Person Re-Identification via Sparse-Dense Compl...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Cao, Chengzhi Fu, Xueyang Liu, Hongjian Huang, Yukun Wang, Kunyu Luo, Jiebo Zha, Zheng-Jun Univ Sci & Technol China Hefei Peoples R China Univ Rochester Rochester NY 14627 USA
Video-based person re-identification (Re-ID) is a prominent computer vision topic due to its wide range of video surveillance applications. Most existing methods utilize spatial and temporal correlations in frame sequ... 详细信息
来源: 评论
SynthVSR: Scaling Up Visual Speech recognition With Synthetic Supervision
SynthVSR: Scaling Up Visual Speech Recognition With Syntheti...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Liu, Xubo Lakomkin, Egor Vougioukas, Konstantinos Ma, Pingchuan Chen, Honglie Xie, Ruiming Doulaty, Morrie Moritz, Niko Kolar, Jachym Petridis, Stavros Pantic, Maja Fuegen, Christian Univ Surrey Guildford Surrey England Meta AI New York NY 58051 USA
Recently reported state-of-the-art results in visual speech recognition (VSR) often rely on increasingly large amounts of video data, while the publicly available transcribed video datasets are limited in size. In thi... 详细信息
来源: 评论
IKEA Ego 3D Dataset: Understanding furniture assembly actions from ego-view 3D Point Clouds
IKEA Ego 3D Dataset: Understanding furniture assembly action...
收藏 引用
ieee/cvf Winter conference on Applications of computer vision (WACV)
作者: Ben-Shabat, Yizhak Paul, Jonathan Segev, Eviatar Shrout, Oren Gould, Stephen Australian Natl Univ Canberra ACT Australia Technion Israel Inst Technol Haifa Israel
We propose a novel dataset for ego-view 3D point cloud action recognition. While there has been extensive research on understanding human actions in RGB videos in recent years, the exploration of its 3D point cloud co... 详细信息
来源: 评论
A*: Atrous Spatial Temporal Action recognition for Real Time ApplicationsA*: Atrous Spatial Temporal Action recognition for Real Time Applications
A*: Atrous Spatial Temporal Action Recognition for Real Time...
收藏 引用
ieee/cvf Winter conference on Applications of computer vision (WACV)
作者: Kim, Myeongjun Spinola, Federica Benz, Philipp Kim, Tae-hoon Deeping Source Inc Seoul South Korea
Deep learning has become a popular tool across various fields and is increasingly being integrated into real-world applications such as autonomous driving cars and surveillance cameras. One area of active research is ... 详细信息
来源: 评论
Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images
Visual Language Pretrained Multiple Instance Zero-Shot Trans...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Lu, Ming Y. Chen, Bowen Zhang, Andrew Williamson, Drew F. K. Chen, Richard J. Ding, Tong Le, Long Phi Chuang, Yung-Sung Mahmood, Faisal MIT Cambridge MA 02139 USA Harvard Univ Cambridge MA 02138 USA Mass Gen Brigham Boston MA 02199 USA
Contrastive visual language pretraining has emerged as a powerful method for either training new language-aware image encoders or augmenting existing pretrained models with zero-shot visual recognition capabilities. H... 详细信息
来源: 评论
MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical vision Transformers
MixMAE: Mixed and Masked Autoencoder for Efficient Pretraini...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Liu, Jihao Huang, Xin Zheng, Jinliang Liu, Yu Li, Hongsheng CUHK MMLab Shenzhen Peoples R China SenseTime Res Hong Kong Peoples R China InnoHK CPII Hong Kong Peoples R China
In this paper, we propose Mixed and Masked AutoEncoder (MixMAE), a simple but efficient pretraining method that is applicable to various hierarchical vision Transformers. Existing masked image modeling (MIM) methods f... 详细信息
来源: 评论
Decomposed Soft Prompt Guided Fusion Enhancing for Compositional Zero-Shot Learning
Decomposed Soft Prompt Guided Fusion Enhancing for Compositi...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Lu, Xiaocheng Guo, Song Liu, Ziming Guo, Jingcai Hong Kong Polytech Univ Dept Comp Hong Kong Peoples R China Hong Kong Polytech Univ Shenzhen Res Inst Hong Kong Peoples R China
Compositional Zero-Shot Learning (CZSL) aims to recognize novel concepts formed by known states and objects during training. Existing methods either learn the combined state-object representation, challenging the gene... 详细信息
来源: 评论
Masked Image Training for Generalizable Deep Image Denoising
Masked Image Training for Generalizable Deep Image Denoising
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Chen, Haoyu Gu, Jinjin Liu, Yihao Magid, Salma Abdel Dong, Chao Wang, Qiong Pfister, Hanspeter Zhu, Lei Hong Kong Univ Sci & Technol Guangzhou Hong Kong Peoples R China Shanghai AI Lab Shanghai Peoples R China Univ Sydney Sydney Australia Chinese Acad Sci Shenzhen Inst Adv Technol ShenZhen Key Lab Comp Vis & Pattern Recognit Beijing Peoples R China Univ Chinese Acad Sci Beijing Peoples R China Chinese Acad Sci Shenzhen Inst Adv Technol Guangdong Prov Key Lab Comp Vision & Virtual Rea Beijing Peoples R China Harvard Univ Cambridge MA USA Hong Kong Univ Sci & Technol Hong Kong Peoples R China
When capturing and storing images, devices inevitably introduce noise. Reducing this noise is a critical task called image denoising. Deep learning has become the de facto method for image denoising, especially with t... 详细信息
来源: 评论
Advancing Visual Grounding with Scene Knowledge: Benchmark and Method
Advancing Visual Grounding with Scene Knowledge: Benchmark a...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Chen, Zhihong Zhang, Ruifei Song, Yibing Wan, Xiang Li, Guanbin Chinese Univ Hong Kong Shenzhen Peoples R China Sun Yat Sen Univ Guangzhou Peoples R China Shenzhen Res Inst Big Data Shenzhen Peoples R China Tencent AI Lab Shenzhen Peoples R China Fudan Univ AI3 Inst Shanghai Peoples R China
Visual grounding (VG) aims to establish fine-grained alignment between vision and language. Ideally, it can be a testbed for vision-and-language models to evaluate their understanding of the images and texts and their... 详细信息
来源: 评论
Context-aware Alignment and Mutual Masking for 3D-Language Pre-training
Context-aware Alignment and Mutual Masking for 3D-Language P...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Jin, Zhao Hayat, Munawar Yang, Yuwei Guo, Yulan Lei, Yinjie Sichuan Univ Chengdu Peoples R China Monash Univ Melbourne Vic Australia Sun Yat Sen Univ Guangzhou Peoples R China
3D visual language reasoning plays an important role in effective human-computer interaction. The current approaches for 3D visual reasoning are task-specific, and lack pre-training methods to learn generic representa... 详细信息
来源: 评论