咨询与建议

限定检索结果

文献类型

  • 11,886 篇 会议
  • 5 篇 期刊文献

馆藏范围

  • 11,891 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 8,060 篇 工学
    • 7,618 篇 计算机科学与技术...
    • 796 篇 机械工程
    • 688 篇 电气工程
    • 361 篇 软件工程
    • 228 篇 控制科学与工程
    • 41 篇 光学工程
    • 19 篇 生物工程
    • 17 篇 信息与通信工程
    • 12 篇 生物医学工程(可授...
    • 7 篇 交通运输工程
    • 6 篇 电子科学与技术(可...
    • 6 篇 建筑学
    • 5 篇 仪器科学与技术
    • 5 篇 化学工程与技术
    • 5 篇 安全科学与工程
    • 4 篇 土木工程
  • 3,347 篇 医学
    • 3,346 篇 临床医学
    • 4 篇 基础医学(可授医学...
    • 4 篇 公共卫生与预防医...
  • 254 篇 理学
    • 198 篇 系统科学
    • 32 篇 物理学
    • 21 篇 生物学
    • 19 篇 数学
    • 9 篇 统计学(可授理学、...
    • 7 篇 化学
  • 17 篇 管理学
    • 12 篇 管理科学与工程(可...
    • 7 篇 图书情报与档案管...
    • 5 篇 工商管理
  • 3 篇 法学
    • 3 篇 社会学
  • 3 篇 教育学
    • 3 篇 教育学
  • 2 篇 农学
  • 1 篇 经济学
  • 1 篇 军事学

主题

  • 5,633 篇 computer vision
  • 2,668 篇 training
  • 2,203 篇 pattern recognit...
  • 1,747 篇 computational mo...
  • 1,502 篇 visualization
  • 1,360 篇 three-dimensiona...
  • 1,074 篇 semantics
  • 999 篇 benchmark testin...
  • 986 篇 codes
  • 959 篇 computer archite...
  • 892 篇 deep learning
  • 777 篇 conferences
  • 754 篇 task analysis
  • 700 篇 feature extracti...
  • 561 篇 transformers
  • 533 篇 face recognition
  • 527 篇 neural networks
  • 495 篇 object detection
  • 490 篇 image segmentati...
  • 468 篇 cameras

机构

  • 174 篇 univ sci & techn...
  • 145 篇 carnegie mellon ...
  • 144 篇 univ chinese aca...
  • 144 篇 tsinghua univ pe...
  • 134 篇 chinese univ hon...
  • 110 篇 zhejiang univ pe...
  • 109 篇 peng cheng lab p...
  • 99 篇 swiss fed inst t...
  • 91 篇 tsinghua univers...
  • 90 篇 shanghai ai lab ...
  • 87 篇 sensetime res pe...
  • 86 篇 shanghai jiao to...
  • 83 篇 zhejiang univers...
  • 82 篇 tech univ munich...
  • 79 篇 university of sc...
  • 79 篇 stanford univ st...
  • 78 篇 univ hong kong p...
  • 77 篇 australian natl ...
  • 76 篇 alibaba grp peop...
  • 75 篇 peng cheng labor...

作者

  • 75 篇 timofte radu
  • 64 篇 van gool luc
  • 50 篇 zhang lei
  • 43 篇 yang yi
  • 37 篇 loy chen change
  • 36 篇 tao dacheng
  • 32 篇 zhou jie
  • 31 篇 chen chen
  • 30 篇 liu yang
  • 30 篇 tian qi
  • 29 篇 sun jian
  • 29 篇 zha zheng-jun
  • 28 篇 li xin
  • 27 篇 qi tian
  • 26 篇 vasconcelos nuno
  • 25 篇 liu xiaoming
  • 25 篇 darrell trevor
  • 24 篇 zheng wei-shi
  • 24 篇 luo ping
  • 24 篇 ying shan

语言

  • 11,849 篇 英文
  • 41 篇 其他
  • 1 篇 中文
检索条件"任意字段=2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024"
11891 条 记 录,以下是1411-1420 订阅
排序:
Context-aware Alignment and Mutual Masking for 3D-Language Pre-training
Context-aware Alignment and Mutual Masking for 3D-Language P...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Jin, Zhao Hayat, Munawar Yang, Yuwei Guo, Yulan Lei, Yinjie Sichuan Univ Chengdu Peoples R China Monash Univ Melbourne Vic Australia Sun Yat Sen Univ Guangzhou Peoples R China
3D visual language reasoning plays an important role in effective human-computer interaction. The current approaches for 3D visual reasoning are task-specific, and lack pre-training methods to learn generic representa... 详细信息
来源: 评论
Joint Depth Prediction and Semantic Segmentation with Multi-View SAM
Joint Depth Prediction and Semantic Segmentation with Multi-...
收藏 引用
ieee/cvf Winter conference on Applications of computer vision (WACV)
作者: Shvets, Mykhailo Zhao, Dongxu Niethammer, Marc Sengupta, Roni Berg, Alexander C. Univ North Carolina Chapel Hill NC 27515 USA Univ Calif Irvine Irvine CA USA
Multi-task approaches to joint depth and segmentation prediction are well-studied for monocular images. Yet, predictions from a single-view are inherently limited, while multiple views are available in many robotics a... 详细信息
来源: 评论
CVT-SLR: Contrastive Visual-Textual Transformation for Sign Language recognition with Variational Alignment
CVT-SLR: Contrastive Visual-Textual Transformation for Sign ...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Zheng, Jiangbin Wang, Yile Tan, Cheng Li, Siyuan Wang, Ge Xia, Jun Chen, Yidong Li, Stan Z. Westlake Univ AI Lab Res Ctr Ind Future Hangzhou Peoples R China Tsinghua Univ Inst AI Ind Res AIR Beijing Peoples R China Xiamen Univ Sch Informat Xiamen Peoples R China
Sign language recognition (SLR) is a weakly supervised task that annotates sign videos as textual glosses. Recent studies show that insufficient training caused by the lack of large-scale available sign datasets becom... 详细信息
来源: 评论
PGVT: Pose-Guided Video Transformer for Fine-Grained Action recognition
PGVT: Pose-Guided Video Transformer for Fine-Grained Action ...
收藏 引用
ieee/cvf Winter conference on Applications of computer vision (WACV)
作者: Zhang, Haosong Leong, Mei Chee Li, Liyuan Lin, Weisi ASTAR Inst Infocomm Res I2R Singapore Singapore Nanyang Technol Univ Singapore Singapore
Based on recent advancements in transformer-based video models and multi-modal joint learning, we propose a novel model, named Pose-Guided Video Transformer (PGVT), to incorporate sparse high-level body joints locatio... 详细信息
来源: 评论
Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information
Towards All-in-one Pre-training via Maximizing Multi-modal M...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Su, Weijie Zhu, Xizhou Tao, Chenxin Lu, Lewei Li, Bin Huang, Gao Qiao, Yu Wang, Xiaogang Zhou, Jie Dai, Jifeng Univ Sci & Technol China Beijing Peoples R China SenseTime Res Hong Kong Peoples R China Tsinghua Univ Beijing Peoples R China Shanghai Artificial Intelligence Lab Shanghai Peoples R China Chinese Univ Hong Kong Hong Kong Peoples R China
To effectively exploit the potential of large-scale models, various pre-training strategies supported by massive data from different sources are proposed, including supervised pre-training, weakly-supervised pre-train... 详细信息
来源: 评论
Semantic-aware Video Representation for Few-shot Action recognition
Semantic-aware Video Representation for Few-shot Action Reco...
收藏 引用
ieee/cvf Winter conference on Applications of computer vision (WACV)
作者: Tang, Yutao Bejar, Benjamin Vidal, Rene Johns Hopkins Univ Baltimore MD 21218 USA Paul Scherrer Inst Wurenlingen Switzerland Univ Penn Philadelphia PA USA
Recent work on action recognition leverages 3D features and textual information to achieve state-of-the-art performance. However, most of the current few-shot action recognition methods still rely on 2D frame-level re... 详细信息
来源: 评论
Towards Trustable Skin Cancer Diagnosis via Rewriting Model's Decision
Towards Trustable Skin Cancer Diagnosis via Rewriting Model'...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Yan, Siyuan Yu, Zhen Zhang, Xuelin Mahapatra, Dwarikanath Chandra, Shekhar S. Janda, Monaca Soyer, Peter Ge, Zongyuan Monash Univ Clayton Vic Australia Monash Med AI Grp Melbourne Vic Australia Univ Queensland Brisbane Qld Australia Incept Inst AI Abu Dhabi U Arab Emirates
Deep neural networks have demonstrated promising performance on image recognition tasks. However, they may heavily rely on confounding factors, using irrelevant artifacts or bias within the dataset as the cue to impro... 详细信息
来源: 评论
Siamese Image Modeling for Self-Supervised vision Representation Learning
Siamese Image Modeling for Self-Supervised Vision Representa...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Tao, Chenxin Zhu, Xizhou Su, Weijie Huang, Gao Li, Bin Zhou, Jie Qiao, Yu Wang, Xiaogang Dai, Jifeng Tsinghua Univ Beijing Peoples R China SenseTime Res Hong Kong Peoples R China Univ Sci & Technol China Hefei Peoples R China Shanghai Artificial Intelligence Lab Shanghai Peoples R China Chinese Univ Hong Kong Hong Kong Peoples R China
Self-supervised learning (SSL) has delivered superior performance on a variety of downstream vision tasks. Two main-stream SSL frameworks have been proposed, i.e., Instance Discrimination (ID) and Masked Image Modelin... 详细信息
来源: 评论
ProcSim: Proxy-based Confidence for Robust Similarity Learning
ProcSim: Proxy-based Confidence for Robust Similarity Learni...
收藏 引用
ieee/cvf Winter conference on Applications of computer vision (WACV)
作者: Barbany, Oriol Lin, Xiaofan Bastan, Muhammet Dhua, Arnab CSIC UPC Inst Robot & Informat Ind Barcelona Spain Amazon Visual Search AR Seattle WA USA Amazon Seattle WA USA
Deep Metric Learning (DML) methods aim at learning an embedding space in which distances are closely related to the inherent semantic similarity of the inputs. Previous studies have shown that popular benchmark datase... 详细信息
来源: 评论
Hint-Aug: Drawing Hints from Foundation vision Transformers towards Boosted Few-shot Parameter-Efficient Tuning
Hint-Aug: Drawing Hints from Foundation Vision Transformers ...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Yu, Zhongzhi Wu, Shang Fu, Yonggan Zhang, Shunyao Lin, Yingyan (Celine) Georgia Inst Technol Atlanta GA 30332 USA Rice Univ Houston TX USA
Despite the growing demand for tuning foundation vision transformers (FViTs) on downstream tasks, fully unleashing FViTs' potential under data-limited scenarios (e.g., few-shot tuning) remains a challenge due to F... 详细信息
来源: 评论