Search query: any field = "2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024"
11,890 records; showing records 331-340
MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene Understanding
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Cao, Xu; Zhou, Tong; Ma, Yunsheng; Ye, Wenqian; Cui, Can; Tang, Kun; Cao, Zhipeng; Liang, Kaizhao; Wang, Ziran; Rehg, James M.; Zheng, Chao
Affiliations: Tencent T Lab Palo Alto CA 94306 USA; Univ Illinois Champaign IL USA; Purdue Univ W Lafayette IN USA; Univ Virginia Charlottesville VA USA; SambaNova Syst Inc Palo Alto CA USA
Vision-language generative AI has demonstrated remarkable promise for empowering cross-modal scene understanding of autonomous driving and high-definition (HD) map systems. However, current benchmark datasets lack mul...
TIM: A Time Interval Machine for Audio-Visual Action Recognition
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Chalk, Jacob; Huh, Jaesung; Kazakos, Evangelos; Zisserman, Andrew; Damen, Dima
Affiliations: Univ Bristol Bristol Avon England; Univ Oxford VGG Oxford England; Czech Tech Univ Prague Czech Republic
Diverse actions give rise to rich audio-visual signals in long videos. Recent works showcase that the two modalities of audio and video exhibit different temporal extents of events and distinct labels. We address the ...
DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Chen, Yangyi; Sikka, Karan; Cogswell, Michael; Ji, Heng; Divakaran, Ajay
Affiliations: SRI Int Menlo Pk CA 94025 USA; Univ Illinois Champaign IL 61820 USA
We present DRESS, a large vision language model (LVLM) that innovatively exploits Natural Language feedback (NLF) from Large Language Models to enhance its alignment and interactions by addressing two key limitations ...
MaskCLR: Attention-Guided Contrastive Learning for Robust Action Representation Learning
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Abdelfattah, Mohamed; Hassan, Mariam; Alahi, Alexandre
Affiliations: Ecole Polytech Fed Lausanne EPFL Lausanne Switzerland
Current transformer-based skeletal action recognition models tend to focus on a limited set of joints and low-level motion patterns to predict action classes. This results in significant performance degradation under ...
Prompting Vision Foundation Models for Pathology Image Analysis
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Yin, Chong; Liu, Siqi; Zhou, Kaiyang; Wong, Vincent Wai-Sun; Yuen, Pong C.
Affiliations: Hong Kong Baptist Univ Dept Comp Sci Hong Kong Peoples R China; Chinese Univ Hong Kong Shenzhen Res Inst Big Data Shenzhen Peoples R China; Chinese Univ Hong Kong Dept Med & Therapeut Hong Kong Peoples R China
The rapid increase in cases of non-alcoholic fatty liver disease (NAFLD) in recent years has raised significant public concern. Accurately identifying tissue alteration regions is crucial for the diagnosis of NAFLD, b...
InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Hoe, Jiun Tian; Jiang, Xudong; Chan, Chee Seng; Tan, Yap-Peng; Hu, Weipeng
Affiliations: Nanyang Technol Univ Sch EEE Singapore Singapore; Univ Malaya CISiP Kuala Lumpur Malaysia
Large-scale text-to-image (T2I) diffusion models have showcased incredible capabilities in generating coherent images based on textual descriptions, enabling vast applications in content generation. While recent advan...
RoMa: Robust Dense Feature Matching
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Edstedt, Johan; Sun, Qiyu; Bokman, Georg; Wadenback, Marten; Felsberg, Michael
Affiliations: Linkoping Univ Linkoping Sweden; East China Univ Sci & Technol Shanghai Peoples R China; Chalmers Univ Technol Gothenburg Sweden
Feature matching is an important computer vision task that involves estimating correspondences between two images of a 3D scene, and dense methods estimate all such correspondences. The aim is to learn a robust model,...
Domain Prompt Learning with Quaternion Networks
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Cao, Qinglong; Xu, Zhengqin; Chen, Yuntian; Ma, Chao; Yang, Xiaokang
Affiliations: Shanghai Jiao Tong Univ AI Inst MoE Key Lab Artificial Intelligence Shanghai Peoples R China; Eastern Inst Technol Ningbo Inst Digital Twin Ningbo Peoples R China
Prompt learning has emerged as a potent and resource-efficient technique in large Vision-Language Models (VLMs). However, its application in adapting VLMs to specialized domains like remote sensing and medical imaging...
Robust Emotion Recognition in Context Debiasing
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Yang, Dingkang; Yang, Kun; Li, Mingcheng; Wang, Shunli; Wang, Shuaibing; Zhang, Lihua
Affiliations: Fudan Univ Acad Engn & Technol Shanghai Peoples R China; Cognit & Intelligent Technol Lab CIT Lab Beijing Peoples R China; Jilin Prov Key Lab Intelligence Sci & Engn Changchun Peoples R China; Minist Educ Engn Res Ctr AI & Robot Shanghai Peoples R China
Context-aware emotion recognition (CAER) has recently boosted the practical applications of affective computing techniques in unconstrained environments. Mainstream CAER methods invariably extract ensemble representat...
One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Lin, L.; Guan, Haoyan; Qiu, Jianing; Spratling, Michael
Affiliations: Kings Coll London London England; Imperial Coll London London England
Large pre-trained Vision-Language Models (VLMs) like CLIP, despite having remarkable generalization ability, are highly vulnerable to adversarial examples. This work studies the adversarial robustness of VLMs from the...