咨询与建议

限定检索结果

文献类型

  • 23,000 篇 会议
  • 126 册 图书
  • 92 篇 期刊文献

馆藏范围

  • 23,217 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 13,622 篇 工学
    • 11,107 篇 计算机科学与技术...
    • 3,479 篇 软件工程
    • 2,445 篇 机械工程
    • 1,716 篇 光学工程
    • 1,075 篇 电气工程
    • 1,014 篇 控制科学与工程
    • 784 篇 信息与通信工程
    • 411 篇 仪器科学与技术
    • 352 篇 生物工程
    • 251 篇 生物医学工程(可授...
    • 196 篇 电子科学与技术(可...
    • 114 篇 化学工程与技术
    • 107 篇 安全科学与工程
    • 100 篇 测绘科学与技术
    • 88 篇 建筑学
    • 86 篇 交通运输工程
    • 84 篇 土木工程
  • 3,494 篇 医学
    • 3,481 篇 临床医学
    • 81 篇 基础医学(可授医学...
  • 3,241 篇 理学
    • 1,939 篇 物理学
    • 1,640 篇 数学
    • 563 篇 统计学(可授理学、...
    • 500 篇 生物学
    • 249 篇 系统科学
    • 106 篇 化学
  • 521 篇 管理学
    • 311 篇 图书情报与档案管...
    • 223 篇 管理科学与工程(可...
    • 76 篇 工商管理
  • 276 篇 艺术学
    • 276 篇 设计学(可授艺术学...
  • 66 篇 法学
    • 63 篇 社会学
  • 38 篇 农学
  • 28 篇 教育学
  • 22 篇 经济学
  • 10 篇 军事学
  • 3 篇 文学

主题

  • 10,186 篇 computer vision
  • 3,966 篇 pattern recognit...
  • 3,005 篇 training
  • 2,007 篇 computational mo...
  • 1,818 篇 visualization
  • 1,815 篇 cameras
  • 1,515 篇 feature extracti...
  • 1,481 篇 shape
  • 1,455 篇 three-dimensiona...
  • 1,438 篇 image segmentati...
  • 1,287 篇 robustness
  • 1,205 篇 computer archite...
  • 1,155 篇 semantics
  • 1,147 篇 conferences
  • 1,107 篇 layout
  • 1,092 篇 computer science
  • 1,087 篇 object detection
  • 1,025 篇 benchmark testin...
  • 970 篇 codes
  • 922 篇 face recognition

机构

  • 136 篇 univ sci & techn...
  • 121 篇 univ chinese aca...
  • 118 篇 chinese univ hon...
  • 107 篇 carnegie mellon ...
  • 101 篇 tsinghua univers...
  • 101 篇 microsoft resear...
  • 95 篇 swiss fed inst t...
  • 93 篇 zhejiang univ pe...
  • 82 篇 university of sc...
  • 81 篇 zhejiang univers...
  • 80 篇 university of ch...
  • 77 篇 shanghai ai lab ...
  • 72 篇 shanghai jiao to...
  • 69 篇 national laborat...
  • 67 篇 microsoft res as...
  • 67 篇 alibaba grp peop...
  • 64 篇 adobe research
  • 61 篇 tsinghua univ pe...
  • 60 篇 peking univ peop...
  • 59 篇 univ oxford oxfo...

作者

  • 81 篇 van gool luc
  • 72 篇 timofte radu
  • 64 篇 zhang lei
  • 47 篇 luc van gool
  • 40 篇 yang yi
  • 40 篇 li stan z.
  • 37 篇 loy chen change
  • 34 篇 chen chen
  • 33 篇 xiaoou tang
  • 32 篇 liu yang
  • 32 篇 qi tian
  • 31 篇 tian qi
  • 31 篇 sun jian
  • 30 篇 murino vittorio
  • 30 篇 pascal fua
  • 29 篇 darrell trevor
  • 29 篇 li fei-fei
  • 28 篇 li xin
  • 28 篇 ying shan
  • 27 篇 vasconcelos nuno

语言

  • 23,137 篇 英文
  • 52 篇 其他
  • 22 篇 中文
  • 5 篇 土耳其文
  • 2 篇 日文
检索条件"任意字段=IEEE Conference on Computer Vision and Pattern Recognition Workshops"
23218 条 记 录,以下是351-360 订阅
ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
ViP-LLaVA: Making Large Multimodal Models Understand Arbitra...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Cai, Mu Liu, Haotian Mustikovela, Siva Karthik Meyer, Gregory P. Chai, Yuning Park, Dennis Lee, Yong Jae Univ Wisconsin Madison WI 53706 USA Cruise LLC San Francisco CA USA
While existing large vision-language multimodal models focus on whole image understanding, there is a prominent gap in achieving region-specific comprehension. Current approaches that use textual coordinates or spatia... 详细信息
来源: 评论
Grounding Everything: Emerging Localization Properties in vision-Language Transformers
Grounding Everything: Emerging Localization Properties in Vi...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Bousselham, Walid Petersen, Felix Ferrari, Vittorio Kuehne, Hilde Univ Bonn Bonn Germany Goethe Univ Frankfurt Frankfurt Germany Stanford Univ Stanford CA 94305 USA Synthesia Io London England MIT IBM Watson AI Lab Cambridge MA USA
vision-language foundation models have shown remarkable performance in various zero-shot settings such as image retrieval, classification, or captioning. But so far, those models seem to fall behind when it comes to z... 详细信息
来源: 评论
Unseen And Adverse Outdoor Scenes recognition Through Event-based Captions
Unseen And Adverse Outdoor Scenes Recognition Through Event-...
收藏 引用
ieee/CVF International conference on computer vision (ICCV)
作者: Sakaino, Hidetomo Weathernews Inc Weather Transportat Lab Visual Recognit Grp Chiba Japan
This paper presents EventCAP, i.e., event-based captions, for refined and enriched qualitative and quantitative captions by Deep Learning (DL) models and vision Language Models (VLMs) with different tasks in a complem... 详细信息
来源: 评论
Random Entangled Tokens for Adversarially Robust vision Transformer
Random Entangled Tokens for Adversarially Robust Vision Tran...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Gong, Huihui Dong, Mingjing Mao, Siqi Camtepe, Seyit Nepal, Surya Xu, Chang Univ Sydney Sydney NSW Australia CSIRO Data61 Eveleigh Australia City Univ Hong Kong Hong Kong Peoples R China Univ New South Wales Sydney NSW Australia
vision Transformers (ViTs) have emerged as a compelling alternative to Convolutional Neural Networks ( CNNs) in the realm of computer vision, showcasing tremendous potential. However, recent research has unveiled a su... 详细信息
来源: 评论
LowRankOcc: Tensor Decomposition and Low-Rank Recovery for vision-based 3D Semantic Occupancy Prediction
LowRankOcc: Tensor Decomposition and Low-Rank Recovery for V...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Zhao, Linqing Xu, Xiuwei Wang, Ziwei Zhang, Yunpeng Zhang, Borui Zheng, Wenzhao Du, Dalong Zhou, Jie Lu, Jiwen Tsinghua Univ Dept Automat Beijing Peoples R China Tianjin Univ Sch Elect & Informat Engn Tianjin Peoples R China PhiGent Robot Beijing Peoples R China
In this paper, we present a tensor decomposition and low-rank recovery approach (LowRankOcc) for vision-based 3D semantic occupancy prediction. Conventional methods model outdoor scenes with fine-grained 3D grids, but... 详细信息
来源: 评论
GenZI: Zero-Shot 3D Human-Scene Interaction Generation
GenZI: Zero-Shot 3D Human-Scene Interaction Generation
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Li, Lei Dai, Angela Tech Univ Munich Munich Germany
Can we synthesize 3D humans interacting with scenes without learning from any 3D human-scene interaction data? We propose GenZI(1), the first zero-shot approach to generating 3D human-scene interactions. Key to GenZI ... 详细信息
来源: 评论
Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect vision-Language Pre-training Framework
Decomposing Disease Descriptions for Enhanced Pathology Dete...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Vu Minh Hieu Phan Xie, Yutong Qi, Yuankai Liu, Linggiao Liu, Liyang Zhang, Bowen Liao, Zhibin Wu, Qi To, Minh-Son Verjans, Johan W. Univ Adelaide Australian Inst Machine Learning Adelaide SA Australia Macquarie Univ Sydney NSW Australia Flinders Univ S Australia Adelaide SA Australia
Medical vision language pre-training (VLP) has emerged as a frontier of research, enabling zero-shot pathological recognition by comparing the query image with the textual descriptions for each disease. Due to the com... 详细信息
来源: 评论
SyncMask: Synchronized Attentional Masking for Fashion-centric vision-Language Pretraining
SyncMask: Synchronized Attentional Masking for Fashion-centr...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Song, Chull Hwan Hwang, Taebaek Yoon, Jooyoung Choi, Shunghyun Gu, Yeong Hyeon Dealicious Inc Seoul South Korea Sejong Univ Seoul South Korea
vision-language models (VLMs) have made significant strides in cross-modal understanding through large-scale paired datasets. However, in fashion domain, datasets of-en exhibit a disparity between the information conv... 详细信息
来源: 评论
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Sun, Zeyi Fang, Ye Wu, Tong Zhang, Pan Zang, Yuhang Kong, Shu Xiong, Yuanjun Lin, Dahua Wang, Jiaqi Shanghai Jiao Tong Univ Shanghai Peoples R China Fudan Univ Shanghai Peoples R China Chinese Univ Hong Kong Hong Kong Peoples R China Shanghai AI Lab Shanghai Peoples R China Univ Macau Taipa Macao Peoples R China MThreads Inc Beijing Peoples R China
Contrastive Language-Image Pre-training (CLIP) plays an essential role in extracting valuable content information from images across diverse tasks. It aligns textual and visual modalities to comprehend the entire imag... 详细信息
来源: 评论
The Gender Gap in Face recognition Accuracy Is a Hairy Problem
The Gender Gap in Face Recognition Accuracy Is a Hairy Probl...
收藏 引用
23rd ieee/CVF Winter conference on Applications of computer vision (WACV)
作者: Bhatta, Aman Albiero, Vitor Bowyer, Kevin W. King, Michael C. Univ Notre Dame Notre Dame IN 46556 USA Florida Inst Technol Melbourne FL 32901 USA
It is broadly accepted that there is a "gender gap" in face recognition accuracy, with females having lower accuracy. However, relatively little is known about the cause(s) of this gender gap. We first demon... 详细信息
来源: 评论