咨询与建议

限定检索结果

文献类型

  • 50,479 篇 会议
  • 1,421 册 图书
  • 1,041 篇 期刊文献
  • 1 篇 学位论文

馆藏范围

  • 52,940 篇 电子文献
  • 4 种 纸本馆藏

日期分布

学科分类号

  • 31,811 篇 工学
    • 24,804 篇 计算机科学与技术...
    • 12,568 篇 软件工程
    • 5,153 篇 光学工程
    • 4,756 篇 电气工程
    • 4,436 篇 信息与通信工程
    • 4,257 篇 机械工程
    • 3,956 篇 控制科学与工程
    • 2,474 篇 生物工程
    • 1,728 篇 生物医学工程(可授...
    • 1,584 篇 仪器科学与技术
    • 1,317 篇 电子科学与技术(可...
    • 793 篇 化学工程与技术
    • 698 篇 安全科学与工程
    • 542 篇 交通运输工程
    • 379 篇 建筑学
    • 331 篇 土木工程
  • 11,839 篇 理学
    • 6,434 篇 物理学
    • 5,405 篇 数学
    • 2,761 篇 生物学
    • 1,910 篇 统计学(可授理学、...
    • 801 篇 化学
    • 669 篇 系统科学
  • 5,305 篇 医学
    • 5,094 篇 临床医学
    • 729 篇 基础医学(可授医学...
    • 459 篇 药学(可授医学、理...
  • 3,350 篇 管理学
    • 1,953 篇 图书情报与档案管...
    • 1,535 篇 管理科学与工程(可...
    • 479 篇 工商管理
  • 720 篇 艺术学
    • 718 篇 设计学(可授艺术学...
  • 428 篇 法学
    • 401 篇 社会学
  • 297 篇 农学
  • 197 篇 教育学
  • 163 篇 经济学
  • 63 篇 文学
  • 49 篇 军事学

主题

  • 17,385 篇 computer vision
  • 9,017 篇 pattern recognit...
  • 4,196 篇 training
  • 3,815 篇 feature extracti...
  • 3,134 篇 cameras
  • 2,870 篇 computational mo...
  • 2,789 篇 image segmentati...
  • 2,622 篇 visualization
  • 2,573 篇 shape
  • 2,533 篇 face recognition
  • 2,171 篇 robustness
  • 2,123 篇 computer science
  • 1,973 篇 object detection
  • 1,959 篇 computer archite...
  • 1,878 篇 layout
  • 1,853 篇 object recogniti...
  • 1,802 篇 three-dimensiona...
  • 1,725 篇 neural networks
  • 1,708 篇 humans
  • 1,691 篇 image recognitio...

机构

  • 165 篇 univ chinese aca...
  • 144 篇 tsinghua univers...
  • 136 篇 national laborat...
  • 108 篇 univ sci & techn...
  • 104 篇 zhejiang univers...
  • 100 篇 shanghai jiao to...
  • 95 篇 microsoft resear...
  • 94 篇 university of sc...
  • 86 篇 zhejiang univ pe...
  • 84 篇 shanghai ai lab ...
  • 74 篇 school of comput...
  • 69 篇 computer vision ...
  • 68 篇 peking univ peop...
  • 68 篇 chinese acad sci...
  • 65 篇 chinese univ hon...
  • 63 篇 institute of inf...
  • 62 篇 google res mount...
  • 61 篇 univ oxford oxfo...
  • 59 篇 univ toronto on
  • 57 篇 swiss fed inst t...

作者

  • 91 篇 van gool luc
  • 87 篇 umapada pal
  • 76 篇 zhang lei
  • 64 篇 lee seong-whan
  • 49 篇 vittorio murino
  • 42 篇 yang yi
  • 34 篇 nassir navab
  • 33 篇 li xin
  • 33 篇 jie yang
  • 32 篇 liu yang
  • 31 篇 escalera sergio
  • 31 篇 loy chen change
  • 30 篇 ling haibin
  • 30 篇 h. bischof
  • 29 篇 zhou jie
  • 29 篇 vasconcelos nuno
  • 29 篇 jan-michael frah...
  • 29 篇 hanqing lu
  • 28 篇 blumenstein mich...
  • 27 篇 jia yunde

语言

  • 51,871 篇 英文
  • 835 篇 其他
  • 241 篇 中文
  • 22 篇 土耳其文
  • 5 篇 西班牙文
  • 2 篇 日文
  • 2 篇 葡萄牙文
  • 2 篇 俄文
检索条件"任意字段=IEEE Conference on Computer Vision and Pattern Recognition"
52943 条 记 录,以下是131-140 订阅
排序:
Towards Efficient Audio-Visual Learners via Empowering Pre-trained vision Transformers with Cross-Modal Adaptation
Towards Efficient Audio-Visual Learners via Empowering Pre-t...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Wang, Kai Tian, Yapeng Hatzinakos, Dimitrios Univ Toronto Toronto ON Canada Univ Texas Dallas Richardson TX 75083 USA
In this paper, we explore the cross-modal adaptation of pre-trained vision Transformers (ViTs) for the audio-visual domain by incorporating a limited set of trainable parameters. To this end, we propose a Spatial-Temp... 详细信息
来源: 评论
A Theory of Joint Light and Heat Transport for Lambertian Scenes
A Theory of Joint Light and Heat Transport for Lambertian Sc...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Ramanagopal, Mani Narayanan, Sriram Sankaranarayanan, Aswin C. Narasimhan, Srinivasa G. Carnegie Mellon Univ Pittsburgh PA 15213 USA
We present a novel theory that establishes the relationship between light transport in visible and thermal infrared, and heat transport in solids. We show that heat generated due to light absorption can be estimated b... 详细信息
来源: 评论
Dense vision Transformer Compression with Few Samples
Dense Vision Transformer Compression with Few Samples
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Zhang, Hanxiao Zhou, Yifan Wang, Guo-Hua Nanjing Univ Natl Key Lab Novel Software Technol Nanjing Peoples R China Nanjing Univ Sch Artificial Intelligence Nanjing Peoples R China
Few-shot model compression aims to compress a large model into a more compact one with only a tiny training set (even without labels). Block-level pruning has recently emerged as a leading technique in achieving high ... 详细信息
来源: 评论
Adaptive Hyper-graph Aggregation for Modality-Agnostic Federated Learning
Adaptive Hyper-graph Aggregation for Modality-Agnostic Feder...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Qi, Fan Li, Shuai Tianjin Univ Technol Tianjin Peoples R China
In Federated Learning (FL), the issue of statistical data heterogeneity has been a significant challenge to the field's ongoing development. This problem is further exacerbated when clients' data vary in modal... 详细信息
来源: 评论
Low-Rank Few-Shot Adaptation of vision-Language Models
Low-Rank Few-Shot Adaptation of Vision-Language Models
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Zanella, Maxime Ben Ayed, Ismail UCLouvain Louvain Belgium UMons Mons Belgium ETS Montreal Montreal PQ Canada
Recent progress in the few-shot adaptation of vision-Language Models (VLMs) has further pushed their generalization capabilities, at the expense of just a few labeled samples within the target downstream task. However... 详细信息
来源: 评论
PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs
PIN: Positional Insert Unlocks Object Localisation Abilities...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Dorkenwald, Michael Barazani, Nimrod Snoek, Cees G. M. Asano, Yuki M. Univ Amsterdam Amsterdam Netherlands
vision-Language Models (VLMs), such as Flamingo and GPT-4V, have shown immense potential by integrating large language models with vision systems. Nevertheless, these models face challenges in the fundamental computer... 详细信息
来源: 评论
FPN-IAIA-BL: A Multi-Scale Interpretable Deep Learning Model for Classification of Mass Margins in Digital Mammography
FPN-IAIA-BL: A Multi-Scale Interpretable Deep Learning Model...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Yang, Julia Barnett, Alina Jade Donnelly, Jon Kishore, Satvik Fang, Jerry Schwartz, Fides Regina Chen, Chaofan Lo, Joseph Y. Rudin, Cynthia Duke Univ Durham NC 27708 USA Brigham & Womens Hosp 75 Francis St Boston MA 02115 USA Univ Maine Orono ME USA
Digital mammography is essential to breast cancer detection, and deep learning offers promising tools for faster and more accurate mammogram analysis. In radiology and other high-stakes environments, uninterpretable (... 详细信息
来源: 评论
Iterated Learning Improves Compositionality in Large vision-Language Models
Iterated Learning Improves Compositionality in Large Vision-...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Zheng, Chenhao Zhang, Jieyu Kembhavi, Aniruddha Krishna, Ranjay Univ Washington Seattle WA 98195 USA Univ Michigan Ann Arbor MI 48109 USA Allen Inst Artificial Intelligence Seattle WA USA
A fundamental characteristic common to both human vision and natural language is their compositional nature. Yet, despite the performance gains contributed by large vision and language pretraining, recent investigatio...
来源: 评论
ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain vision Transformers
ALGM: Adaptive Local-then-Global Token Merging for Efficient...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Norouzi, Narges Orlova, Svetlana de Geus, Daan Dubbelman, Gijs Eindhoven Univ Technol Eindhoven Netherlands
This work presents Adaptive Local-then-Global Merging (ALGM), a token reduction method for semantic segmentation networks that use plain vision Transformers. ALGM merges tokens in two stages: (1) In the first network ... 详细信息
来源: 评论
Streaming Dense Video Captioning
Streaming Dense Video Captioning
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (CVPR)
作者: Zhou, Xingyi Arnab, Anurag Buch, Shyamal Yan, Shen Myers, Austin Xiong, Xuehan Nagrani, Arsha Schmid, Cordelia Google Mountain View CA 94043 USA
An ideal model for dense video captioning - predicting captions localized temporally in a video - should be able to handle long input videos, predict rich, detailed textual descriptions, and be able to produce outputs... 详细信息
来源: 评论