咨询与建议

限定检索结果

文献类型

  • 11,883 篇 会议
  • 5 篇 期刊文献

馆藏范围

  • 11,888 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 8,055 篇 工学
    • 7,613 篇 计算机科学与技术...
    • 796 篇 机械工程
    • 688 篇 电气工程
    • 356 篇 软件工程
    • 225 篇 控制科学与工程
    • 40 篇 光学工程
    • 19 篇 生物工程
    • 17 篇 信息与通信工程
    • 12 篇 生物医学工程(可授...
    • 6 篇 电子科学与技术(可...
    • 6 篇 建筑学
    • 6 篇 交通运输工程
    • 5 篇 仪器科学与技术
    • 5 篇 化学工程与技术
    • 5 篇 安全科学与工程
    • 4 篇 土木工程
  • 3,344 篇 医学
    • 3,343 篇 临床医学
    • 4 篇 基础医学(可授医学...
    • 4 篇 公共卫生与预防医...
  • 250 篇 理学
    • 198 篇 系统科学
    • 29 篇 物理学
    • 21 篇 生物学
    • 15 篇 数学
    • 9 篇 统计学(可授理学、...
    • 4 篇 化学
  • 17 篇 管理学
    • 12 篇 管理科学与工程(可...
    • 7 篇 图书情报与档案管...
    • 5 篇 工商管理
  • 3 篇 法学
    • 3 篇 社会学
  • 3 篇 教育学
    • 3 篇 教育学
  • 2 篇 农学
  • 1 篇 经济学
  • 1 篇 军事学

主题

  • 5,632 篇 computer vision
  • 2,668 篇 training
  • 2,203 篇 pattern recognit...
  • 1,746 篇 computational mo...
  • 1,502 篇 visualization
  • 1,360 篇 three-dimensiona...
  • 1,074 篇 semantics
  • 999 篇 benchmark testin...
  • 986 篇 codes
  • 959 篇 computer archite...
  • 891 篇 deep learning
  • 777 篇 conferences
  • 754 篇 task analysis
  • 699 篇 feature extracti...
  • 561 篇 transformers
  • 533 篇 face recognition
  • 527 篇 neural networks
  • 495 篇 object detection
  • 490 篇 image segmentati...
  • 468 篇 cameras

机构

  • 174 篇 univ sci & techn...
  • 145 篇 carnegie mellon ...
  • 144 篇 univ chinese aca...
  • 144 篇 tsinghua univ pe...
  • 134 篇 chinese univ hon...
  • 110 篇 zhejiang univ pe...
  • 109 篇 peng cheng lab p...
  • 99 篇 swiss fed inst t...
  • 91 篇 tsinghua univers...
  • 90 篇 shanghai ai lab ...
  • 87 篇 sensetime res pe...
  • 86 篇 shanghai jiao to...
  • 83 篇 zhejiang univers...
  • 82 篇 tech univ munich...
  • 79 篇 university of sc...
  • 79 篇 stanford univ st...
  • 78 篇 univ hong kong p...
  • 77 篇 australian natl ...
  • 76 篇 alibaba grp peop...
  • 75 篇 peng cheng labor...

作者

  • 75 篇 timofte radu
  • 64 篇 van gool luc
  • 50 篇 zhang lei
  • 43 篇 yang yi
  • 37 篇 loy chen change
  • 36 篇 tao dacheng
  • 32 篇 zhou jie
  • 31 篇 chen chen
  • 30 篇 liu yang
  • 30 篇 tian qi
  • 29 篇 sun jian
  • 29 篇 zha zheng-jun
  • 28 篇 li xin
  • 27 篇 qi tian
  • 26 篇 vasconcelos nuno
  • 25 篇 liu xiaoming
  • 25 篇 darrell trevor
  • 24 篇 zheng wei-shi
  • 24 篇 luo ping
  • 24 篇 ying shan

语言

  • 11,862 篇 英文
  • 25 篇 其他
  • 1 篇 中文
检索条件"任意字段=2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024"
11888 条 记 录,以下是61-70 订阅
排序:
Honeybee: Locality-enhanced Projector for Multimodal LLM
Honeybee: Locality-enhanced Projector for Multimodal LLM
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Cha, Junbum Kang, Wooyoung Mun, Jonghwan Roh, Byungseok Kakao Brain Seongnam South Korea
In Multimodal Large Language Models (MLLMs), a visual projector plays a crucial role in bridging pre-trained vision encoders with LLMs, enabling profound visual understanding while harnessing the LLMs' robust capa... 详细信息
来源: 评论
Making vision Transformers Truly Shift-Equivariant
Making Vision Transformers Truly Shift-Equivariant
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Rojas-Gomez, Renan A. Lim, Teck-Yian Do, Minh N. Yeh, Raymond A. UIUC Dept Elect Engn Urbana IL 61801 USA UIUC VinUni Illinois Smart Hlth Ctr Urbana IL USA Purdue Univ Dept Comp Sci W Lafayette IN 47907 USA
In the field of computer vision, vision Transformers (ViTs) have emerged as a prominent deep learning architecture. Despite being inspired by Convolutional Neural Networks (CNNs), ViTs are susceptible to small spatial... 详细信息
来源: 评论
SpikingResformer: Bridging ResNet and vision Transformer in Spiking Neural Networks
SpikingResformer: Bridging ResNet and Vision Transformer in ...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Shi, Xinyu Hao, Zecheng Yu, Zhaofei Peking Univ Inst Artificial Intelligence Beijing Peoples R China Peking Univ Sch Comp Sci Beijing Peoples R China
The remarkable success of vision Transformers in Artificial Neural Networks (ANNs) has led to a growing interest in incorporating the self-attention mechanism and transformer-based architecture into Spiking Neural Net... 详细信息
来源: 评论
Intrinsic Image Diffusion for Indoor Single-view Material Estimation
Intrinsic Image Diffusion for Indoor Single-view Material Es...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Kocsis, Peter Sitzmann, Vincent Niessner, Matthias Tech Univ Munich Munich Germany MIT EECS Cambridge MA 02139 USA
We present Intrinsic Image Diffusion, a generative model for appearance decomposition of indoor scenes. Given a single input view, we sample multiple possible material explanations represented as albedo, roughness, an... 详细信息
来源: 评论
Bi-Causal: Group Activity recognition via Bidirectional Causality
Bi-Causal: Group Activity Recognition via Bidirectional Caus...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Zhang, Youliang Liu, Wenxuan Xu, Danni Zhou, Zhuo Wang, Zheng Wuhan Univ Natl Engn Res Ctr Multimedia Software Sch Comp Sci Inst Artificial Intelligence Wuhan Hubei Peoples R China Hubei Key Lab Multimedia & Network Commun Engn Wuhan Hubei Peoples R China Wuhan Univ Technol Wuhan Hubei Peoples R China Natl Univ Singapore Singapore Singapore
Current approaches in Group Activity recognition (GAR) predominantly emphasize Human Relations (HRs) while often neglecting the impact of Human-Object Interactions (HOIs). This study prioritizes the consideration of b... 详细信息
来源: 评论
Hybrid Functional Maps for Crease-Aware Non-Isometric Shape Matching
Hybrid Functional Maps for Crease-Aware Non-Isometric Shape ...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Bastian, Lennart Xie, Yizheng Navab, Nassir Laehner, Zorah Tech Univ Munich Munich Germany Univ Siegen Siegen Germany Univ Bonn Bonn Germany Lamarr Inst Bonn Germany
Non-isometric shape correspondence remains a fundamental challenge in computer vision. Traditional methods using Laplace-Beltrami operator (LBO) eigenmodes face limitations in characterizing high-frequency extrinsic s... 详细信息
来源: 评论
On the Robustness of Language Guidance for Low-Level vision Tasks: Findings from Depth Estimation
On the Robustness of Language Guidance for Low-Level Vision ...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Chatterjee, Agneet Gokhale, Tejas Baral, Chitta Yang, Yezhou Arizona State Univ Tempe AZ 85281 USA Univ Maryland Baltimore Cty Baltimore MD 21228 USA
Recent advances in monocular depth estimation have been made by incorporating natural language as additional guidance. Although yielding impressive results, the impact of the language prior, particularly in terms of g... 详细信息
来源: 评论
Question Aware vision Transformer for Multimodal Reasoning
Question Aware Vision Transformer for Multimodal Reasoning
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Ganz, Roy Kittenplont, Yair Aberdam, Aviad Ben Avraham, Elad Nuriel, Oren Mazor, Shai Litmant, Ron Technion Haifa Israel AWS AI Labs Seattle WA 98019 USA
vision-Language (VL) models have gained significant research focus, enabling remarkable advances in multimodal reasoning. These architectures typically comprise a vision encoder, a Large Language Model (LLM), and a pr...
来源: 评论
PointInfinity: Resolution-Invariant Point Diffusion Models
PointInfinity: Resolution-Invariant Point Diffusion Models
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Huang, Zixuan Johnson, Justin Debnath, Shoubhik Rehg, James M. Wu, Chao-Yuan Meta FAIR Menlo Pk CA 94025 USA Univ Illinois Champaign IL 61820 USA
We present PointInfinity, an efficient family of point cloud diffusion models. Our core idea is to use a transformer-based architecture with a fixed-size, resolution-invariant latent representation. This enables effic... 详细信息
来源: 评论
Can Biases in ImageNet Models Explain Generalization?
Can Biases in ImageNet Models Explain Generalization?
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Gavrikov, Paul Keuper, Janis Offenburg Univ IMLA Offenburg Germany Univ Mannheim Mannheim Germany
The robust generalization of models to rare, in-distribution (ID) samples drawn from the long tail of the training distribution and to out-of-training-distribution (OOD) samples is one of the major challenges of curre... 详细信息
来源: 评论