Search query: "Any field = 2009 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009"
20,891 records; results 51-60 are listed below.
Learning Group Activity Features Through Person Attribute Prediction
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Nakatani, Chihiro; Kawashima, Hiroaki; Ukita, Norimichi
Affiliations: Toyota Technol Inst Toyota Japan; Univ Hyogo Himeji Hyogo Japan
This paper proposes Group Activity Feature (GAF) learning in which features of multi-person activity are learned as a compact latent vector. Unlike prior work in which the manual annotation of group activities is requ...
Token Transformation Matters: Towards Faithful Post-hoc Explanation for Vision Transformer
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Wu, Junyi; Duan, Bin; Kang, Weitai; Tang, Hao; Yan, Yan
Affiliations: IIT Dept Comp Sci Chicago IL 60616 USA; Carnegie Mellon Univ Robot Inst Pittsburgh PA 15213 USA
While Transformers have rapidly gained popularity in various computer vision applications, post-hoc explanations of their internal mechanisms remain largely unexplored. Vision Transformers extract visual information b...
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Chen, Yufan; Zhang, Jiaming; Peng, Kunyu; Zheng, Junwei; Liu, Ruiping; Torre, Philip; Stiefelhagen, Rainer
Affiliations: Karlsruhe Inst Technol Karlsruhe Germany; Univ Oxford Oxford England
Before developing a Document Layout Analysis (DLA) model in real-world applications, conducting comprehensive robustness testing is essential. However, the robustness of DLA models remains underexplored in the literat...
Object Recognition as Next Token Prediction
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Yue, Kaiyu; Chen, Bor-Chun; Geiping, Jonas; Li, Hengduo; Goldstein, Tom; Lim, Ser-Nam
Affiliations: Meta Menlo Pk CA 94025 USA; Univ Maryland College Pk MD 20742 USA; ELLIS Inst Tubingen Germany; MPI IS Tubingen Tubingen Germany; Univ Cent Florida Orlando FL 32816 USA; Meta AI Menlo Pk CA USA
We present an approach to pose object recognition as next token prediction. The idea is to apply a language decoder that auto-regressively predicts the text tokens from image embeddings to form labels. To ground this ...
Vision-and-Language Navigation via Causal Learning
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Wang, Liuyi; He, Zongtao; Dang, Ronghao; Shen, Mengjiao; Liu, Chengju; Chen, Qijun
Affiliations: Tongji Univ Sch Elect & Informat Engn Shanghai Peoples R China
In the pursuit of robust and generalizable environment perception and language understanding, the ubiquitous challenge of dataset bias continues to plague vision-and-language navigation (VLN) agents, hindering their p...
Making Vision Transformers Truly Shift-Equivariant
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Rojas-Gomez, Renan A.; Lim, Teck-Yian; Do, Minh N.; Yeh, Raymond A.
Affiliations: UIUC Dept Elect Engn Urbana IL 61801 USA; UIUC VinUni Illinois Smart Hlth Ctr Urbana IL USA; Purdue Univ Dept Comp Sci W Lafayette IN 47907 USA
In the field of computer vision, Vision Transformers (ViTs) have emerged as a prominent deep learning architecture. Despite being inspired by Convolutional Neural Networks (CNNs), ViTs are susceptible to small spatial...
Hybrid Functional Maps for Crease-Aware Non-Isometric Shape Matching
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Bastian, Lennart; Xie, Yizheng; Navab, Nassir; Laehner, Zorah
Affiliations: Tech Univ Munich Munich Germany; Univ Siegen Siegen Germany; Univ Bonn Bonn Germany; Lamarr Inst Bonn Germany
Non-isometric shape correspondence remains a fundamental challenge in computer vision. Traditional methods using Laplace-Beltrami operator (LBO) eigenmodes face limitations in characterizing high-frequency extrinsic s...
Classes Are Not Equal: An Empirical Study on Image Recognition Fairness
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Cui, Jiequan; Zhu, Beier; Wen, Xin; Qi, Xiaojuan; Yu, Bei; Zhang, Hanwang
Affiliations: Nanyang Technol Univ Singapore Singapore; Univ Hong Kong Hong Kong Peoples R China; Chinese Univ Hong Kong Hong Kong Peoples R China
In this paper, we present an empirical study on image recognition unfairness, i.e., extreme class accuracy disparity on balanced data like ImageNet. We demonstrate that classes are not equal and unfairness is prevalent for image classifi...
Accurate Training Data for Occupancy Map Prediction in Automated Driving Using Evidence Theory
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Kaelble, Jonas; Wirges, Sascha; Tatarchenko, Maxim; Ilg, Eddy
Affiliations: Bosch Ctr Artificial Intelligence Renningen Germany; Saarland Univ Saarbrucken Germany
Automated driving fundamentally requires knowledge about the surrounding geometry of the scene. Modern approaches use only captured images to predict occupancy maps that represent the geometry. Training these approach...
Honeybee: Locality-enhanced Projector for Multimodal LLM
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Cha, Junbum; Kang, Wooyoung; Mun, Jonghwan; Roh, Byungseok
Affiliations: Kakao Brain Seongnam South Korea
In Multimodal Large Language Models (MLLMs), a visual projector plays a crucial role in bridging pre-trained vision encoders with LLMs, enabling profound visual understanding while harnessing the LLMs' robust capa...