咨询与建议

限定检索结果

文献类型

  • 20,798 篇 会议
  • 88 篇 期刊文献
  • 65 册 图书

馆藏范围

  • 20,950 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 13,275 篇 工学
    • 10,923 篇 计算机科学与技术...
    • 2,484 篇 机械工程
    • 2,307 篇 软件工程
    • 913 篇 光学工程
    • 771 篇 电气工程
    • 556 篇 控制科学与工程
    • 405 篇 信息与通信工程
    • 210 篇 测绘科学与技术
    • 131 篇 生物医学工程(可授...
    • 104 篇 电子科学与技术(可...
    • 100 篇 生物工程
    • 92 篇 仪器科学与技术
    • 56 篇 化学工程与技术
    • 52 篇 建筑学
    • 48 篇 土木工程
    • 44 篇 安全科学与工程
    • 38 篇 力学(可授工学、理...
    • 38 篇 航空宇航科学与技...
    • 35 篇 交通运输工程
  • 3,457 篇 医学
    • 3,449 篇 临床医学
    • 34 篇 基础医学(可授医学...
  • 2,315 篇 理学
    • 1,154 篇 数学
    • 1,132 篇 物理学
    • 417 篇 统计学(可授理学、...
    • 386 篇 生物学
    • 252 篇 系统科学
    • 57 篇 化学
  • 353 篇 管理学
    • 184 篇 图书情报与档案管...
    • 176 篇 管理科学与工程(可...
    • 32 篇 工商管理
  • 28 篇 法学
  • 20 篇 农学
  • 15 篇 教育学
  • 9 篇 经济学
  • 8 篇 艺术学
  • 5 篇 文学
  • 5 篇 军事学

主题

  • 8,203 篇 computer vision
  • 3,010 篇 pattern recognit...
  • 2,732 篇 training
  • 1,769 篇 computational mo...
  • 1,657 篇 visualization
  • 1,483 篇 cameras
  • 1,415 篇 shape
  • 1,369 篇 three-dimensiona...
  • 1,369 篇 face recognition
  • 1,285 篇 image segmentati...
  • 1,272 篇 feature extracti...
  • 1,178 篇 robustness
  • 1,090 篇 semantics
  • 1,040 篇 layout
  • 1,007 篇 object detection
  • 975 篇 object recogniti...
  • 969 篇 computer science
  • 946 篇 computer archite...
  • 946 篇 benchmark testin...
  • 931 篇 codes

机构

  • 174 篇 univ sci & techn...
  • 154 篇 carnegie mellon ...
  • 148 篇 univ chinese aca...
  • 144 篇 chinese univ hon...
  • 113 篇 microsoft resear...
  • 103 篇 zhejiang univ pe...
  • 99 篇 swiss fed inst t...
  • 97 篇 tsinghua univ pe...
  • 93 篇 tsinghua univers...
  • 91 篇 microsoft res as...
  • 88 篇 shanghai ai lab ...
  • 81 篇 zhejiang univers...
  • 76 篇 alibaba grp peop...
  • 74 篇 hong kong univ s...
  • 73 篇 university of sc...
  • 72 篇 peking univ peop...
  • 69 篇 university of ch...
  • 68 篇 shanghai jiao to...
  • 66 篇 google res mount...
  • 66 篇 univ oxford oxfo...

作者

  • 80 篇 van gool luc
  • 71 篇 zhang lei
  • 59 篇 timofte radu
  • 48 篇 yang yi
  • 47 篇 xiaoou tang
  • 44 篇 darrell trevor
  • 43 篇 tian qi
  • 43 篇 luc van gool
  • 42 篇 loy chen change
  • 42 篇 sun jian
  • 42 篇 li fei-fei
  • 40 篇 qi tian
  • 39 篇 li stan z.
  • 37 篇 liu yang
  • 37 篇 chen xilin
  • 36 篇 shan shiguang
  • 35 篇 liu xiaoming
  • 35 篇 vasconcelos nuno
  • 35 篇 torralba antonio
  • 32 篇 zhou jie

语言

  • 20,928 篇 英文
  • 14 篇 中文
  • 6 篇 其他
  • 2 篇 日文
  • 2 篇 土耳其文
检索条件"任意字段=2009 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009"
20951 条 记 录,以下是331-340 订阅
排序:
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image recognition
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Ding, Xiaohan Zhang, Yiyuan Ge, Yixiao Zhao, Sijie Song, Lin Yue, Xiangyu Shan, Ying Tencent AI Lab Shenzhen Peoples R China Chinese Univ Hong Kong Hong Kong Peoples R China
Large-kernel convolutional neural networks (ConvNets) have recently received extensive research attention, but two unresolved and critical issues demand further investigation. 1) The architectures of existing large-ke... 详细信息
来源: 评论
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimo...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Tong, Shengbang Liu, Zhuang Zhai, Yuexiang Ma, Yi Lecun, Yann Xie, Saining NYU New York NY 10003 USA Meta FAIR Menlo Pk CA 94025 USA Univ Calif Berkeley Berkeley CA USA
Is vision good enough for language? Recent advancements in multimodal models primarily stem from the powerful reasoning abilities of large language models (LLMs). However, the visual component typically depends only o... 详细信息
来源: 评论
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine vision-Language Model
Jack of All Tasks, Master of Many: Designing General-purpose...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Pramanick, Shraman Han, Guangxing Hou, Rui Nag, Sayan Lim, Ser-Nam Ballas, Nicolas Wang, Qifan Chellappa, Rama Almahairi, Amjad Johns Hopkins Univ Baltimore MD 21218 USA Meta New York NY 10003 USA Univ Toronto Toronto ON Canada Univ Cent Florida Orlando FL 32816 USA
The ability of large language models (LLMs) to process visual inputs has given rise to general-purpose vision systems, unifying various vision-language (VL) tasks by instruction tuning. However, due to the enormous di...
来源: 评论
Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf vision-Language Models
Emergent Open-Vocabulary Semantic Segmentation from Off-the-...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Luo, Jiayun Khandelwal, Siddhesh Sigal, Leonid Li, Boyang Nanyang Technol Univ Singapore Singapore Univ British Columbia Vector Inst AI Vancouver BC Canada
From image-text pairs, large-scale vision-language models (VLMs) learn to implicitly associate image regions with words, which prove effective for tasks like visual question answering. However, leveraging the learned ... 详细信息
来源: 评论
MULTIFLOW: Shifting Towards Task-Agnostic vision-Language Pruning
MULTIFLOW: Shifting Towards Task-Agnostic Vision-Language Pr...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Farina, Matteo Mancini, Massimiliano Cunegatti, Elia Liu, Gaowen Iacca, Giovanni Ricci, Elisa Univ Trento Trento Italy Cisco Res Res Triangle Pk NC USA Fdn Bruno Kessler Povo Italy
While excellent in transfer learning, vision-Language models (VLMs) come with high computational costs due to their large number of parameters. To address this issue, removing parameters via model pruning is a viable ... 详细信息
来源: 评论
RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding
RegionPLC: Regional Point-Language Contrastive Learning for ...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Yang, Jihan Ding, Runyu Deng, Weipeng Wang, Zhe Qi, Xiaojuan Univ Hong Kong Hong Kong Peoples R China SenseTime Res Hong Kong Peoples R China
We propose a lightweight and scalable Regional Point-Language Contrastive learning framework, namely RegionPLC, for open-world 3D scene understanding, aiming to identify and recognize open-set objects and categories. ... 详细信息
来源: 评论
Unravelling Robustness of Deep Face recognition Networks Against Illicit Drug Abuse Images
Unravelling Robustness of Deep Face Recognition Networks Aga...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Dhake, Hruturaj Agarwal, Akshay IISER Bhopal Data Sci & Engn Bhopal India
Alteration in facial features can lead to a significant drop in recognition performance. These alterations can be due to several factors: one such prominent and less explored factor is illicit drug abuse. To advance t...
来源: 评论
Equivariant Multi-Modality Image Fusion
Equivariant Multi-Modality Image Fusion
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Zhao, Zixiang Hai, Haowen Zhang, Jiangshe Zhang, Yulun Zhane, Kai Xu, Shuang Chen, Dongdong Timofte, Radu Van Gool, Luc Xi An Jiao Tong Univ Xian Peoples R China Swiss Fed Inst Technol Zurich Switzerland Shanghai Jiao Tong Univ Shanghai Peoples R China Nanjing Univ Nanjing Peoples R China Northwestern Polytech Univ Xian Peoples R China Heriot Watt Univ Edinburgh Midlothian Scotland Univ Wurzburg Wurzburg Germany INSAIT Sofia Bulgaria
Multi-modality image fusion is a technique that combines information from different sensors or modalities, enabling the fused image to retain complementary features from each modality, such as functional highlights an... 详细信息
来源: 评论
Sharingan: A Transformer Architecture for Multi-Person Gaze Following
Sharingan: A Transformer Architecture for Multi-Person Gaze ...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Tafasca, Samy Gupta, Anshul Odobez, Jean-Marc Idiap Res Inst Martigny Switzerland Ecole Polytech Fed Lausanne Lausanne Switzerland
Gaze is a powerful form of non-verbal communication that humans develop from an early age. As such, modeling this behavior is an important task that can benefit a broad set of application domains ranging from robotics... 详细信息
来源: 评论
EgoThink: Evaluating First-Person Perspective Thinking Capability of vision-Language Models
EgoThink: Evaluating First-Person Perspective Thinking Capab...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Cheng, Sijie Guo, Zhicheng Wu, Jingwen Fang, Kechen Li, Peng Liu, Huaping Liu, Yang Tsinghua Univ Dept Comp Sci & Technol Beijing Peoples R China Tsinghua Univ Inst AI Ind Res AIR Beijing Peoples R China Univ Toronto Dept Elect & Comp Engn Toronto ON Canada Tsinghua Univ Zhili Coll Beijing Peoples R China 01 Ai Beijing Peoples R China
vision-language models (VLMs) have recently shown promising results in traditional downstream tasks. Evaluation studies have emerged to assess their abilities, with the majority focusing on the third-person perspectiv... 详细信息
来源: 评论