咨询与建议

限定检索结果

文献类型

  • 23,135 篇 会议
  • 91 篇 期刊文献
  • 15 册 图书

馆藏范围

  • 23,240 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 13,631 篇 工学
    • 11,162 篇 计算机科学与技术...
    • 3,337 篇 软件工程
    • 2,414 篇 机械工程
    • 1,662 篇 光学工程
    • 1,204 篇 电气工程
    • 973 篇 控制科学与工程
    • 738 篇 信息与通信工程
    • 381 篇 仪器科学与技术
    • 322 篇 生物工程
    • 239 篇 生物医学工程(可授...
    • 188 篇 电子科学与技术(可...
    • 109 篇 化学工程与技术
    • 104 篇 安全科学与工程
    • 99 篇 测绘科学与技术
    • 85 篇 建筑学
    • 82 篇 土木工程
    • 82 篇 交通运输工程
    • 56 篇 力学(可授工学、理...
  • 3,696 篇 医学
    • 3,684 篇 临床医学
    • 76 篇 基础医学(可授医学...
  • 3,137 篇 理学
    • 1,880 篇 物理学
    • 1,604 篇 数学
    • 547 篇 统计学(可授理学、...
    • 466 篇 生物学
    • 243 篇 系统科学
    • 107 篇 化学
  • 491 篇 管理学
    • 290 篇 图书情报与档案管...
    • 212 篇 管理科学与工程(可...
    • 74 篇 工商管理
  • 252 篇 艺术学
    • 251 篇 设计学(可授艺术学...
  • 58 篇 法学
  • 38 篇 农学
  • 25 篇 教育学
  • 19 篇 经济学
  • 10 篇 军事学
  • 3 篇 文学

主题

  • 10,396 篇 computer vision
  • 3,893 篇 pattern recognit...
  • 3,101 篇 training
  • 2,104 篇 computational mo...
  • 1,898 篇 visualization
  • 1,800 篇 cameras
  • 1,487 篇 feature extracti...
  • 1,475 篇 three-dimensiona...
  • 1,464 篇 shape
  • 1,447 篇 image segmentati...
  • 1,287 篇 robustness
  • 1,234 篇 computer archite...
  • 1,213 篇 semantics
  • 1,112 篇 benchmark testin...
  • 1,111 篇 conferences
  • 1,104 篇 layout
  • 1,093 篇 object detection
  • 1,085 篇 computer science
  • 1,026 篇 codes
  • 907 篇 face recognition

机构

  • 137 篇 univ sci & techn...
  • 124 篇 univ chinese aca...
  • 121 篇 chinese univ hon...
  • 108 篇 tsinghua univers...
  • 108 篇 carnegie mellon ...
  • 105 篇 microsoft resear...
  • 97 篇 zhejiang univ pe...
  • 91 篇 swiss fed inst t...
  • 85 篇 university of sc...
  • 84 篇 zhejiang univers...
  • 81 篇 shanghai ai lab ...
  • 79 篇 university of ch...
  • 75 篇 shanghai jiao to...
  • 69 篇 microsoft res as...
  • 68 篇 alibaba grp peop...
  • 66 篇 adobe research
  • 65 篇 national laborat...
  • 64 篇 peking univ peop...
  • 61 篇 univ oxford oxfo...
  • 59 篇 peng cheng labor...

作者

  • 80 篇 van gool luc
  • 71 篇 timofte radu
  • 65 篇 zhang lei
  • 43 篇 luc van gool
  • 40 篇 yang yi
  • 37 篇 loy chen change
  • 34 篇 li stan z.
  • 33 篇 liu yang
  • 33 篇 xiaoou tang
  • 33 篇 murino vittorio
  • 33 篇 chen chen
  • 33 篇 qi tian
  • 33 篇 li fei-fei
  • 32 篇 tian qi
  • 32 篇 sun jian
  • 30 篇 ying shan
  • 30 篇 pascal fua
  • 29 篇 darrell trevor
  • 28 篇 li xin
  • 28 篇 hanqing lu

语言

  • 23,162 篇 英文
  • 52 篇 其他
  • 20 篇 中文
  • 5 篇 土耳其文
  • 2 篇 日文
检索条件"任意字段=IEEE/CVF Conference on Computer Vision and Pattern Recognition"
23241 条 记 录,以下是361-370 订阅
排序:
Volumetric Environment Representation for vision-Language Navigation
Volumetric Environment Representation for Vision-Language Na...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Liu, Rui Wang, Wenguan Yan, Yi Zhejiang Univ CCAI ReLER Hangzhou Zhejiang Peoples R China
vision-language navigation (VLN) requires an agent to navigate through an 3D environment based on visual observations and natural language instructions. It is clear that the pivotal factor for successful navigation li... 详细信息
来源: 评论
Visual Program Distillation: Distilling Tools and Programmatic Reasoning into vision-Language Models
Visual Program Distillation: Distilling Tools and Programmat...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Hu, Yushi Stretcu, Otilia Lu, Chun-Ta Viswanathan, Krishnamurthy Hata, Kenji Luo, Enming Krishna, Ranjay Fuxman, Ariel Google Res Mountain View CA 94043 USA Univ Washington Seattle WA 98195 USA
Solving complex visual tasks such as "Who invented the musical instrument on the right?" involves a composition of skills: understanding space, recognizing instruments, and also retrieving prior knowledge. R... 详细信息
来源: 评论
Co-designing a Sub-millisecond Latency Event-based Eye Tracking System with Submanifold Sparse CNN
Co-designing a Sub-millisecond Latency Event-based Eye Track...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Zhang, Baoheng Gao, Yizhao Li, Jingyuan So, Hayden Kwok-Hay Univ Hong Kong Hong Kong Peoples R China
Eye-tracking technology is integral to numerous consumer electronics applications, particularly in the realm of virtual and augmented reality (VR/AR). These applications demand solutions that excel in three crucial as... 详细信息
来源: 评论
EgoGen: An Egocentric Synthetic Data Generator
EgoGen: An Egocentric Synthetic Data Generator
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Li, Gen Zhao, Kaifeng Zhang, Siwei Lyu, Xiaozhong Dusmanu, Mihai Zhang, Yan Pollefeys, Marc Tang, Siyu Swiss Fed Inst Technol Zurich Switzerland Microsoft Redmond WA USA
Understanding the world in first-person view is fundamental in Augmented Reality (AR). This immersive perspective brings dramatic visual changes and unique challenges compared to third-person views. Synthetic data has... 详细信息
来源: 评论
MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
MovieChat: From Dense Token to Sparse Memory for Long Video ...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Song, Enxin Chai, Wenhao Wang, Guanhong Zhang, Yucheng Zhou, Haoyang Wu, Feiyang Chi, Haozhe Guo, Xun Ye, Tian Zhang, Yanting Lu, Yan Hwang, Jenq-Neng Wang, Gaoang Zhejiang Univ Hangzhou Peoples R China Univ Washington Seattle WA 98195 USA Microsoft Res Asia Florence Italy Hong Kong Univ Sci & Technol GZ Hong Kong Peoples R China Donghua Univ Shanghai Peoples R China Shanghai Artificial Intelligence Lab Shanghai Peoples R China
Recently, integrating video foundation models and large language models to build a video understanding system can overcome the limitations of specific pre-defined vision tasks. Yet, existing systems can only handle vi... 详细信息
来源: 评论
SCoFT: Self-Contrastive Fine-Tuning for Equitable Image Generation
SCoFT: Self-Contrastive Fine-Tuning for Equitable Image Gene...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Liu, Zhixuan Schaldenbrand, Peter Okogwu, Beverley-Claire Peng, Wenxuan Yun, Youngsik Hundt, Andrew Kim, Jihie Oh, Jean Carnegie Mellon Univ Pittsburgh PA 15213 USA Nanyang Technol Univ Singapore Singapore Dongguk Univ Seoul South Korea
Accurate representation in media is known to improve the well-being of the people who consume it. Generative image models trained on large web-crawled datasets such as LAION are known to produce images with harmful st... 详细信息
来源: 评论
MULTIFLOW: Shifting Towards Task-Agnostic vision-Language Pruning
MULTIFLOW: Shifting Towards Task-Agnostic Vision-Language Pr...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Farina, Matteo Mancini, Massimiliano Cunegatti, Elia Liu, Gaowen Iacca, Giovanni Ricci, Elisa Univ Trento Trento Italy Cisco Res Res Triangle Pk NC USA Fdn Bruno Kessler Povo Italy
While excellent in transfer learning, vision-Language models (VLMs) come with high computational costs due to their large number of parameters. To address this issue, removing parameters via model pruning is a viable ... 详细信息
来源: 评论
Sharingan: A Transformer Architecture for Multi-Person Gaze Following
Sharingan: A Transformer Architecture for Multi-Person Gaze ...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Tafasca, Samy Gupta, Anshul Odobez, Jean-Marc Idiap Res Inst Martigny Switzerland Ecole Polytech Fed Lausanne Lausanne Switzerland
Gaze is a powerful form of non-verbal communication that humans develop from an early age. As such, modeling this behavior is an important task that can benefit a broad set of application domains ranging from robotics... 详细信息
来源: 评论
VideoGrounding-DINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding
VideoGrounding-DINO: Towards Open-Vocabulary Spatio-Temporal...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Wasim, Syed Talal Naseer, Muzammal Khan, Salman Yang, Ming-Hsuan Khan, Fahad Shahbaz Mohamed Bin Zayed Univ AI Abu Dhabi U Arab Emirates Australian Natl Univ Canberra Australia Univ Calif Merced Merced CA USA Google Res Mountain View CA USA Linkoping Univ Linkoping Sweden
Video grounding aims to localize a spatio-temporal section in a video corresponding to an input text query. This paper addresses a critical limitation in current video grounding methodologies by introducing an Open-Vo... 详细信息
来源: 评论
Language-driven Grasp Detection
Language-driven Grasp Detection
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: An Dinh Vuong Minh Nhat Vu Baoru Huang Nghia Nguyen Hieu Le Thieu Vo Anh Nguyen FPT Software AI Ctr Hanoi Vietnam TU Wien Automat Control Inst Vienna Austria Imperial Coll London London England Ton Duc Thang Univ Ho Chi Minh City Vietnam Univ Liverpool Liverpool Merseyside England
Grasp detection is a persistent and intricate challenge with various industrial applications. Recently, many methods and datasets have been proposed to tackle the grasp detection problem. However, most of them do not ... 详细信息
来源: 评论