Search query: "Any field = 1994 IEEE Computer-Society Conference on Computer Vision and Pattern Recognition"
22,906 records; showing results 421-430
Zero-TPrune: Zero-Shot Token Pruning through Leveraging of the Attention Graph in Pre-Trained Transformers
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Wang, Hongjie; Dedhia, Bhishma; Jha, Niraj K.
Affiliations: Princeton Univ Princeton NJ 08540 USA
Deployment of Transformer models on edge devices is becoming increasingly challenging due to the exponentially growing inference cost that scales quadratically with the number of tokens in the input sequence. Token pr...
HALLUSIONBENCH: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Guan, Tianrui; Liu, Fuxiao; Wu, Xiyang; Xian, Ruiqi; Li, Zongxia; Liu, Xiaoyu; Wang, Xijun; Chen, Lichang; Huang, Furong; Yacoob, Yaser; Manocha, Dinesh; Zhou, Tianyi
Affiliations: Univ Maryland College Pk MD 20742 USA
We introduce "HALLUSIONBENCH," a comprehensive benchmark designed for the evaluation of image-context reasoning. This benchmark presents significant challenges to advanced large visual-language models (LV...
Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Wang, Zihan; Li, Xiangyang; Yang, Jiahao; Liu, Yeqi; Hu, Junjie; Jiang, Ming; Jiang, Shuqiang
Affiliations: Chinese Acad Sci Inst Comp Technol Beijing 100190 Peoples R China; Univ Chinese Acad Sci Beijing 100049 Peoples R China; Univ Wisconsin Dept Comp Sci 1210 W Dayton St Madison WI 53706 USA; Univ Wisconsin Dept Biostat & Med Informat Madison WI USA; Indiana Univ Dept Human Ctr Comp Indianapolis IN 46204 USA
Vision-and-language navigation (VLN) enables the agent to navigate to a remote location following the natural language instruction in 3D environments. At each navigation step, the agent selects from possible candidate...
3D Human Pose Perception from Egocentric Stereo Videos
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Akada, Hiroyasu; Wang, Jian; Golyanik, Vladislav; Theobalt, Christian
Affiliations: Max Planck Inst Informat SIC Saarbrucken Germany
While head-mounted devices are becoming more compact, they provide egocentric views with significant self-occlusions of the device user. Hence, existing methods often fail to accurately estimate complex 3D poses from ...
Towards Language-Driven Video Inpainting via Multimodal Large Language Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Wu, Jianzong; Li, Xiangtai; Si, Chenyang; Zhou, Shangchen; Yang, Jingkang; Zhang, Jiangning; Li, Yining; Chen, Kai; Tong, Yunhai; Liu, Ziwei; Loy, Chen Change
Affiliations: Peking Univ Natl Key Lab Gen Artificial Intelligence Beijing Peoples R China; Nanyang Technol Univ S Lab Singapore Singapore; Shanghai AI Lab Shanghai Peoples R China; PKU Wuhan Inst Artificial Intelligence Wuhan Peoples R China; Zhejiang Univ Hangzhou Peoples R China
We introduce a new task - language-driven video inpainting, which uses natural language instructions to guide the inpainting process. This approach overcomes the limitations of traditional video inpainting methods tha...
The Neglected Tails in Vision-Language Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Parashar, Shubham; Lin, Zhiqiu; Liu, Tian; Dong, Xiangjue; Li, Yanan; Ramanan, Deva; Caverlee, James; Kong, Shu
Affiliations: Texas A&M Univ College Stn TX 77840 USA; Carnegie Mellon Univ Pittsburgh PA 15213 USA; Zhejiang Lab Hangzhou Peoples R China; Univ Macau Taipa Macao Peoples R China
Vision-language models (VLMs) excel in zero-shot recognition but their performance varies greatly across different visual concepts. For example, although CLIP achieves impressive accuracy on ImageNet (60-80%), its per...
Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Zhang, Yabin; Zhu, Wenjie; Tang, Hui; Ma, Zhiyuan; Zhou, Kaiyang; Zh, Lei
Affiliations: HKPolyU Hong Kong Peoples R China; OPPO Hong Kong Peoples R China; HKUST Hong Kong Peoples R China; HKBU Hong Kong Peoples R China
With the emergence of pre-trained vision-language models like CLIP, how to adapt them to various downstream classification tasks has garnered significant attention in recent research. The adaptation strategies can be ...
VideoGrounding-DINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Wasim, Syed Talal; Naseer, Muzammal; Khan, Salman; Yang, Ming-Hsuan; Khan, Fahad Shahbaz
Affiliations: Mohamed Bin Zayed Univ AI Abu Dhabi U Arab Emirates; Australian Natl Univ Canberra Australia; Univ Calif Merced Merced CA USA; Google Res Mountain View CA USA; Linkoping Univ Linkoping Sweden
Video grounding aims to localize a spatio-temporal section in a video corresponding to an input text query. This paper addresses a critical limitation in current video grounding methodologies by introducing an Open-Vo...
AHIVE: Anatomy-aware Hierarchical Vision Encoding for Interactive Radiology Report Retrieval
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Yan, Sixing; Cheung, William K.; Tsang, Ivor W.; Chiu, Keith; Tong, Terence M.; Cheung, Ka Chun; Seel, Simon
Affiliations: Hong Kong Baptist Univ Hong Kong Peoples R China; Agcy Sci Technol & Res CFAR Singapore Singapore; Agcy Sci Technol & Res IHPC Singapore Singapore; Nanyang Technol Univ SCSE Singapore Singapore; Univ Technol Sydney AAII Sydney NSW Australia; Queen Elizabeth Hosp Hong Kong Peoples R China; Kwong Wah Hosp Hong Kong Peoples R China; Tuen Mun Hosp Hong Kong Peoples R China; NVIDIA Corp NVIDIA AI Technol Ctr Santa Clara CA USA
Automatic radiology report generation using deep learning models has been recently explored and found promising. Neural decoders are commonly used for the report generation, where irrelevant and unfaithful contents ar...
SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Tang, Pin; Wang, Zhongdao; Wang, Gum-ling; Zheng, Jilai; Ren, Xiangxuan; Feng, Bailan; Ma, Chao
Affiliations: Shanghai Jiao Tong Univ AI Inst MoE Key Lab Artificial Intelligence Shanghai Peoples R China; Huawei Noahs Ark Lab Montreal PQ Canada
Vision-based perception for autonomous driving requires an explicit modeling of a 3D space, where 2D latent representations are mapped and subsequent 3D operators are applied. However, operating on dense latent spaces...