咨询与建议

限定检索结果

文献类型

  • 6,639 篇 会议
  • 34 篇 期刊文献
  • 5 册 图书

馆藏范围

  • 6,677 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 3,950 篇 工学
    • 3,725 篇 计算机科学与技术...
    • 1,476 篇 软件工程
    • 807 篇 光学工程
    • 323 篇 信息与通信工程
    • 240 篇 控制科学与工程
    • 206 篇 机械工程
    • 169 篇 电气工程
    • 85 篇 生物医学工程(可授...
    • 73 篇 电子科学与技术(可...
    • 70 篇 生物工程
    • 65 篇 仪器科学与技术
    • 38 篇 建筑学
    • 36 篇 土木工程
    • 34 篇 力学(可授工学、理...
    • 32 篇 航空宇航科学与技...
    • 29 篇 安全科学与工程
    • 23 篇 化学工程与技术
    • 21 篇 材料科学与工程(可...
  • 1,498 篇 理学
    • 969 篇 物理学
    • 929 篇 数学
    • 369 篇 统计学(可授理学、...
    • 136 篇 生物学
    • 40 篇 系统科学
    • 26 篇 化学
  • 210 篇 医学
    • 210 篇 临床医学
    • 23 篇 基础医学(可授医学...
  • 165 篇 管理学
    • 123 篇 图书情报与档案管...
    • 44 篇 管理科学与工程(可...
    • 29 篇 工商管理
  • 21 篇 法学
    • 21 篇 社会学
  • 10 篇 农学
  • 9 篇 教育学
  • 6 篇 经济学
  • 2 篇 军事学
  • 1 篇 艺术学

主题

  • 2,364 篇 computer vision
  • 848 篇 pattern recognit...
  • 663 篇 cameras
  • 634 篇 computer science
  • 592 篇 face recognition
  • 558 篇 layout
  • 541 篇 conferences
  • 527 篇 image segmentati...
  • 514 篇 shape
  • 454 篇 object recogniti...
  • 453 篇 robustness
  • 394 篇 humans
  • 339 篇 feature extracti...
  • 324 篇 training
  • 305 篇 object detection
  • 263 篇 image recognitio...
  • 260 篇 application soft...
  • 249 篇 lighting
  • 248 篇 computational mo...
  • 238 篇 image reconstruc...

机构

  • 44 篇 microsoft resear...
  • 27 篇 department of co...
  • 21 篇 swiss fed inst t...
  • 21 篇 school of comput...
  • 21 篇 carnegie mellon ...
  • 20 篇 department of co...
  • 19 篇 swiss fed inst t...
  • 18 篇 department of co...
  • 17 篇 department of in...
  • 17 篇 the robotics ins...
  • 17 篇 institute of com...
  • 16 篇 univ sci & techn...
  • 16 篇 robotics institu...
  • 15 篇 tsinghua univ pe...
  • 14 篇 department of el...
  • 14 篇 center for autom...
  • 14 篇 school of comput...
  • 14 篇 school of comput...
  • 13 篇 univ maryland co...
  • 13 篇 microsoft resear...

作者

  • 39 篇 timofte radu
  • 28 篇 s.k. nayar
  • 25 篇 huang thomas s.
  • 24 篇 xiaoou tang
  • 22 篇 t. kanade
  • 20 篇 chellappa rama
  • 20 篇 t.s. huang
  • 19 篇 van gool luc
  • 19 篇 nayar shree k.
  • 19 篇 t. darrell
  • 17 篇 a.k. jain
  • 17 篇 a. zisserman
  • 17 篇 heung-yeung shum
  • 17 篇 jain anil k.
  • 17 篇 zisserman andrew
  • 16 篇 g. healey
  • 16 篇 torralba antonio
  • 16 篇 l. van gool
  • 15 篇 ying wu
  • 15 篇 m. shah

语言

  • 6,668 篇 英文
  • 8 篇 中文
  • 2 篇 其他
检索条件"任意字段=2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2003"
6678 条 记 录,以下是121-130 订阅
排序:
Task Navigator: Decomposing Complex Tasks for Multimodal Large Language Models
Task Navigator: Decomposing Complex Tasks for Multimodal Lar...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Ma, Feipeng Zhou, Yizhou Zhang, Yueyi Wu, Siying Zhang, Zheyu He, Zilong Rao, Fengyun Sun, Xiaoyan Univ Sci & Technol China Hefei Peoples R China Tencent Inc WeChat Shenzhen Peoples R China Hefei Comprehens Natl Sci Ctr Inst Artificial Intelligence Hefei Peoples R China
Inspired by the remarkable progress achieved by recent Large Language Models (LLMs), Multimodal Large Language Models (MLLMs) take LLMs as their brains, and have achieved surprising results in many downstream tasks by... 详细信息
来源: 评论
Universal Guidance for Diffusion Models
Universal Guidance for Diffusion Models
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Bansal, Arpit Chu, Hong-Min Schwarzschild, Avi Sengupta, Soumyadip Goldblum, Micah Geiping, Jonas Goldstein, Tom Univ Maryland College Pk MD 20742 USA Univ North Carolina Chapel Hill Chapel Hill NC USA NYU New York NY USA
Typical diffusion models are trained to accept a particular form of conditioning, most commonly text, and cannot be conditioned on other modalities without retraining. In this work, we propose a universal guidance alg... 详细信息
来源: 评论
ViTA: An Efficient Video-to-Text Algorithm using VLM for RAG-based Video Analysis System
ViTA: An Efficient Video-to-Text Algorithm using VLM for RAG...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Arefeen, Md Adnan Debnath, Biplob Uddin, Md Yusuf Sarwar Chakradhar, Srimat NEC Labs Amer Princeton NJ 08540 USA Univ Missouri Kansas City MO 64110 USA
Retrieval-augmented generation (RAG) is used in natural language processing (NLP) to provide query-relevant information in enterprise documents to large language models (LLMs). Such enterprise context enables the LLMs... 详细信息
来源: 评论
Scattering Prompt Tuning: A Fine-tuned Foundation Model for SAR Object recognition
Scattering Prompt Tuning: A Fine-tuned Foundation Model for ...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Guo, Weilong Li, Shengyang Yang, Jian Chinese Acad Sci Key Lab Space Utilizat Beijing 100864 Peoples R China Chinese Acad Sci Technol & Engn Ctr Space Utilizat Beijing 100864 Peoples R China Univ Chinese Acad Sci Beijing Peoples R China
Synthetic Aperture Radar (SAR) serves as a vital tool in various earth observation applications, providing robust imaging under challenging weather conditions. While the fine-tuned foundation models excel in many down... 详细信息
来源: 评论
Extending global-local view alignment for self-supervised learning with remote sensing imagery
Extending global-local view alignment for self-supervised le...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Wanyan, Xinye Seneviratne, Sachith Shen, Shuchang Kirley, Michael
Since large number of high-quality remote sensing images are readily accessible, exploiting the corpus of images with less manual annotation draws increasing attention. Self-supervised models acquire general feature r... 详细信息
来源: 评论
Photo-Realistic Image Restoration in the Wild with Controlled vision-Language Models
Photo-Realistic Image Restoration in the Wild with Controlle...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Luo, Ziwei Gustafsson, Fredrik K. Zhao, Zheng Sjolund, Jens Schon, Thomas B. Uppsala Univ Uppsala Sweden Karolinska Inst Stockholm Sweden
Though diffusion models have been successfully applied to various image restoration (IR) tasks, their performance is sensitive to the choice of training datasets. Typically, diffusion models trained in specific datase... 详细信息
来源: 评论
DiCo-NeRF: Difference of Cosine Similarity for Neural Rendering of Fisheye Driving Scenes
DiCo-NeRF: Difference of Cosine Similarity for Neural Render...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Choi, Jiho Hwang, Gyutae Lee, Sang Jun Jeonbuk Natl Univ Jeonju South Korea
Neural radiance fields have emerged in the field of autonomous driving, which contributes to improve perception of the complex 3D environment through the reconstruction of geometry and appearance. Moving objects and s... 详细信息
来源: 评论
Generalized Single-Image-Based Morphing Attack Detection Using Deep Representations from vision Transformer
Generalized Single-Image-Based Morphing Attack Detection Usi...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Zhang, Haoyu Ramachandra, Raghavendra Raja, Kiran Busch, Christoph Norwegian Univ Sci & Technol Trondheim Norway Darmstadt Univ Appl Sci Darmstadt Germany
Face morphing attacks have posed severe threats to Face recognition Systems (FRS), which are operated in border control and passport issuance use cases. Correspondingly, morphing attack detection algorithms (MAD) are ... 详细信息
来源: 评论
Divide and Conquer Boosting for Enhanced Traffic Safety Description and Analysis with Large vision Language Model
Divide and Conquer Boosting for Enhanced Traffic Safety Desc...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Khai Trinh Xuan Khoi Nguyen Nguyen Bach Hoang Ngo Vu Dinh Xuan Minh-Hung An Quang-Vinh Dinh Ho Chi Minh City Univ Technol VNU HCM Ho Chi Minh City Vietnam Univ Sci VNU HCM Ho Chi Minh City Vietnam Univ Informat Technol VNU HCM Ho Chi Minh City Vietnam FPT Telecom Hanoi Vietnam AI Lab AI VIETNAM Ho Chi Minh City Vietnam Vietnam Natl Univ Ho Chi Minh City Ho Chi Minh City Vietnam
The increasing complexity of traffic dynamics has underscored the necessity for advanced traffic safety description and analysis, challenging the efficacy of current methodologies in comprehensively understanding and ... 详细信息
来源: 评论
Sat2Cap: Mapping Fine-Grained Textual Descriptions from Satellite Images
Sat2Cap: Mapping Fine-Grained Textual Descriptions from Sate...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Dhakal, Aayush Ahmad, Adeel Khanal, Subash Sastry, Srikumar Kerner, Hannah Jacobs, Nathan Washington Univ St Louis MO 63110 USA Taylor Geospatial Inst St Louis MO USA Arizona State Univ Tempe AZ 85287 USA
We propose a weakly supervised approach for creating maps using free-form textual descriptions. We refer to this work of creating textual maps as zero-shot mapping. Prior works have approached mapping tasks by develop... 详细信息
来源: 评论