Search query: "Any field = 2011 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2011"
21,180 records; showing 831-840
Task Navigator: Decomposing Complex Tasks for Multimodal Large Language Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Ma, Feipeng; Zhou, Yizhou; Zhang, Yueyi; Wu, Siying; Zhang, Zheyu; He, Zilong; Rao, Fengyun; Sun, Xiaoyan
Affiliations: Univ Sci & Technol China Hefei Peoples R China; Tencent Inc WeChat Shenzhen Peoples R China; Hefei Comprehens Natl Sci Ctr Inst Artificial Intelligence Hefei Peoples R China
Inspired by the remarkable progress achieved by recent Large Language Models (LLMs), Multimodal Large Language Models (MLLMs) take LLMs as their brains, and have achieved surprising results in many downstream tasks by...
Aligning Bag of Regions for Open-Vocabulary Object Detection
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Wu, Size; Zhang, Wenwei; Jin, Sheng; Liu, Wentao; Loy, Chen Change
Affiliations: Nanyang Technol Univ S Lab Singapore Singapore; Univ Hong Kong Hong Kong Peoples R China; SenseTime Res & Tetras AI Shenzhen Peoples R China; Shanghai AI Lab Shanghai Peoples R China
Pre-trained vision-language models (VLMs) learn to align vision and language representations on large-scale datasets, where each image-text pair usually contains a bag of semantic concepts. However, existing open-voca...
Multivariate, Multi-frequency and Multimodal: Rethinking Graph Neural Networks for Emotion Recognition in Conversation
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Chen, Feiyu; Shao, Jie; Zhu, Shuyuan; Shen, Heng Tao
Affiliations: Univ Elect Sci & Technol China Chengdu Peoples R China; Sichuan Artificial Intelligence Res Inst Yibin Peoples R China
Complex relationships of high arity across modality and context dimensions is a critical challenge in the Emotion Recognition in Conversation (ERC) task. Yet, previous works tend to encode multimodal and contextual re...
Learning to Render Novel Views from Wide-Baseline Stereo Pairs
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Du, Yilun; Smith, Cameron; Tewari, Ayush; Sitzmann, Vincent
Affiliations: MIT CSAIL Cambridge MA 02139 USA
We introduce a method for novel view synthesis given only a single wide-baseline stereo image pair. In this challenging regime, 3D scene points are regularly observed only once, requiring prior-based reconstruction of...
Universal Guidance for Diffusion Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Bansal, Arpit; Chu, Hong-Min; Schwarzschild, Avi; Sengupta, Soumyadip; Goldblum, Micah; Geiping, Jonas; Goldstein, Tom
Affiliations: Univ Maryland College Pk MD 20742 USA; Univ North Carolina Chapel Hill Chapel Hill NC USA; NYU New York NY USA
Typical diffusion models are trained to accept a particular form of conditioning, most commonly text, and cannot be conditioned on other modalities without retraining. In this work, we propose a universal guidance alg...
Feature Representation Learning with Adaptive Displacement Generation and Transformer Fusion for Micro-Expression Recognition
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Zhai, Zhijun; Zhao, Jianhui; Long, Chengjiang; Xu, Wenju; He, Shuangjiang; Zhao, Huijuan
Affiliations: Wuhan Univ Sch Comp Sci Wuhan Hubei Peoples R China; Meta Real Labs Burlingame CA USA; InnoPeak Technol Inc OPPO US Res Ctr Palo Alto CA USA; FiberHome Telecommun Technol Co Ltd Wuhan Hubei Peoples R China
Micro-expressions are spontaneous, rapid and subtle facial movements that can neither be forged nor suppressed. They are very important nonverbal communication clues, but are transient and of low intensity thus diffic...
ViTA: An Efficient Video-to-Text Algorithm using VLM for RAG-based Video Analysis System
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Arefeen, Md Adnan; Debnath, Biplob; Uddin, Md Yusuf Sarwar; Chakradhar, Srimat
Affiliations: NEC Labs Amer Princeton NJ 08540 USA; Univ Missouri Kansas City MO 64110 USA
Retrieval-augmented generation (RAG) is used in natural language processing (NLP) to provide query-relevant information in enterprise documents to large language models (LLMs). Such enterprise context enables the LLMs...
Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Tan, Cheng; Gao, Zhangyang; Wu, Lirong; Xu, Yongjie; Xia, Jun; Li, Siyuan; Li, Stan Z.
Affiliations: Westlake Univ AI Lab Res Ctr Ind Future Hangzhou Peoples R China
Spatiotemporal predictive learning aims to generate future frames by learning from historical frames. In this paper, we investigate existing methods and present a general framework of spatiotemporal predictive learnin...
GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Huo, Jingyang; Sun, Qiang; Jiang, Boyan; Lin, Haitao; Fu, Yanwei
Affiliations: Fudan Univ Shanghai Peoples R China
Most existing works solving Room-to-Room VLN problem only utilize RGB images and do not consider local context around candidate views, which lack sufficient visual cues about surrounding environment. Moreover, natural...
D2Former: Jointly Learning Hierarchical Detectors and Contextual Descriptors via Agent-based Transformers
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: He, Jianfeng; Gao, Yuan; Zhang, Tianzhu; Zhang, Zhe; Wu, Feng
Affiliations: Univ Sci & Technol China Hefei Peoples R China; Deep Space Explorat Lab Hefei Peoples R China
Establishing pixel-level matches between image pairs is vital for a variety of computer vision applications. However, achieving robust image matching remains challenging because CNN extracted descriptors usually lack ...