Search query: "Any field = 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016"
20,979 records; showing results 341-350
Equivariant Multi-Modality Image Fusion
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Zhao, Zixiang; Bai, Haowen; Zhang, Jiangshe; Zhang, Yulun; Zhang, Kai; Xu, Shuang; Chen, Dongdong; Timofte, Radu; Van Gool, Luc
Affiliations: Xi An Jiao Tong Univ Xian Peoples R China; Swiss Fed Inst Technol Zurich Switzerland; Shanghai Jiao Tong Univ Shanghai Peoples R China; Nanjing Univ Nanjing Peoples R China; Northwestern Polytech Univ Xian Peoples R China; Heriot Watt Univ Edinburgh Midlothian Scotland; Univ Wurzburg Wurzburg Germany; INSAIT Sofia Bulgaria
Multi-modality image fusion is a technique that combines information from different sensors or modalities, enabling the fused image to retain complementary features from each modality, such as functional highlights an...
Sharingan: A Transformer Architecture for Multi-Person Gaze Following
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Tafasca, Samy; Gupta, Anshul; Odobez, Jean-Marc
Affiliations: Idiap Res Inst Martigny Switzerland; Ecole Polytech Fed Lausanne Lausanne Switzerland
Gaze is a powerful form of non-verbal communication that humans develop from an early age. As such, modeling this behavior is an important task that can benefit a broad set of application domains ranging from robotics...
EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Language Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Cheng, Sijie; Guo, Zhicheng; Wu, Jingwen; Fang, Kechen; Li, Peng; Liu, Huaping; Liu, Yang
Affiliations: Tsinghua Univ Dept Comp Sci & Technol Beijing Peoples R China; Tsinghua Univ Inst AI Ind Res AIR Beijing Peoples R China; Univ Toronto Dept Elect & Comp Engn Toronto ON Canada; Tsinghua Univ Zhili Coll Beijing Peoples R China; 01 Ai Beijing Peoples R China
Vision-language models (VLMs) have recently shown promising results in traditional downstream tasks. Evaluation studies have emerged to assess their abilities, with the majority focusing on the third-person perspectiv...
Language-driven Grasp Detection
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: An Dinh Vuong; Minh Nhat Vu; Baoru Huang; Nghia Nguyen; Hieu Le; Thieu Vo; Anh Nguyen
Affiliations: FPT Software AI Ctr Hanoi Vietnam; TU Wien Automat Control Inst Vienna Austria; Imperial Coll London London England; Ton Duc Thang Univ Ho Chi Minh City Vietnam; Univ Liverpool Liverpool Merseyside England
Grasp detection is a persistent and intricate challenge with various industrial applications. Recently, many methods and datasets have been proposed to tackle the grasp detection problem. However, most of them do not ...
EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Xie, Hongxia; Peng, Chu-Jun; Tseng, Yu-Wen; Chen, Hung-Jen; Hsu, Chan-Feng; Shuai, Hong-Han; Cheng, Wen-Huang
Affiliations: Jilin Univ Changchun Peoples R China; Natl Taiwan Univ Taipei Taiwan; Natl Yang Ming Chiao Tung Univ Hsinchu Taiwan
Visual Instruction Tuning represents a novel learning paradigm involving the fine-tuning of pre-trained language models using task-specific instructions. This paradigm shows promising zero-shot results in various natu...
Bootstrapping SparseFormers from Vision Foundation Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Gao, Ziteng; Tong, Zhan; Lin, Kevin Qinghong; Chen, Joya; Shou, Mike Zheng
Affiliations: Natl Univ Singapore Show Lab Singapore Singapore
The recently proposed SparseFormer architecture provides an alternative approach to visual understanding by utilizing a significantly lower number of visual tokens via adjusting RoIs, greatly reducing computational co...
From Coarse to Fine-Grained Open-Set Recognition
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Lang, Nico; Snaebjarnarson, Vesteinn; Cole, Elijah; Mac Aodha, Oisin; Igel, Christian; Belongie, Serge
Affiliations: Univ Copenhagen Copenhagen Denmark; Altos Labs San Diego CA USA; Univ Edinburgh Edinburgh Midlothian Scotland
Open-set recognition (OSR) methods aim to identify whether or not a test example belongs to a category observed during training. Depending on how visually similar a test example is to the training categories, the OSR ...
PracticalDG: Perturbation Distillation on Vision-Language Models for Hybrid Domain Generalization
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Chen, Zining; Wang, Weiqiu; Zhao, Zhicheng; Su, Fei; Men, Aidong; Meng, Hongying
Affiliations: Beijing Univ Posts & Telecommun Sch Artificial Intelligence Beijing Peoples R China; Beijing Key Lab Network Syst & Network Culture Beijing Peoples R China; Minist Culture & Tourism Key Lab Interact Technol & Experience Syst Beijing Peoples R China; Brunel Univ Uxbridge Uxbridge Middx England
Domain Generalization (DG) aims to resolve distribution shifts between source and target domains, and current DG methods default to the setting that data from source and target domains share identical categories. ...
Forecasting of 3D Whole-body Human Poses with Grasping Objects
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Yan, Haitao; Cui, Qiongjie; Xie, Jiexin; Guo, Shijie
Affiliations: Fudan Univ Acad Engn & Technol Shanghai Peoples R China; Nanjing Univ Sci & Technol Nanjing Peoples R China
In the context of computer vision and human-robot interaction, forecasting 3D human poses is crucial for understanding human behavior and enhancing the predictive capabilities of intelligent systems. While existing me...
MM-Narrator: Narrating Long-form Videos with Multimodal In-Context Learning
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Zhang, Chaoyi; Lin, Kevin; Yang, Zhengyuan; Wang, Jianfeng; Li, Linjie; Lin, Chung-Ching; Liu, Zicheng; Wang, Lijuan
Affiliations: Univ Sydney Sydney NSW Australia; Microsoft Corp Redmond WA 98052 USA; Adv Micro Devices Inc Santa Clara CA USA
We present MM-Narrator, a novel system leveraging GPT-4 with multimodal in-context learning for the generation of audio descriptions (AD). Unlike previous methods that primarily focused on downstream fine-tuning with ...