咨询与建议

限定检索结果

文献类型

  • 20,860 篇 会议
  • 104 篇 期刊文献
  • 43 册 图书

馆藏范围

  • 21,006 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 13,619 篇 工学
    • 11,055 篇 计算机科学与技术...
    • 2,652 篇 机械工程
    • 2,252 篇 软件工程
    • 914 篇 光学工程
    • 884 篇 电气工程
    • 529 篇 控制科学与工程
    • 477 篇 信息与通信工程
    • 216 篇 测绘科学与技术
    • 135 篇 生物工程
    • 127 篇 生物医学工程(可授...
    • 98 篇 电子科学与技术(可...
    • 92 篇 仪器科学与技术
    • 46 篇 安全科学与工程
    • 40 篇 建筑学
    • 40 篇 化学工程与技术
    • 39 篇 土木工程
    • 37 篇 交通运输工程
    • 35 篇 力学(可授工学、理...
    • 33 篇 航空宇航科学与技...
  • 3,494 篇 医学
    • 3,489 篇 临床医学
    • 32 篇 基础医学(可授医学...
  • 2,247 篇 理学
    • 1,145 篇 物理学
    • 1,081 篇 数学
    • 401 篇 生物学
    • 384 篇 统计学(可授理学、...
    • 245 篇 系统科学
    • 46 篇 化学
  • 343 篇 管理学
    • 176 篇 管理科学与工程(可...
    • 168 篇 图书情报与档案管...
    • 34 篇 工商管理
  • 31 篇 法学
  • 19 篇 农学
  • 15 篇 教育学
  • 8 篇 经济学
  • 5 篇 艺术学
  • 2 篇 军事学
  • 1 篇 文学

主题

  • 8,140 篇 computer vision
  • 2,886 篇 training
  • 2,840 篇 pattern recognit...
  • 1,809 篇 computational mo...
  • 1,715 篇 visualization
  • 1,492 篇 cameras
  • 1,433 篇 three-dimensiona...
  • 1,433 篇 feature extracti...
  • 1,366 篇 shape
  • 1,360 篇 face recognition
  • 1,243 篇 image segmentati...
  • 1,135 篇 robustness
  • 1,124 篇 semantics
  • 992 篇 computer archite...
  • 984 篇 object detection
  • 982 篇 layout
  • 959 篇 benchmark testin...
  • 935 篇 codes
  • 899 篇 computer science
  • 898 篇 object recogniti...

机构

  • 174 篇 univ sci & techn...
  • 158 篇 univ chinese aca...
  • 153 篇 carnegie mellon ...
  • 145 篇 chinese univ hon...
  • 109 篇 microsoft resear...
  • 103 篇 zhejiang univ pe...
  • 99 篇 swiss fed inst t...
  • 95 篇 tsinghua univers...
  • 90 篇 microsoft res as...
  • 90 篇 tsinghua univ pe...
  • 88 篇 shanghai ai lab ...
  • 81 篇 zhejiang univers...
  • 77 篇 alibaba grp peop...
  • 74 篇 hong kong univ s...
  • 73 篇 university of sc...
  • 72 篇 peking univ peop...
  • 72 篇 university of ch...
  • 68 篇 shanghai jiao to...
  • 66 篇 univ oxford oxfo...
  • 65 篇 google res mount...

作者

  • 80 篇 van gool luc
  • 70 篇 zhang lei
  • 58 篇 timofte radu
  • 48 篇 yang yi
  • 47 篇 luc van gool
  • 46 篇 xiaoou tang
  • 44 篇 tian qi
  • 43 篇 darrell trevor
  • 42 篇 loy chen change
  • 42 篇 sun jian
  • 41 篇 qi tian
  • 40 篇 li stan z.
  • 38 篇 li fei-fei
  • 37 篇 chen xilin
  • 36 篇 shan shiguang
  • 35 篇 zhou jie
  • 35 篇 vasconcelos nuno
  • 35 篇 liu yang
  • 35 篇 torralba antonio
  • 34 篇 liu xiaoming

语言

  • 20,981 篇 英文
  • 10 篇 中文
  • 7 篇 其他
  • 5 篇 土耳其文
  • 2 篇 日文
  • 2 篇 葡萄牙文
检索条件"任意字段=2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016"
21007 条 记 录,以下是1211-1220 订阅
排序:
Generalized Decoding for Pixel, Image, and Language
Generalized Decoding for Pixel, Image, and Language
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Zou, Xueyan Dou, Zi-Yi Yang, Jianwei Gan, Zhe Li, Linjie Li, Chunyuan Dai, Xiyang Behl, Harkirat Wang, Jianfeng Yuan, Lu Peng, Nanyun Wang, Lijuan Lee, Yong Jae Gao, Jianfeng Univ Wisconsin Madison Madison WI 53706 USA UCLA Los Angeles CA 90024 USA Microsoft Res Redmond Redmond WA USA Microsoft Cloud & AI Redmond WA USA
We present X-Decoder, a generalized decoding model that can predict pixel-level segmentation and language tokens seamlessly. X-Decoder takes as input two types of queries: (i) generic non-semantic queries and (ii) sem... 详细信息
来源: 评论
3D Concept Learning and Reasoning from Multi-View Images
3D Concept Learning and Reasoning from Multi-View Images
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Hong, Yining Lin, Chunru Du, Yilun Chen, Zhenfang Tenenbaum, Joshua B. Gan, Chuang UCLA Los Angeles CA 90095 USA Shanghai Jiao Tong Univ Shanghai Peoples R China MIT CSAIL Cambridge MA USA UMass Amherst Amherst MA USA MIT IBM Watson Lab Cambridge MA USA
Humans are able to accurately reason in 3D by gathering multi-view observations of the surrounding world. Inspired by this insight, we introduce a new large-scale benchmark for 3D multi-view visual question answering ... 详细信息
来源: 评论
Uni-Perceiver v2: A Generalist Model for Large-Scale vision and vision-Language Tasks
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision ...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Li, Hao Zhu, Jinguo Jiang, Xiaohu Zhu, Xizhou Li, Hongsheng Yuan, Chun Wang, Xiaohua Qiao, Yu Wang, Xiaogang Wang, Wenhai Dai, Jifeng Chinese Univ Hong Kong CUHK SenseTime Joint Lab Hong Kong Peoples R China Xi An Jiao Tong Univ Xian Peoples R China Tsinghua Univ SIGS Beijing Peoples R China SenseTime Res Beijing Peoples R China Tsinghua Univ Beijing Peoples R China Shanghai Artificial Intelligence Lab Shanghai Peoples R China
Despite the remarkable success of foundation models, their task-specific fine-tuning paradigm makes them inconsistent with the goal of general perception modeling. The key to eliminating this inconsistency is to use g... 详细信息
来源: 评论
Hierarchical Semantic Correspondence Networks for Video Paragraph Grounding
Hierarchical Semantic Correspondence Networks for Video Para...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Tan, Chaolei Lin, Zihang Hu, Jian-Fang Zheng, Wei-Shi Lai, Jianhuang Sun Yat Sen Univ Sch Comp Sci & Engn Guangzhou Peoples R China Guangdong Prov Key Lab Informat Secur Technol Guangzhou Peoples R China Minist Educ Key Lab Machine Intelligence & Adv Comp Beijing Peoples R China
Video Paragraph Grounding (VPG) is an essential yet challenging task in vision-language understanding, which aims to jointly localize multiple events from an untrimmed video with a paragraph query description. One of ... 详细信息
来源: 评论
Position-guided Text Prompt for vision-Language Pre-training
Position-guided Text Prompt for Vision-Language Pre-training
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Wang, Jinpeng Zhou, Pan Shou, Mike Zheng Yan, Shuicheng Sea AI Lab Singapore Singapore Natl Univ Singapore Show Lab Singapore Singapore
vision-Language Pre-Training (VLP) has shown promising capabilities to align image and text pairs, facilitating a broad variety of cross-modal learning tasks. However, we observe that VLP models often lack the visual ... 详细信息
来源: 评论
Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking
Observation-Centric SORT: Rethinking SORT for Robust Multi-O...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Cao, Jinkun Pang, Jiangmiao Weng, Xinshuo Khirodkar, Rawal Kitani, Kris Carnegie Mellon Univ Pittsburgh PA 15213 USA Shanghai AI Lab Shanghai Peoples R China Nvidia Santa Clara CA USA
Kalman filter (KF) based methods for multi-object tracking (MOT) make an assumption that objects move linearly. While this assumption is acceptable for very short periods of occlusion, linear estimates of motion for p... 详细信息
来源: 评论
BUFFER: Balancing Accuracy, Efficiency, and Generalizability in Point Cloud Registration
BUFFER: Balancing Accuracy, Efficiency, and Generalizability...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Ao, Sheng Hu, Qingyong Wang, Hanyun Xu, Kai Guo, Yulan Sun Yat Sen Univ Shenzhen Campus Shenzhen Peoples R China Univ Oxford Oxford England Informat Engn Univ Zhengzhou Peoples R China Natl Univ Def Technol Changsha Peoples R China
An ideal point cloud registration framework should have superior accuracy, acceptable efficiency, and strong generalizability. However, this is highly challenging since existing registration techniques are either not ... 详细信息
来源: 评论
A Unified HDR Imaging Method with Pixel and Patch Level
A Unified HDR Imaging Method with Pixel and Patch Level
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Yan, Qingsen Chen, Weiye Zhang, Song Zhu, Yu Sun, Jinqiu Zhang, Yanning Northwestern Polytech Univ Xian Peoples R China Xidian Univ Xian Peoples R China
Mapping Low Dynamic Range (LDR) images with different exposures to High Dynamic Range (HDR) remains non-trivial and challenging on dynamic scenes due to ghosting caused by object motion or camera jitting. With the suc... 详细信息
来源: 评论
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
Vid2Seq: Large-Scale Pretraining of a Visual Language Model ...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Yang, Antoine Nagrani, Arsha Seo, Paul Hongsuck Miech, Antoine Pont-Tuset, Jordi Laptev, Ivan Sivic, Josef Schmid, Cordelia Google Res Mountain View CA USA Inria Paris Paris France PSL Res Univ CNRS Dept Informat ENS Paris France DeepMind London England Czech Tech Univ Czech Inst Informat Robot & Cybernet Prague Czech Republic Google Mountain View CA USA
In this work, we introduce Vid2Seq, a multi-modal single-stage dense event captioning model pretrained on narrated videos which are readily-available at scale. The Vid2Seq architecture augments a language model with s... 详细信息
来源: 评论
FCC: Feature Clusters Compression for Long-Tailed Visual recognition
FCC: Feature Clusters Compression for Long-Tailed Visual Rec...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Li, Jian Meng, Ziyao Shi, Daqian Song, Rui Diao, Xiaolei Wang, Jingwen Xu, Hao Jilin Univ Changchun Peoples R China Univ Minho Braga Portugal Univ Toronto Toronto ON Canada
Deep Neural Networks (DNNs) are rather restrictive in long-tailed data, since they commonly exhibit an under-representation for minority classes. Various remedies have been proposed to tackle this problem from differe... 详细信息
来源: 评论