咨询与建议

限定检索结果

文献类型

  • 12,844 篇 会议
  • 13 篇 期刊文献
  • 2 册 图书

馆藏范围

  • 12,859 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 7,573 篇 工学
    • 6,863 篇 计算机科学与技术...
    • 880 篇 机械工程
    • 814 篇 软件工程
    • 435 篇 控制科学与工程
    • 360 篇 光学工程
    • 306 篇 电气工程
    • 209 篇 仪器科学与技术
    • 124 篇 信息与通信工程
    • 91 篇 生物工程
    • 62 篇 生物医学工程(可授...
    • 39 篇 电子科学与技术(可...
    • 34 篇 安全科学与工程
    • 26 篇 化学工程与技术
    • 21 篇 交通运输工程
    • 20 篇 建筑学
    • 18 篇 土木工程
  • 2,957 篇 医学
    • 2,956 篇 临床医学
    • 15 篇 基础医学(可授医学...
    • 12 篇 药学(可授医学、理...
  • 700 篇 理学
    • 359 篇 物理学
    • 225 篇 数学
    • 175 篇 系统科学
    • 95 篇 统计学(可授理学、...
    • 93 篇 生物学
    • 22 篇 化学
  • 201 篇 艺术学
    • 201 篇 设计学(可授艺术学...
  • 84 篇 管理学
    • 59 篇 图书情报与档案管...
    • 25 篇 管理科学与工程(可...
    • 14 篇 工商管理
  • 23 篇 法学
    • 21 篇 社会学
  • 5 篇 农学
  • 4 篇 教育学
  • 2 篇 经济学
  • 1 篇 军事学

主题

  • 6,464 篇 computer vision
  • 2,688 篇 training
  • 2,437 篇 pattern recognit...
  • 1,780 篇 computational mo...
  • 1,522 篇 visualization
  • 1,348 篇 three-dimensiona...
  • 1,091 篇 computer archite...
  • 1,063 篇 semantics
  • 997 篇 benchmark testin...
  • 976 篇 codes
  • 970 篇 conferences
  • 854 篇 feature extracti...
  • 830 篇 cameras
  • 771 篇 task analysis
  • 707 篇 deep learning
  • 646 篇 image segmentati...
  • 611 篇 object detection
  • 595 篇 shape
  • 554 篇 transformers
  • 538 篇 neural networks

机构

  • 132 篇 univ sci & techn...
  • 122 篇 carnegie mellon ...
  • 120 篇 tsinghua univ pe...
  • 114 篇 univ chinese aca...
  • 113 篇 chinese univ hon...
  • 94 篇 tsinghua univers...
  • 91 篇 zhejiang univ pe...
  • 91 篇 swiss fed inst t...
  • 85 篇 peng cheng lab p...
  • 81 篇 university of ch...
  • 80 篇 zhejiang univers...
  • 77 篇 shanghai ai lab ...
  • 77 篇 peng cheng labor...
  • 75 篇 university of sc...
  • 69 篇 shanghai jiao to...
  • 68 篇 shanghai jiao to...
  • 67 篇 alibaba grp peop...
  • 67 篇 stanford univ st...
  • 66 篇 univ hong kong p...
  • 64 篇 sensetime res pe...

作者

  • 77 篇 timofte radu
  • 63 篇 van gool luc
  • 45 篇 zhang lei
  • 36 篇 yang yi
  • 36 篇 luc van gool
  • 34 篇 tao dacheng
  • 31 篇 loy chen change
  • 29 篇 chen chen
  • 28 篇 sun jian
  • 28 篇 qi tian
  • 25 篇 li xin
  • 24 篇 liu yang
  • 24 篇 tian qi
  • 24 篇 ying shan
  • 23 篇 wang xinchao
  • 23 篇 zha zheng-jun
  • 23 篇 boxin shi
  • 21 篇 zhou jie
  • 21 篇 vasconcelos nuno
  • 20 篇 luo ping

语言

  • 12,851 篇 英文
  • 7 篇 其他
  • 1 篇 中文
检索条件"任意字段=IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops"
12859 条 记 录,以下是381-390 订阅
GLID: Pre-training a Generalist Encoder-Decoder vision Model
GLID: Pre-training a Generalist Encoder-Decoder Vision Model
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Liu, Jihao Zheng, Jinliang Liu, Yu Li, Hongsheng CUHK MMLab Hong Kong Peoples R China SenseTime Res Hong Kong Peoples R China Shanghai AI Lab Shanghai Peoples R China CPII InnoHK Hong Kong Peoples R China Tsinghua Univ Inst AI Ind Res AIR Shanghai Peoples R China
This paper proposes a GeneraLIst encoder-Decoder (GLID) pre-training method for better handling various downstream computer vision tasks. While self-supervised pre-training approaches, e.g., Masked Autoencoder, have s... 详细信息
来源: 评论
SaCo Loss: Sample-wise Affinity Consistency for vision-Language Pre-training
SaCo Loss: Sample-wise Affinity Consistency for Vision-Langu...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Wu, Sitong Tan, Haoru Tian, Zhuotao Chen, Yukang Qi, Xiaojuan Jia, Jiaya CUHK Hong Kong Peoples R China HKU Hong Kong Peoples R China SmartMore Hong Kong Peoples R China
vision-language pre-training (VLP) aims to learn joint representations of vision and language modalities. The contrastive paradigm is currently dominant in this field. However, we observe a notable misalignment phenom... 详细信息
来源: 评论
Progressive Semantic-Guided vision Transformer for Zero-Shot Learning
Progressive Semantic-Guided Vision Transformer for Zero-Shot...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Chen, Shiming Hou, Wenjin Khan, Salman Khan, Fahad Shahbaz Mohamed Bin Zayed Univ AI Abu Dhabi U Arab Emirates Huazhong Univ Sci & Technol Wuhan Peoples R China Australian Natl Univ Canberra ACT Australia Linkoping Univ Linkoping Sweden
Zero-shot learning (ZSL) recognizes the unseen classes by conducting visual-semantic interactions to transfer semantic knowledge from seen classes to unseen ones, supported by semantic information (e.g., attributes). ... 详细信息
来源: 评论
PEM: Prototype-based Efficient MaskFormer for Image Segmentation
PEM: Prototype-based Efficient MaskFormer for Image Segmenta...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Cavagnero, Niccolo Rosi, Gabriele Cuttano, Claudia Pistilli, Francesca Ciccone, Marco Averta, Giuseppe Cermelli, Fabio Politecn Torino Turin Italy Focoos AI Rome Italy
Recent transformer-based architectures have shown impressive results in the field of image segmentation. Thanks to their flexibility, they obtain outstanding performance in multiple segmentation tasks, such as semanti... 详细信息
来源: 评论
SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large vision Language Models
SC-Tune: Unleashing Self-Consistent Referential Comprehensio...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Yue, Tongtian Cheng, Jie Guo, Longteng Dai, Xingyuan Zhao, Zijia He, Xingjian Xiong, Gang Lv, Yisheng Liu, Jing CASIA Lab Cognit & Decis Intelligence Complex Syst Beijing Peoples R China CASIA State Key Lab Multimodal Artificial Intelligence Beijing Peoples R China Univ Chinese Acad Sci Sch Artificial Intelligence Beijing Peoples R China
Recent trends in Large vision Language Models (LVLMs) research have been increasingly focusing on advancing beyond general image understanding towards more nuanced, object-level referential comprehension. In this pape...
来源: 评论
Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness
Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversari...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Wang, Sibo Zhang, Jie Yuan, Zheng Shan, Shiguang Chinese Acad Sci Inst Comp Technol Beijing Peoples R China Univ Chinese Acad Sci Beijing Peoples R China
Large-scale pre-trained vision-language models like CLIP have demonstrated impressive performance across various tasks, and exhibit remarkable zero-shot generalization capability, while they are also vulnerable to imp... 详细信息
来源: 评论
RGBD Objects in the Wild: Scaling Real-World 3D Object Learning from RGB-D Videos
RGBD Objects in the Wild: Scaling Real-World 3D Object Learn...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Xia, Hongchi Fu, Yang Liu, Sifei Wang, Xiaolong Shanghai Jiao Tong Univ Shanghai Peoples R China Univ Calif San Diego San Diego CA USA NVIDIA Santa Clara CA USA
We introduce a new RGB-D object dataset captured in the wild called WildRGB-D. Unlike most existing real-world object-centric datasets which only come with RGB capturing, the direct capture of the depth channel allows... 详细信息
来源: 评论
MMA: Multi-Modal Adapter for vision-Language Models
MMA: Multi-Modal Adapter for Vision-Language Models
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Yang, Lingxiao Zhang, Ru-Yuan Wang, Yanchen Xie, Xiaohua Sun Yat Sen Univ Guangzhou Peoples R China Shanghai Jiao Tong Univ Shanghai Peoples R China Stanford Univ Stanford CA USA
Pre-trained vision-Language Models (VLMs) have served as excellent foundation models for transfer learning in diverse downstream tasks. However, tuning VLMs for few-shot generalization tasks faces a discrimination - g... 详细信息
来源: 评论
Volumetric Environment Representation for vision-Language Navigation
Volumetric Environment Representation for Vision-Language Na...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Liu, Rui Wang, Wenguan Yan, Yi Zhejiang Univ CCAI ReLER Hangzhou Zhejiang Peoples R China
vision-language navigation (VLN) requires an agent to navigate through an 3D environment based on visual observations and natural language instructions. It is clear that the pivotal factor for successful navigation li... 详细信息
来源: 评论
SPIN: Simultaneous Perception, Interaction and Navigation
SPIN: Simultaneous Perception, Interaction and Navigation
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Uppal, Shagun Agarwal, Ananye Xiong, Haoyu Shaw, Kenneth Pathak, Deepak Carnegie Mellon Univ Pittsburgh PA 15213 USA
While there has been remarkable progress recently in the fields of manipulation and locomotion, mobile manipulation remains a long-standing challenge. Compared to locomotion or static manipulation, a mobile system mus... 详细信息
来源: 评论