咨询与建议

限定检索结果

文献类型

  • 20,860 篇 会议
  • 105 篇 期刊文献
  • 14 册 图书

馆藏范围

  • 20,978 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 13,620 篇 工学
    • 11,056 篇 计算机科学与技术...
    • 2,652 篇 机械工程
    • 2,252 篇 软件工程
    • 914 篇 光学工程
    • 885 篇 电气工程
    • 529 篇 控制科学与工程
    • 477 篇 信息与通信工程
    • 216 篇 测绘科学与技术
    • 135 篇 生物工程
    • 127 篇 生物医学工程(可授...
    • 98 篇 电子科学与技术(可...
    • 92 篇 仪器科学与技术
    • 46 篇 安全科学与工程
    • 40 篇 建筑学
    • 40 篇 化学工程与技术
    • 39 篇 土木工程
    • 37 篇 交通运输工程
    • 35 篇 力学(可授工学、理...
    • 33 篇 航空宇航科学与技...
  • 3,494 篇 医学
    • 3,489 篇 临床医学
    • 32 篇 基础医学(可授医学...
  • 2,247 篇 理学
    • 1,145 篇 物理学
    • 1,081 篇 数学
    • 401 篇 生物学
    • 384 篇 统计学(可授理学、...
    • 245 篇 系统科学
    • 46 篇 化学
  • 343 篇 管理学
    • 176 篇 管理科学与工程(可...
    • 168 篇 图书情报与档案管...
    • 34 篇 工商管理
  • 31 篇 法学
  • 19 篇 农学
  • 15 篇 教育学
  • 8 篇 经济学
  • 5 篇 艺术学
  • 2 篇 军事学
  • 1 篇 文学

主题

  • 8,141 篇 computer vision
  • 2,886 篇 training
  • 2,826 篇 pattern recognit...
  • 1,809 篇 computational mo...
  • 1,715 篇 visualization
  • 1,493 篇 cameras
  • 1,433 篇 three-dimensiona...
  • 1,433 篇 feature extracti...
  • 1,366 篇 shape
  • 1,360 篇 face recognition
  • 1,243 篇 image segmentati...
  • 1,135 篇 robustness
  • 1,124 篇 semantics
  • 992 篇 computer archite...
  • 985 篇 object detection
  • 982 篇 layout
  • 959 篇 benchmark testin...
  • 935 篇 codes
  • 900 篇 computer science
  • 898 篇 object recogniti...

机构

  • 174 篇 univ sci & techn...
  • 158 篇 univ chinese aca...
  • 153 篇 carnegie mellon ...
  • 145 篇 chinese univ hon...
  • 109 篇 microsoft resear...
  • 103 篇 zhejiang univ pe...
  • 99 篇 swiss fed inst t...
  • 93 篇 tsinghua univers...
  • 91 篇 tsinghua univ pe...
  • 90 篇 microsoft res as...
  • 88 篇 shanghai ai lab ...
  • 81 篇 zhejiang univers...
  • 77 篇 alibaba grp peop...
  • 74 篇 hong kong univ s...
  • 73 篇 university of sc...
  • 72 篇 peking univ peop...
  • 72 篇 university of ch...
  • 68 篇 shanghai jiao to...
  • 66 篇 univ oxford oxfo...
  • 65 篇 google res mount...

作者

  • 80 篇 van gool luc
  • 70 篇 zhang lei
  • 58 篇 timofte radu
  • 48 篇 yang yi
  • 47 篇 luc van gool
  • 46 篇 xiaoou tang
  • 44 篇 tian qi
  • 43 篇 darrell trevor
  • 42 篇 loy chen change
  • 42 篇 sun jian
  • 41 篇 qi tian
  • 40 篇 li stan z.
  • 38 篇 li fei-fei
  • 37 篇 chen xilin
  • 36 篇 shan shiguang
  • 35 篇 zhou jie
  • 35 篇 vasconcelos nuno
  • 35 篇 liu yang
  • 35 篇 torralba antonio
  • 34 篇 liu xiaoming

语言

  • 20,953 篇 英文
  • 10 篇 中文
  • 7 篇 其他
  • 5 篇 土耳其文
  • 2 篇 日文
  • 2 篇 葡萄牙文
检索条件"任意字段=2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016"
20979 条 记 录,以下是311-320 订阅
排序:
EventDance: Unsupervised Source-free Cross-modal Adaptation for Event-based Object recognition
EventDance: Unsupervised Source-free Cross-modal Adaptation ...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Zheng, Xu Wang, Lin HKUST GZ AI Thrust Guangzhou Peoples R China HKUST Dept CSE Guangzhou Peoples R China
In this paper, we make the first attempt at achieving the cross-modal (i.e., image-to-events) adaptation for event-based object recognition without accessing any labeled source image data owning to privacy and commerc...
来源: 评论
MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
MovieChat: From Dense Token to Sparse Memory for Long Video ...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Song, Enxin Chai, Wenhao Wang, Guanhong Zhang, Yucheng Zhou, Haoyang Wu, Feiyang Chi, Haozhe Guo, Xun Ye, Tian Zhang, Yanting Lu, Yan Hwang, Jenq-Neng Wang, Gaoang Zhejiang Univ Hangzhou Peoples R China Univ Washington Seattle WA 98195 USA Microsoft Res Asia Florence Italy Hong Kong Univ Sci & Technol GZ Hong Kong Peoples R China Donghua Univ Shanghai Peoples R China Shanghai Artificial Intelligence Lab Shanghai Peoples R China
Recently, integrating video foundation models and large language models to build a video understanding system can overcome the limitations of specific pre-defined vision tasks. Yet, existing systems can only handle vi... 详细信息
来源: 评论
StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
StableVITON: Learning Semantic Correspondence with Latent Di...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Kim, Jeongho Gu, Gyojung Park, Minho Park, Sunghyun Choo, Jaegul Korea Adv Inst Sci & Technol Daejeon South Korea
Given a clothing image and a person image, an image-based virtual try-on aims to generate a customized image that appears natural and accurately reflects the characteristics of the clothing image. In this work, we aim... 详细信息
来源: 评论
GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis
GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Zheng, Shunyuan Zhou, Boyao Shao, Ruizhi Liu, Boning Zhang, Shengping Nie, Liqiang Liu, Yebin Harbin Inst Technol Harbin Peoples R China Tsinghua Univ Beijing Peoples R China Peng Cheng Lab Shenzhen Peoples R China
We present a new approach, termed GPS-Gaussian, for synthesizing novel views of a character in a real-time manner. The proposed method enables 2K-resolution rendering under a sparse-view camera setting. Unlike the ori... 详细信息
来源: 评论
SaCo Loss: Sample-wise Affinity Consistency for vision-Language Pre-training
SaCo Loss: Sample-wise Affinity Consistency for Vision-Langu...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Wu, Sitong Tan, Haoru Tian, Zhuotao Chen, Yukang Qi, Xiaojuan Jia, Jiaya CUHK Hong Kong Peoples R China HKU Hong Kong Peoples R China SmartMore Hong Kong Peoples R China
vision-language pre-training (VLP) aims to learn joint representations of vision and language modalities. The contrastive paradigm is currently dominant in this field. However, we observe a notable misalignment phenom... 详细信息
来源: 评论
PIGEON: Predicting Image Geolocations
PIGEON: Predicting Image Geolocations
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Haas, Lukas Skreta, Michal Alberti, Silas Finn, Chelsea Stanford Univ Stanford CA 94305 USA
Planet-scale image geolocalization remains a challenging problem due to the diversity of images originating from anywhere in the world. Although approaches based on vision transformers have made significant progress i... 详细信息
来源: 评论
DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses
DVMNet: Computing Relative Pose for Unseen Objects Beyond Hy...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Zhao, Chen Zhang, Tong Dang, Zheng Salzmann, Mathieu Ecole Polytech Fed Lausanne Lausanne Switzerland ClearSpace SA Renens Switzerland
Determining the relative pose of an object between two images is pivotal to the success of generalizable object pose estimation. Existing approaches typically approximate the continuous pose representation with a larg... 详细信息
来源: 评论
Progressive Semantic-Guided vision Transformer for Zero-Shot Learning
Progressive Semantic-Guided Vision Transformer for Zero-Shot...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Chen, Shiming Hou, Wenjin Khan, Salman Khan, Fahad Shahbaz Mohamed Bin Zayed Univ AI Abu Dhabi U Arab Emirates Huazhong Univ Sci & Technol Wuhan Peoples R China Australian Natl Univ Canberra ACT Australia Linkoping Univ Linkoping Sweden
Zero-shot learning (ZSL) recognizes the unseen classes by conducting visual-semantic interactions to transfer semantic knowledge from seen classes to unseen ones, supported by semantic information (e.g., attributes). ... 详细信息
来源: 评论
Visual Program Distillation: Distilling Tools and Programmatic Reasoning into vision-Language Models
Visual Program Distillation: Distilling Tools and Programmat...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Hu, Yushi Stretcu, Otilia Lu, Chun-Ta Viswanathan, Krishnamurthy Hata, Kenji Luo, Enming Krishna, Ranjay Fuxman, Ariel Google Res Mountain View CA 94043 USA Univ Washington Seattle WA 98195 USA
Solving complex visual tasks such as "Who invented the musical instrument on the right?" involves a composition of skills: understanding space, recognizing instruments, and also retrieving prior knowledge. R... 详细信息
来源: 评论
SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large vision Language Models
SC-Tune: Unleashing Self-Consistent Referential Comprehensio...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Yue, Tongtian Cheng, Jie Guo, Longteng Dai, Xingyuan Zhao, Zijia He, Xingjian Xiong, Gang Lv, Yisheng Liu, Jing CASIA Lab Cognit & Decis Intelligence Complex Syst Beijing Peoples R China CASIA State Key Lab Multimodal Artificial Intelligence Beijing Peoples R China Univ Chinese Acad Sci Sch Artificial Intelligence Beijing Peoples R China
Recent trends in Large vision Language Models (LVLMs) research have been increasingly focusing on advancing beyond general image understanding towards more nuanced, object-level referential comprehension. In this pape...
来源: 评论