咨询与建议

限定检索结果

文献类型

  • 20,994 篇 会议
  • 99 册 图书
  • 85 篇 期刊文献
  • 1 篇 学位论文

馆藏范围

  • 21,178 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 13,603 篇 工学
    • 11,179 篇 计算机科学与技术...
    • 2,631 篇 机械工程
    • 2,542 篇 软件工程
    • 990 篇 光学工程
    • 849 篇 电气工程
    • 676 篇 控制科学与工程
    • 487 篇 信息与通信工程
    • 242 篇 仪器科学与技术
    • 215 篇 测绘科学与技术
    • 159 篇 生物医学工程(可授...
    • 150 篇 生物工程
    • 139 篇 电子科学与技术(可...
    • 69 篇 安全科学与工程
    • 67 篇 化学工程与技术
    • 55 篇 建筑学
    • 53 篇 土木工程
    • 43 篇 力学(可授工学、理...
    • 41 篇 航空宇航科学与技...
  • 3,462 篇 医学
    • 3,452 篇 临床医学
    • 41 篇 基础医学(可授医学...
  • 2,483 篇 理学
    • 1,247 篇 数学
    • 1,213 篇 物理学
    • 446 篇 统计学(可授理学、...
    • 418 篇 生物学
    • 269 篇 系统科学
    • 67 篇 化学
  • 424 篇 管理学
    • 218 篇 管理科学与工程(可...
    • 217 篇 图书情报与档案管...
    • 43 篇 工商管理
  • 144 篇 艺术学
    • 142 篇 设计学(可授艺术学...
  • 41 篇 法学
  • 31 篇 农学
  • 12 篇 经济学
  • 10 篇 教育学
  • 6 篇 文学
  • 3 篇 军事学

主题

  • 8,072 篇 computer vision
  • 2,879 篇 pattern recognit...
  • 2,859 篇 training
  • 1,808 篇 computational mo...
  • 1,718 篇 visualization
  • 1,478 篇 cameras
  • 1,381 篇 shape
  • 1,374 篇 face recognition
  • 1,364 篇 three-dimensiona...
  • 1,342 篇 feature extracti...
  • 1,269 篇 image segmentati...
  • 1,156 篇 robustness
  • 1,109 篇 semantics
  • 982 篇 layout
  • 978 篇 object detection
  • 953 篇 computer archite...
  • 952 篇 benchmark testin...
  • 931 篇 codes
  • 918 篇 object recogniti...
  • 899 篇 computer science

机构

  • 174 篇 univ sci & techn...
  • 154 篇 carnegie mellon ...
  • 149 篇 univ chinese aca...
  • 144 篇 chinese univ hon...
  • 110 篇 microsoft resear...
  • 104 篇 zhejiang univ pe...
  • 98 篇 swiss fed inst t...
  • 93 篇 tsinghua univ pe...
  • 92 篇 tsinghua univers...
  • 90 篇 microsoft res as...
  • 88 篇 shanghai ai lab ...
  • 83 篇 zhejiang univers...
  • 76 篇 alibaba grp peop...
  • 74 篇 hong kong univ s...
  • 73 篇 university of sc...
  • 72 篇 peking univ peop...
  • 68 篇 shanghai jiao to...
  • 68 篇 university of ch...
  • 66 篇 google res mount...
  • 66 篇 univ oxford oxfo...

作者

  • 83 篇 van gool luc
  • 71 篇 zhang lei
  • 60 篇 timofte radu
  • 49 篇 yang yi
  • 49 篇 luc van gool
  • 48 篇 xiaoou tang
  • 43 篇 darrell trevor
  • 43 篇 tian qi
  • 42 篇 loy chen change
  • 42 篇 sun jian
  • 41 篇 qi tian
  • 37 篇 vasconcelos nuno
  • 37 篇 liu yang
  • 37 篇 chen xilin
  • 37 篇 li fei-fei
  • 36 篇 liu xiaoming
  • 36 篇 shan shiguang
  • 36 篇 li stan z.
  • 36 篇 torralba antonio
  • 33 篇 zhou jie

语言

  • 21,137 篇 英文
  • 31 篇 中文
  • 5 篇 土耳其文
  • 4 篇 其他
  • 2 篇 日文
检索条件"任意字段=2011 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2011"
21179 条 记 录,以下是311-320 订阅
排序:
GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis
GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Zheng, Shunyuan Zhou, Boyao Shao, Ruizhi Liu, Boning Zhang, Shengping Nie, Liqiang Liu, Yebin Harbin Inst Technol Harbin Peoples R China Tsinghua Univ Beijing Peoples R China Peng Cheng Lab Shenzhen Peoples R China
We present a new approach, termed GPS-Gaussian, for synthesizing novel views of a character in a real-time manner. The proposed method enables 2K-resolution rendering under a sparse-view camera setting. Unlike the ori... 详细信息
来源: 评论
StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
StableVITON: Learning Semantic Correspondence with Latent Di...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Kim, Jeongho Gu, Gyojung Park, Minho Park, Sunghyun Choo, Jaegul Korea Adv Inst Sci & Technol Daejeon South Korea
Given a clothing image and a person image, an image-based virtual try-on aims to generate a customized image that appears natural and accurately reflects the characteristics of the clothing image. In this work, we aim... 详细信息
来源: 评论
SaCo Loss: Sample-wise Affinity Consistency for vision-Language Pre-training
SaCo Loss: Sample-wise Affinity Consistency for Vision-Langu...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Wu, Sitong Tan, Haoru Tian, Zhuotao Chen, Yukang Qi, Xiaojuan Jia, Jiaya CUHK Hong Kong Peoples R China HKU Hong Kong Peoples R China SmartMore Hong Kong Peoples R China
vision-language pre-training (VLP) aims to learn joint representations of vision and language modalities. The contrastive paradigm is currently dominant in this field. However, we observe a notable misalignment phenom... 详细信息
来源: 评论
PIGEON: Predicting Image Geolocations
PIGEON: Predicting Image Geolocations
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Haas, Lukas Skreta, Michal Alberti, Silas Finn, Chelsea Stanford Univ Stanford CA 94305 USA
Planet-scale image geolocalization remains a challenging problem due to the diversity of images originating from anywhere in the world. Although approaches based on vision transformers have made significant progress i... 详细信息
来源: 评论
DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses
DVMNet: Computing Relative Pose for Unseen Objects Beyond Hy...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Zhao, Chen Zhang, Tong Dang, Zheng Salzmann, Mathieu Ecole Polytech Fed Lausanne Lausanne Switzerland ClearSpace SA Renens Switzerland
Determining the relative pose of an object between two images is pivotal to the success of generalizable object pose estimation. Existing approaches typically approximate the continuous pose representation with a larg... 详细信息
来源: 评论
Progressive Semantic-Guided vision Transformer for Zero-Shot Learning
Progressive Semantic-Guided Vision Transformer for Zero-Shot...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Chen, Shiming Hou, Wenjin Khan, Salman Khan, Fahad Shahbaz Mohamed Bin Zayed Univ AI Abu Dhabi U Arab Emirates Huazhong Univ Sci & Technol Wuhan Peoples R China Australian Natl Univ Canberra ACT Australia Linkoping Univ Linkoping Sweden
Zero-shot learning (ZSL) recognizes the unseen classes by conducting visual-semantic interactions to transfer semantic knowledge from seen classes to unseen ones, supported by semantic information (e.g., attributes). ... 详细信息
来源: 评论
Visual Program Distillation: Distilling Tools and Programmatic Reasoning into vision-Language Models
Visual Program Distillation: Distilling Tools and Programmat...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Hu, Yushi Stretcu, Otilia Lu, Chun-Ta Viswanathan, Krishnamurthy Hata, Kenji Luo, Enming Krishna, Ranjay Fuxman, Ariel Google Res Mountain View CA 94043 USA Univ Washington Seattle WA 98195 USA
Solving complex visual tasks such as "Who invented the musical instrument on the right?" involves a composition of skills: understanding space, recognizing instruments, and also retrieving prior knowledge. R... 详细信息
来源: 评论
SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large vision Language Models
SC-Tune: Unleashing Self-Consistent Referential Comprehensio...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Yue, Tongtian Cheng, Jie Guo, Longteng Dai, Xingyuan Zhao, Zijia He, Xingjian Xiong, Gang Lv, Yisheng Liu, Jing CASIA Lab Cognit & Decis Intelligence Complex Syst Beijing Peoples R China CASIA State Key Lab Multimodal Artificial Intelligence Beijing Peoples R China Univ Chinese Acad Sci Sch Artificial Intelligence Beijing Peoples R China
Recent trends in Large vision Language Models (LVLMs) research have been increasingly focusing on advancing beyond general image understanding towards more nuanced, object-level referential comprehension. In this pape...
来源: 评论
GLID: Pre-training a Generalist Encoder-Decoder vision Model
GLID: Pre-training a Generalist Encoder-Decoder Vision Model
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Liu, Jihao Zheng, Jinliang Liu, Yu Li, Hongsheng CUHK MMLab Hong Kong Peoples R China SenseTime Res Hong Kong Peoples R China Shanghai AI Lab Shanghai Peoples R China CPII InnoHK Hong Kong Peoples R China Tsinghua Univ Inst AI Ind Res AIR Shanghai Peoples R China
This paper proposes a GeneraLIst encoder-Decoder (GLID) pre-training method for better handling various downstream computer vision tasks. While self-supervised pre-training approaches, e.g., Masked Autoencoder, have s... 详细信息
来源: 评论
EgoGen: An Egocentric Synthetic Data Generator
EgoGen: An Egocentric Synthetic Data Generator
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Li, Gen Zhao, Kaifeng Zhang, Siwei Lyu, Xiaozhong Dusmanu, Mihai Zhang, Yan Pollefeys, Marc Tang, Siyu Swiss Fed Inst Technol Zurich Switzerland Microsoft Redmond WA USA
Understanding the world in first-person view is fundamental in Augmented Reality (AR). This immersive perspective brings dramatic visual changes and unique challenges compared to third-person views. Synthetic data has... 详细信息
来源: 评论