咨询与建议

限定检索结果

文献类型

  • 20,798 篇 会议
  • 88 篇 期刊文献
  • 65 册 图书

馆藏范围

  • 20,950 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 13,275 篇 工学
    • 10,923 篇 计算机科学与技术...
    • 2,484 篇 机械工程
    • 2,307 篇 软件工程
    • 913 篇 光学工程
    • 771 篇 电气工程
    • 556 篇 控制科学与工程
    • 405 篇 信息与通信工程
    • 210 篇 测绘科学与技术
    • 131 篇 生物医学工程(可授...
    • 104 篇 电子科学与技术(可...
    • 100 篇 生物工程
    • 92 篇 仪器科学与技术
    • 56 篇 化学工程与技术
    • 52 篇 建筑学
    • 48 篇 土木工程
    • 44 篇 安全科学与工程
    • 38 篇 力学(可授工学、理...
    • 38 篇 航空宇航科学与技...
    • 35 篇 交通运输工程
  • 3,457 篇 医学
    • 3,449 篇 临床医学
    • 34 篇 基础医学(可授医学...
  • 2,315 篇 理学
    • 1,154 篇 数学
    • 1,132 篇 物理学
    • 417 篇 统计学(可授理学、...
    • 386 篇 生物学
    • 252 篇 系统科学
    • 57 篇 化学
  • 353 篇 管理学
    • 184 篇 图书情报与档案管...
    • 176 篇 管理科学与工程(可...
    • 32 篇 工商管理
  • 28 篇 法学
  • 20 篇 农学
  • 15 篇 教育学
  • 9 篇 经济学
  • 8 篇 艺术学
  • 5 篇 文学
  • 5 篇 军事学

主题

  • 8,203 篇 computer vision
  • 3,010 篇 pattern recognit...
  • 2,732 篇 training
  • 1,769 篇 computational mo...
  • 1,657 篇 visualization
  • 1,483 篇 cameras
  • 1,415 篇 shape
  • 1,369 篇 three-dimensiona...
  • 1,369 篇 face recognition
  • 1,285 篇 image segmentati...
  • 1,272 篇 feature extracti...
  • 1,178 篇 robustness
  • 1,090 篇 semantics
  • 1,040 篇 layout
  • 1,007 篇 object detection
  • 975 篇 object recogniti...
  • 969 篇 computer science
  • 946 篇 computer archite...
  • 946 篇 benchmark testin...
  • 931 篇 codes

机构

  • 174 篇 univ sci & techn...
  • 154 篇 carnegie mellon ...
  • 148 篇 univ chinese aca...
  • 144 篇 chinese univ hon...
  • 113 篇 microsoft resear...
  • 103 篇 zhejiang univ pe...
  • 99 篇 swiss fed inst t...
  • 97 篇 tsinghua univ pe...
  • 93 篇 tsinghua univers...
  • 91 篇 microsoft res as...
  • 88 篇 shanghai ai lab ...
  • 81 篇 zhejiang univers...
  • 76 篇 alibaba grp peop...
  • 74 篇 hong kong univ s...
  • 73 篇 university of sc...
  • 72 篇 peking univ peop...
  • 69 篇 university of ch...
  • 68 篇 shanghai jiao to...
  • 66 篇 google res mount...
  • 66 篇 univ oxford oxfo...

作者

  • 80 篇 van gool luc
  • 71 篇 zhang lei
  • 59 篇 timofte radu
  • 48 篇 yang yi
  • 47 篇 xiaoou tang
  • 44 篇 darrell trevor
  • 43 篇 tian qi
  • 43 篇 luc van gool
  • 42 篇 loy chen change
  • 42 篇 sun jian
  • 42 篇 li fei-fei
  • 40 篇 qi tian
  • 39 篇 li stan z.
  • 37 篇 liu yang
  • 37 篇 chen xilin
  • 36 篇 shan shiguang
  • 35 篇 liu xiaoming
  • 35 篇 vasconcelos nuno
  • 35 篇 torralba antonio
  • 32 篇 zhou jie

语言

  • 20,928 篇 英文
  • 14 篇 中文
  • 6 篇 其他
  • 2 篇 日文
  • 2 篇 土耳其文
检索条件"任意字段=2009 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009"
20951 条 记 录,以下是71-80 订阅
排序:
X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization
X-MIC: Cross-Modal Instance Conditioning for Egocentric Acti...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Kukleva, Anna Sener, Fadime Remelli, Edoardo Tekin, Bugra Sauser, Eric Schiele, Bernt Mal, Shugao Meta Real Labs Menlo Pk CA 94025 USA Max Planck Inst Informat Saarland Informat Campus Saarbrucken Germany
Lately, there has been growing interest in adapting vision-language models (VLMs) to image and third-person video classification due to their success in zero-shot recognition. However, the adaptation of these models t... 详细信息
来源: 评论
Adaptive Hyper-graph Aggregation for Modality-Agnostic Federated Learning
Adaptive Hyper-graph Aggregation for Modality-Agnostic Feder...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Qi, Fan Li, Shuai Tianjin Univ Technol Tianjin Peoples R China
In Federated Learning (FL), the issue of statistical data heterogeneity has been a significant challenge to the field's ongoing development. This problem is further exacerbated when clients' data vary in modal... 详细信息
来源: 评论
Cinematic Behavior Transfer via NeRF-based Differentiable Filming
Cinematic Behavior Transfer via NeRF-based Differentiable Fi...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Jiang, Xuekun Rao, Anyi Wang, Jingbo Lin, Dahua Dai, Bo Shanghai AI Lab Shanghai Peoples R China Stanford Univ Stanford CA 94305 USA Chinese Univ Hong Kong Hong Kong Peoples R China
In the evolving landscape of digital media and video production, the precise manipulation and reproduction of visual elements like camera movements and character actions are highly desired. Existing SLAM methods face ... 详细信息
来源: 评论
Streaming Dense Video Captioning
Streaming Dense Video Captioning
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Zhou, Xingyi Arnab, Anurag Buch, Shyamal Yan, Shen Myers, Austin Xiong, Xuehan Nagrani, Arsha Schmid, Cordelia Google Mountain View CA 94043 USA
An ideal model for dense video captioning - predicting captions localized temporally in a video - should be able to handle long input videos, predict rich, detailed textual descriptions, and be able to produce outputs... 详细信息
来源: 评论
JRDB-Social: A Multifaceted Robotic Dataset for Understanding of Context and Dynamics of Human Interactions Within Social Groups
JRDB-Social: A Multifaceted Robotic Dataset for Understandin...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Jahangard, Simindokht Cai, Zhixi Wen, Shiki Rezatofighi, Hamid Monash Univ Clayton Vic Australia
Understanding human social behaviour is crucial in computer vision and robotics. Micro-level observations like individual actions fall short, necessitating a comprehensive approach that considers individual behaviour,... 详细信息
来源: 评论
Semantics-aware Motion Retargeting with vision-Language Models
Semantics-aware Motion Retargeting with Vision-Language Mode...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Zhang, Haodong Chen, Zhike Xu, Haocheng Hao, Lei Wu, Xiaofei Xu, Songcen Zhang, Zhensong Wang, Yue Xiong, Rong Zhejiang Univ Hangzhou Peoples R China Huawei Noahs Ark Lab Montreal PQ Canada
Capturing and preserving motion semantics is essential to motion retargeting between animation characters. However, most of the previous works neglect the semantic information or rely on human-designed joint-level rep... 详细信息
来源: 评论
Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Repurposing Diffusion-Based Image Generators for Monocular D...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Ke, Bingxin Obukhov, Anton Huang, Shengyu Metzger, Nando Daudt, Rodrigo Caye Schindler, Konrad Swiss Fed Inst Technol Photogrammetry & Remote Sensing Zurich Switzerland
Monocular depth estimation is a fundamental computer vision task. Recovering 3D depth from a single image is geometrically ill-posed and requires scene understanding, so it is not surprising that the rise of deep lear... 详细信息
来源: 评论
SnAG: Scalable and Accurate Video Grounding
SnAG: Scalable and Accurate Video Grounding
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Mu, Fangzhou Mo, Sicheng Li, Yin Univ Wisconsin Madison Madison WI 53706 USA
Temporal grounding of text descriptions in videos is a central problem in vision-language learning and video understanding. Existing methods often prioritize accuracy over scalability - they have been optimized for gr... 详细信息
来源: 评论
LTGC: Long-tail recognition via Leveraging LLMs-driven Generated Content
LTGC: Long-tail Recognition via Leveraging LLMs-driven Gener...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Zhao, Qihao Dai, Yalun Li, Hao Hu, Wei Zhang, Fan Liu, Jun Beijing Univ Chem Technol Beijing Peoples R China Singapore Univ Technol & Design Singapore Singapore Nanyang Technol Univ Singapore Singapore Northwestern Polytech Univ Xian Peoples R China
Long-tail recognition is challenging because it requires the model to learn good representations from tail categories and address imbalances across all categories. In this paper, we propose a novel generative and fine...
来源: 评论
Exploring vision Transformers for 3D Human Motion-Language Models with Motion Patches
Exploring Vision Transformers for 3D Human Motion-Language M...
收藏 引用
ieee/CVF conference on computer vision and pattern recognition (cvpr)
作者: Yu, Qing Tanaka, Mikihiro Fujiwara, Kent LY Corp Tokyo Japan
To build a cross-modal latent space between 3D human motion and language, acquiring large-scale and high-quality human motion data is crucial. However, unlike the abundance of image data, the scarcity of motion data h... 详细信息
来源: 评论