Search query: "Any field = 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024"
11,890 records; showing results 481-490

Zero-Shot Audio-Visual Compound Expression Recognition Method based on Emotion Probability Fusion
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Ryumina, Elena; Markitantov, Maxim; Ryumin, Dmitry; Kaya, Heysem; Karpov, Alexey
Affiliations: Russian Acad Sci St Petersburg Fed Res Ctr St Petersburg Russia; Univ Utrecht Dept Informat & Comp Sci Utrecht Netherlands
Compound Expression Recognition (CER), a sub-field of affective computing, is a novel task in intelligent human-computer interaction and multimodal user interfaces. We propose a novel audio-visual method for CER. O...

Three Pillars Improving Vision Foundation Model Distillation for Lidar
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Puy, Gilles; Gidaris, Spyros; Boulch, Alexandre; Simeoni, Oriane; Sautier, Corentin; Perez, Patrick; Bursuc, Andrei; Marlet, Renaud
Affiliations: Valeo ai Paris France; Kyutai Paris France; Univ Gustave Eiffel CNRS LIGM Ecole Ponts Marne La Vallee France
Self-supervised image backbones can be used to address complex 2D tasks (e.g., semantic segmentation, object discovery) very efficiently and with little or no downstream supervision. Ideally, 3D backbones for lidar sh...

LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Zhao, Pancheng; Xu, Peng; Qin, Pengda; Fan, Deng-Ping; Zhang, Zhicheng; Jia, Guoli; Zhou, Bowen; Yang, Jufeng
Affiliations: Nankai Univ Coll Comp Sci VCIP Tianjin Peoples R China; Nankai Univ Coll Comp Sci TMCC Tianjin Peoples R China; Nankai Univ Coll Comp Sci DISSec Tianjin Peoples R China; Nankai Int Adv Res Inst Shenzhen Peoples R China; Tsinghua Univ Dept Elect Engn Beijing Peoples R China; Alibaba Grp Hangzhou Peoples R China
Camouflaged vision perception is an important vision task with numerous practical applications. Due to the expensive collection and labeling costs, this community struggles with a major bottleneck that the species cat...

ES3: Evolving Self-Supervised Learning of Robust Audio-Visual Speech Representations
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Zhang, Yuanhang; Yang, Shuang; Shan, Shiguang; Chen, Xilin
Affiliations: Chinese Acad Sci Inst Comp Technol Key Lab Intelligent Informat Proc Beijing 100190 Peoples R China; Univ Chinese Acad Sci Beijing 100049 Peoples R China
We propose a novel strategy, ES3, for self-supervised learning of robust audio-visual speech representations from unlabeled talking face videos. While many recent approaches for this task primarily rely on guiding the...

Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Koch, Sebastian; Vaskevicius, Narunas; Colosi, Mirco; Hermosilla, Pedro; Ropinski, Timo
Affiliations: Bosch Ctr Artificial Intelligence Stuttgart Germany; Robert Bosch Corp Res Stuttgart Germany; Univ Ulm Ulm Germany; TU Vienna Vienna Austria
Current approaches for 3D scene graph prediction rely on labeled datasets to train models for a fixed set of known object classes and relationship categories. We present Open3DSG, an alternative approach to learn 3D s...

OpenEQA: Embodied Question Answering in the Era of Foundation Models
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Majumdar, Arjun; Ajay, Anurag; Zhang, Xi Aohan; Punya, Pranav; Yenamandra, Sriram; Henaff, Mikael; Silwal, Sneha; Mcvay, Paul; Maksymets, Oleksandr; Arnaud, Sergio; Yadav, Karmesh; Li, Qiyang; Newman, Ben; Sharma, Mohit; Berges, Vincent; Zhang, Shiqi; Agrawal, Pulkit; Bisk, Yonatan; Batra, Dhruv; Kalakrishnan, Mrinal; Meier, Franziska; Paxton, Chris; Sax, Alexander; Rajeswaran, Aravind
Affiliations: Georgia Tech Atlanta GA 30332 USA; MIT 77 Massachusetts Ave Cambridge MA 02139 USA; SUNY Binghamton Binghamton NY USA; Meta AI Menlo Pk CA USA; Univ Calif Berkeley Berkeley CA USA; CMU Pittsburgh PA USA; Meta Fundamental AI Res FAIR Menlo Pk CA USA
We present a modern formulation of Embodied Question Answering (EQA) as the task of understanding an environment well enough to answer questions about it in natural language. An agent can achieve such an understanding...

360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Huang, Huajian; Liu, Changkun; Zhu, Yipeng; Cheng, Hui; Braud, Tristan; Yeung, Sai-Kit
Affiliations: Hong Kong Univ Sci & Technol Hong Kong Peoples R China; Sun Yat Sen Univ Guangzhou Peoples R China
Portable 360° cameras are becoming a cheap and efficient tool to establish large visual databases. By capturing omnidirectional views of a scene, these cameras could expedite building environment models that ar...

Prompt-Enhanced Multiple Instance Learning for Weakly Supervised Video Anomaly Detection
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Chen, Junxi; Li, Liang; Su, Li; Zha, Zheng-Jun; Huang, Qingming
Affiliations: Univ Chinese Acad Sci Beijing Peoples R China; Chinese Acad Sci Key Lab Intell Info Proc ICT Beijing Peoples R China; Peng Cheng Lab Shenzhen Peoples R China; Univ Sci & Technol China Hefei Peoples R China; Chinese Acad Sci Key Lab Safety Beijing Peoples R China
Weakly-supervised Video Anomaly Detection (wVAD) aims to detect frame-level anomalies using only video-level labels in training. Due to the limitation of coarse-grained labels, Multi-Instance Learning (MIL) is prevail...

Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Chen, Tsai-Shien; Siarohin, Aliaksandr; Menapace, Willi; Deyneka, Ekaterina; Chao, Hsiang-wei; Jeon, Byung Eun; Fang, Yuwei; Lee, Hsin-Ying; Ren, Jian; Yang, Ming-Hsuan; Tulyakov, Sergey
Affiliations: Snap Inc Santa Monica CA 90405 USA; Univ Calif Merced Merced CA 95343 USA; Univ Trento Trento Italy; Snap Santa Monica CA USA
The quality of the data and annotation upper-bounds the quality of a downstream model. While there exist large text corpora and image-text pairs, high-quality video-text data is much harder to collect. First of all, m...

3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Sun, Jiakai; Jiao, Han; Li, Guangyuan; Zhang, Zhanjie; Zhao, Lei; Xing, Wei
Affiliations: Zhejiang Univ Hangzhou Peoples R China
Constructing photo-realistic Free-Viewpoint Videos (FVVs) of dynamic scenes from multi-view videos remains a challenging endeavor. Despite the remarkable advancements achieved by current neural rendering techniques,...