咨询与建议

限定检索结果

文献类型

  • 12,844 篇 会议
  • 13 篇 期刊文献
  • 2 册 图书

馆藏范围

  • 12,859 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 7,573 篇 工学
    • 6,863 篇 计算机科学与技术...
    • 880 篇 机械工程
    • 814 篇 软件工程
    • 435 篇 控制科学与工程
    • 360 篇 光学工程
    • 306 篇 电气工程
    • 209 篇 仪器科学与技术
    • 124 篇 信息与通信工程
    • 91 篇 生物工程
    • 62 篇 生物医学工程(可授...
    • 39 篇 电子科学与技术(可...
    • 34 篇 安全科学与工程
    • 26 篇 化学工程与技术
    • 21 篇 交通运输工程
    • 20 篇 建筑学
    • 18 篇 土木工程
  • 2,957 篇 医学
    • 2,956 篇 临床医学
    • 15 篇 基础医学(可授医学...
    • 12 篇 药学(可授医学、理...
  • 700 篇 理学
    • 359 篇 物理学
    • 225 篇 数学
    • 175 篇 系统科学
    • 95 篇 统计学(可授理学、...
    • 93 篇 生物学
    • 22 篇 化学
  • 201 篇 艺术学
    • 201 篇 设计学(可授艺术学...
  • 84 篇 管理学
    • 59 篇 图书情报与档案管...
    • 25 篇 管理科学与工程(可...
    • 14 篇 工商管理
  • 23 篇 法学
    • 21 篇 社会学
  • 5 篇 农学
  • 4 篇 教育学
  • 2 篇 经济学
  • 1 篇 军事学

主题

  • 6,464 篇 computer vision
  • 2,693 篇 training
  • 2,440 篇 pattern recognit...
  • 1,778 篇 computational mo...
  • 1,528 篇 visualization
  • 1,348 篇 three-dimensiona...
  • 1,091 篇 computer archite...
  • 1,061 篇 semantics
  • 997 篇 benchmark testin...
  • 980 篇 codes
  • 970 篇 conferences
  • 852 篇 feature extracti...
  • 828 篇 cameras
  • 771 篇 task analysis
  • 708 篇 deep learning
  • 645 篇 image segmentati...
  • 611 篇 object detection
  • 584 篇 shape
  • 554 篇 transformers
  • 543 篇 neural networks

机构

  • 132 篇 univ sci & techn...
  • 122 篇 carnegie mellon ...
  • 118 篇 tsinghua univ pe...
  • 114 篇 univ chinese aca...
  • 113 篇 chinese univ hon...
  • 94 篇 tsinghua univers...
  • 91 篇 zhejiang univ pe...
  • 91 篇 swiss fed inst t...
  • 83 篇 university of ch...
  • 80 篇 zhejiang univers...
  • 78 篇 peng cheng labor...
  • 77 篇 shanghai ai lab ...
  • 75 篇 university of sc...
  • 72 篇 peng cheng lab p...
  • 69 篇 shanghai jiao to...
  • 69 篇 shanghai jiao to...
  • 69 篇 sensetime res pe...
  • 68 篇 stanford univ st...
  • 67 篇 alibaba grp peop...
  • 67 篇 univ hong kong p...

作者

  • 77 篇 timofte radu
  • 63 篇 van gool luc
  • 45 篇 zhang lei
  • 39 篇 luc van gool
  • 36 篇 yang yi
  • 33 篇 tao dacheng
  • 31 篇 loy chen change
  • 29 篇 chen chen
  • 29 篇 sun jian
  • 28 篇 qi tian
  • 25 篇 li xin
  • 24 篇 liu yang
  • 24 篇 tian qi
  • 24 篇 ying shan
  • 24 篇 wang xinchao
  • 23 篇 zha zheng-jun
  • 22 篇 boxin shi
  • 21 篇 zhou jie
  • 21 篇 vasconcelos nuno
  • 20 篇 luo ping

语言

  • 12,850 篇 英文
  • 8 篇 其他
  • 1 篇 中文
检索条件"任意字段=IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops"
12859 条 记 录,以下是551-560 订阅
排序:
MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating vision-Language Transformer
MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Cao, Jianjian Ye, Peng Li, Shengze Yu, Chong Tang, Yansong Lu, Jiwen Chen, Tao Fudan Univ Sch Informat Sci & Technol Shanghai Peoples R China Fudan Univ Acad Engn & Technol Shanghai Peoples R China Tsinghua Univ Tsinghua Shenzhen Int Grad Sch Beijing Peoples R China Tsinghua Univ Dept Automat Beijing Peoples R China
vision-Language Transformers (VLTs) have shown great success recently, but are meanwhile accompanied by heavy computation costs, where a major reason can be attributed to the large number of visual and language tokens... 详细信息
来源: 评论
Towards Explaining Image-Based Distribution Shifts
Towards Explaining Image-Based Distribution Shifts
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Kulinski, Sean Inouye, David I. Purdue Univ Sch Elect & Comp Engn W Lafayette IN 47907 USA
Distribution shift can have fundamental consequences such as signaling a change in the operating environment or significantly reducing the accuracy of downstream models. Thus, understanding such distribution shifts is... 详细信息
来源: 评论
Adversarial Counterfactual Visual Explanations
Adversarial Counterfactual Visual Explanations
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Jeanneret, Guillaume Simon, Loic Jurie, Frederic Univ Caen Normandie ENSICAEN CNRS Caen France
Counterfactual explanations and adversarial attacks have a related goal: flipping output labels with minimal perturbations regardless of their characteristics. Yet, adversarial attacks cannot be used directly in a cou... 详细信息
来源: 评论
Scaling Language-Image Pre-training via Masking
Scaling Language-Image Pre-training via Masking
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Li, Yanghao Fan, Haoqi Hu, Ronghang Feichtenhofert, Christoph He, Kaiming Meta AI FAIR New York NY 10023 USA
We present Fast Language-Image Pre-training (FLIP), a simple and more efficient method for training CLIP [52]. Our method randomly masks out and removes a large portion of image patches during training. Masking allows... 详细信息
来源: 评论
Prompt-Enhanced Multiple Instance Learning for Weakly Supervised Video Anomaly Detection
Prompt-Enhanced Multiple Instance Learning for Weakly Superv...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Chen, Junxi Li, Liang Su, Li Zha, Zheng-Jun Huang, Qingming Univ Chinese Acad Sci Beijing Peoples R China Chinese Acad Sci Key Lab Intell Info Proc ICT Beijing Peoples R China Peng Cheng Lab Shenzhen Peoples R China Univ Sci & Technol China Hefei Peoples R China Chinese Acad Sci Key Lab Safety Beijing Peoples R China
Weakly-supervised Video Anomaly Detection (wVAD) aims to detect frame-level anomalies using only video-level labels in training. Due to the limitation of coarse-grained labels, Multi-Instance Learning (MIL) is prevail... 详细信息
来源: 评论
What Do You See in Vehicle? Comprehensive vision Solution for In-Vehicle Gaze Estimation
What Do You See in Vehicle? Comprehensive Vision Solution fo...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Cheng, Yihua Zhu, Yaning Wang, Zongji Hao, Hongquan Liu, Yongwei Cheng, Shiqing Wang, Xi Chang, Hyung Jin Univ Birmingham Birmingham W Midlands England Huazhong Univ Sci & Technol Wuhan Hubei Peoples R China Chinese Acad Sci NIST Beijing Peoples R China CalmCar Suzhou Jiangsu Peoples R China
Driver's eye gaze holds a wealth of cognitive and intentional cues crucial for intelligent vehicles. Despite its significance, research on in-vehicle gaze estimation remains limited due to the scarcity of comprehe... 详细信息
来源: 评论
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment fro...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Yu, Tianyu Yao, Yuan Zhang, Haoye He, Taiwen Han, Yifeng Cui, Ganqu Hu, Jinyi Liu, Zhiyuan Zheng, Hai-Tao Sun, Maosong Tsinghua Univ Beijing Peoples R China Natl Univ Singapore Singapore Singapore Tsinghua Univ Shenzhen Int Grad Sch Beijing Peoples R China Pengcheng Lab Shenzhen Peoples R China
Multimodal Large Language Models (MLLMs) have recently demonstrated impressive capabilities in multimodal understanding, reasoning, and interaction. However, existing MLLMs prevalently suffer from serious hallucinatio... 详细信息
来源: 评论
Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction
Generalizable Whole Slide Image Classification with Fine-Gra...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Li, Hao Chen, Ying Chen, Yifei Yu, Rongshan Yang, Wenxian Wang, Liansheng Ding, Bowen Han, Yuchen Xiamen Univ Sch Informat Xiamen Peoples R China Huawei Xiamen Peoples R China Aginome Sci Xiamen Peoples R China Shanghai Jiao Tong Univ Shanghai Chest Hosp Dept Pathol Sch Med Shanghai Peoples R China
Whole Slide Image (WSI) classification is often formulated as a Multiple Instance Learning (MIL) problem. Recently, vision-Language Models (VLMs) have demonstrated remarkable performance in WSI classification. However... 详细信息
来源: 评论
TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding
TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understan...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Zhang, Zhihao Cao, Shengcao Wang, Yu-Xiong Xi An Jiao Tong Univ Xian Peoples R China Univ Illinois Champaign IL USA
The limited scale of current 3D shape datasets hinders the advancements in 3D shape understanding, and motivates multi-modal learning approaches which transfer learned knowledge from data-abundant 2D image and languag... 详细信息
来源: 评论
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Und...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (CVPR)
作者: Chen, Sijin Chen, Xin Zhang, Chi Li, Mingsheng Yu, Gang Fei, Hao Zhu, Hongyuan Fan, Jiayuan Chen, Tao Fudan Univ Shanghai Peoples R China Tencent PCG Shenzhen Peoples R China Natl Univ Singapore Singapore Singapore ASTAR Inst InfoComm Res I2R Singapore Singapore ASTAR Ctr Frontier AI Res CFAR Singapore Singapore
Recent progress in Large Multimodal Models (LMM) has opened up great possibilities for various applications in the field of human-machine interactions. However, developing LMMs that can comprehend, reason, and plan in... 详细信息
来源: 评论