咨询与建议

限定检索结果

文献类型

  • 11,885 篇 会议
  • 5 篇 期刊文献

馆藏范围

  • 11,890 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 8,059 篇 工学
    • 7,617 篇 计算机科学与技术...
    • 796 篇 机械工程
    • 688 篇 电气工程
    • 360 篇 软件工程
    • 228 篇 控制科学与工程
    • 40 篇 光学工程
    • 19 篇 生物工程
    • 17 篇 信息与通信工程
    • 12 篇 生物医学工程(可授...
    • 6 篇 电子科学与技术(可...
    • 6 篇 建筑学
    • 6 篇 交通运输工程
    • 5 篇 仪器科学与技术
    • 5 篇 化学工程与技术
    • 5 篇 安全科学与工程
    • 4 篇 土木工程
  • 3,347 篇 医学
    • 3,346 篇 临床医学
    • 4 篇 基础医学(可授医学...
    • 4 篇 公共卫生与预防医...
  • 253 篇 理学
    • 198 篇 系统科学
    • 32 篇 物理学
    • 21 篇 生物学
    • 18 篇 数学
    • 9 篇 统计学(可授理学、...
    • 7 篇 化学
  • 17 篇 管理学
    • 12 篇 管理科学与工程(可...
    • 7 篇 图书情报与档案管...
    • 5 篇 工商管理
  • 3 篇 法学
    • 3 篇 社会学
  • 3 篇 教育学
    • 3 篇 教育学
  • 2 篇 农学
  • 1 篇 经济学
  • 1 篇 军事学

主题

  • 5,633 篇 computer vision
  • 2,668 篇 training
  • 2,203 篇 pattern recognit...
  • 1,747 篇 computational mo...
  • 1,502 篇 visualization
  • 1,360 篇 three-dimensiona...
  • 1,074 篇 semantics
  • 999 篇 benchmark testin...
  • 986 篇 codes
  • 959 篇 computer archite...
  • 891 篇 deep learning
  • 777 篇 conferences
  • 754 篇 task analysis
  • 700 篇 feature extracti...
  • 561 篇 transformers
  • 533 篇 face recognition
  • 527 篇 neural networks
  • 495 篇 object detection
  • 490 篇 image segmentati...
  • 468 篇 cameras

机构

  • 174 篇 univ sci & techn...
  • 145 篇 carnegie mellon ...
  • 144 篇 univ chinese aca...
  • 144 篇 tsinghua univ pe...
  • 134 篇 chinese univ hon...
  • 110 篇 zhejiang univ pe...
  • 109 篇 peng cheng lab p...
  • 99 篇 swiss fed inst t...
  • 91 篇 tsinghua univers...
  • 90 篇 shanghai ai lab ...
  • 87 篇 sensetime res pe...
  • 86 篇 shanghai jiao to...
  • 83 篇 zhejiang univers...
  • 82 篇 tech univ munich...
  • 79 篇 university of sc...
  • 79 篇 stanford univ st...
  • 78 篇 univ hong kong p...
  • 77 篇 australian natl ...
  • 76 篇 alibaba grp peop...
  • 75 篇 peng cheng labor...

作者

  • 75 篇 timofte radu
  • 64 篇 van gool luc
  • 50 篇 zhang lei
  • 43 篇 yang yi
  • 37 篇 loy chen change
  • 36 篇 tao dacheng
  • 32 篇 zhou jie
  • 31 篇 chen chen
  • 30 篇 liu yang
  • 30 篇 tian qi
  • 29 篇 sun jian
  • 29 篇 zha zheng-jun
  • 28 篇 li xin
  • 27 篇 qi tian
  • 26 篇 vasconcelos nuno
  • 25 篇 liu xiaoming
  • 25 篇 darrell trevor
  • 24 篇 zheng wei-shi
  • 24 篇 luo ping
  • 24 篇 ying shan

语言

  • 11,863 篇 英文
  • 26 篇 其他
  • 1 篇 中文
检索条件"任意字段=2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024"
11890 条 记 录,以下是561-570 订阅
排序:
ALINA: Advanced Line Identification and Notation Algorithm
ALINA: Advanced Line Identification and Notation Algorithm
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Khan, Mohammed Abdul Hafeez Ganeriwala, Parth Bhattacharyya, Siddhartha Neogi, Natasha Muthalagu, Raja Florida Inst Technol Melbourne FL 32901 USA NASA Langley Res Ctr Hampton VA 23665 USA BITS Pilani Dubai Campus Dubai U Arab Emirates
Labels are the cornerstone of supervised machine learning algorithms. Most visual recognition methods are fully supervised, using bounding boxes or pixel-wise segmentations for object localization. Traditional labelin... 详细信息
来源: 评论
GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture recognition
GestFormer: Multiscale Wavelet Pooling Transformer Network f...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Garg, Mallika Ghosh, Debashis Pradhan, Pyari Mohan Indian Inst Technol Dept Elect & Commun Engn Roorkee Uttar Pradesh India
Transformer model have achieved state-of-the-art results in many applications like NLP, classification, etc. But their exploration in gesture recognition task is still limited. So, we propose a novel GestFormer archit... 详细信息
来源: 评论
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment fro...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Yu, Tianyu Yao, Yuan Zhang, Haoye He, Taiwen Han, Yifeng Cui, Ganqu Hu, Jinyi Liu, Zhiyuan Zheng, Hai-Tao Sun, Maosong Tsinghua Univ Beijing Peoples R China Natl Univ Singapore Singapore Singapore Tsinghua Univ Shenzhen Int Grad Sch Beijing Peoples R China Pengcheng Lab Shenzhen Peoples R China
Multimodal Large Language Models (MLLMs) have recently demonstrated impressive capabilities in multimodal understanding, reasoning, and interaction. However, existing MLLMs prevalently suffer from serious hallucinatio... 详细信息
来源: 评论
Theoretically Achieving Continuous Representation of Oriented Bounding Boxes
Theoretically Achieving Continuous Representation of Oriente...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Xiao, Zi-Kai Yang, Guo-Ye Yang, Xue Mul, Tai-Jiang Yan, Junchi Hui, Shi-Min Tsinghua Univ Dept Comp Sci & Technol Beijing Peoples R China Shanghai Jiao Tong Univ Dept CSE Shanghai Peoples R China Shanghai Jiao Tong Univ MoE Key Lab AI Shanghai Peoples R China
Considerable efforts have been devoted to Oriented Object Detection (OOD). However, one lasting issue regarding the discontinuity in Oriented Bounding Box (OBB) representation remains unresolved, which is an inherent ... 详细信息
来源: 评论
Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction
Generalizable Whole Slide Image Classification with Fine-Gra...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Li, Hao Chen, Ying Chen, Yifei Yu, Rongshan Yang, Wenxian Wang, Liansheng Ding, Bowen Han, Yuchen Xiamen Univ Sch Informat Xiamen Peoples R China Huawei Xiamen Peoples R China Aginome Sci Xiamen Peoples R China Shanghai Jiao Tong Univ Shanghai Chest Hosp Dept Pathol Sch Med Shanghai Peoples R China
Whole Slide Image (WSI) classification is often formulated as a Multiple Instance Learning (MIL) problem. Recently, vision-Language Models (VLMs) have demonstrated remarkable performance in WSI classification. However... 详细信息
来源: 评论
One Embedding to Predict Them All: Visible and Thermal Universal Face Representations for Soft Biometric Estimation via vision Transformers
One Embedding to Predict Them All: Visible and Thermal Unive...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Mirabet-Herranz, Nelida Galdi, Chiara Dugelay, Jean-Luc EURECOM Campus SophiaTech450 Route Chappes F-06410 Biot France
Human faces encode a vast amount of information including not only uniquely distinctive features of the individual but also demographic information such as a person's age, gender, and weight. Such information is r... 详细信息
来源: 评论
Interpreting COVID Lateral Flow Tests' Results with Foundation Models
Interpreting COVID Lateral Flow Tests' Results with Foundati...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Pandey, Stuti Myers-Dean, Josh Reynolds, Jarek Gurari, Danna Univ Colorado Boulder CO 80309 USA Univ Texas Austin Austin TX 78712 USA
Lateral flow tests (LFTs) enable rapid, low-cost testing for health conditions including Covid, pregnancy, HIV, and malaria. Automated readers of LFT results can yield many benefits including empowering blind people t... 详细信息
来源: 评论
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in vision-based Roadside 3D Object Detection
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Represen...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Wang, Wenjie Lu, Yehao Zheng, Guangcong Zhan, Shuigen Ye, Xiaoqing Tan, Zichang Wang, Jingdong Wang, Gaoang Li, Xi Zhejiang Univ Coll Comp Sci & Technol Hangzhou Zhejiang Peoples R China Zhejiang Univ Polytech Inst Hangzhou Zhejiang Peoples R China Baidu Beijing Peoples R China Zhejiang Singapore Innovat & AI Joint Res Lab Singapore Singapore
vision-based roadside 3D object detection has attracted rising attention in autonomous driving domain, since it encompasses inherent advantages in reducing blind spots and expanding perception range. While previous wo... 详细信息
来源: 评论
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Und...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Chen, Sijin Chen, Xin Zhang, Chi Li, Mingsheng Yu, Gang Fei, Hao Zhu, Hongyuan Fan, Jiayuan Chen, Tao Fudan Univ Shanghai Peoples R China Tencent PCG Shenzhen Peoples R China Natl Univ Singapore Singapore Singapore ASTAR Inst InfoComm Res I2R Singapore Singapore ASTAR Ctr Frontier AI Res CFAR Singapore Singapore
Recent progress in Large Multimodal Models (LMM) has opened up great possibilities for various applications in the field of human-machine interactions. However, developing LMMs that can comprehend, reason, and plan in... 详细信息
来源: 评论
Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
Embodied Multi-Modal Agent trained by an LLM from a Parallel...
收藏 引用
ieee/cvf conference on computer vision and pattern recognition (cvpr)
作者: Yang, Yijun Zhou, Tianyi Li, Kanxue Tao, Dapeng Li, Lusong Shen, Li He, Xiaodong Jiang, Jing Shi, Yuhui Southern Univ Sci & Technol Shenzhen Peoples R China Univ Maryland College Pk MD 20742 USA Yunnan Univ Kunming Yunnan Peoples R China JD Explore Acad Beijing Peoples R China Univ Technol Sydney Sydney NSW Australia
While large language models (LLMs) excel in a simulated world of texts, they struggle to interact with the more realistic world without perceptions of other modalities such as visual or audio signals. Although vision-... 详细信息
来源: 评论