咨询与建议

看过本文的还看了

相关文献

该作者的其他文献

文献详情 >Custom 8-bit floating point va... 收藏
arXiv

Custom 8-bit floating point value format for reducing shared memory bank conflict in approximate nearest neighbor search

作     者:Ootomo, Hiroyuki Naruse, Akira 

作者机构:Dept. of Computer Science Tokyo Institute of Technology Tokyo Japan NVIDIA Tokyo Japan 

出 版 物:《arXiv》 (arXiv)

年 卷 期:2023年

核心收录:

主  题:Nearest neighbor search 

摘      要:The k-nearest neighbor search is used in various applications such as machine learning, computer vision, database search, and information retrieval. While the computational cost of the exact nearest neighbor search is enormous, an approximate nearest neighbor search (ANNS) has been attracting much attention. IVFPQ is one of the ANNS methods. Although we can leverage the high bandwidth and low latency of shared memory to compute the search phase of the IVFPQ on NVIDIA GPUs, the throughput can degrade due to shared memory bank conflict. To reduce the bank conflict and improve the search throughput, we propose a custom 8-bit floating point value format. This format doesn’t have a sign bit and can be converted from/to FP32 with a few instructions. We use this format for IVFPQ on GPUs and achieved better performance without significant recall loss compared to FP32 and FP16. Copyright © 2023, The Authors. All rights reserved.

读者评论 与其他读者分享你的观点

用户名:未登录
我的评分