咨询与建议

看过本文的还看了

相关文献

该作者的其他文献

文献详情 >Learning-based Sketches for Fr... 收藏
arXiv

Learning-based Sketches for Frequency Estimation in Data Streams without Ground Truth

作     者:Yuan, Xinyu Qiao, Yan Li, Meng Wei, Zhenchun Feng, Cuiying 

作者机构:School of Computer Science and Information Engineering Hefei University of Technology China School of Information and Software Engineering University of Electronic Science and Technology of China China 

出 版 物:《arXiv》 (arXiv)

年 卷 期:2024年

核心收录:

主  题:Self supervised learning 

摘      要:Estimating the frequency of items on the high-volume, fast data stream has been extensively studied in many areas, such as database and network measurement. Traditional sketch algorithms only allow to give very rough estimates with limited memory cost, whereas some learning-augmented algorithms have been proposed recently, their offline framework requires actual frequencies that are challenging to access in general for training, and speed is too slow for real-time processing, despite the still coarse-grained accuracy. To this end, we propose a more practical learning-based estimation framework namely UCL-sketch, by following the line of equation-based sketch to estimate per-key frequencies. In a nutshell, there are two key techniques: online training via equivalent learning without ground truth, and highly scalable architecture with logical estimation buckets. We implemented experiments on both real-world and synthetic datasets. The results demonstrate that our method greatly outperforms existing state-of-the-art sketches regarding per-key accuracy and distribution, while preserving resource efficiency. Our code is attached in the supplementary material, and will be made publicly available at https://***/Y-debug-sys/UCL-sketch. Copyright © 2024, The Authors. All rights reserved.

读者评论 与其他读者分享你的观点

用户名:未登录
我的评分