Many big data systems are written in languages such as C, C++, Java, and Scala for high efficiency, whereas data analysts often use python to conduct data wrangling, statistical analysis, and machine learning. User-de...
详细信息
ISBN:
(纸本)9798400704222
Many big data systems are written in languages such as C, C++, Java, and Scala for high efficiency, whereas data analysts often use python to conduct data wrangling, statistical analysis, and machine learning. User-defined functions (udfs) are commonly used in these systems to bridge the gap between the two ecosystems. Debugging complex udfs in data-processing systems is challenging due to the required coordination between language debuggers and the data-processing engine, as well as the debugging overhead on large volumes of data. In this paper, we showcase Udon, a novel debugger to support line-by-line debugging of udfs in data-processing systems. Udon encapsulates modern line-by-line debugging primitives, such as those to set breakpoints, perform code inspections, and make code modifications while executing a udf on a single tuple. In this demonstration, we use real-world scenarios to showcase the experience of using Udon for line-by-line debugging of a udf.
暂无评论