With the widespread use of mobile and sensing devices, there has been an explosion of high velocity, transient data having spatial and temporal characteristics. Interactive analysis at such scale and speed require sup...
详细信息
ISBN:
(纸本)9781450395298
With the widespread use of mobile and sensing devices, there has been an explosion of high velocity, transient data having spatial and temporal characteristics. Interactive analysis at such scale and speed require support for highly efficient query processing back-end frameworks coupled with lightweight yet powerful front-end interfaces. While existing in-memory distributedstream processing frameworks are perfect candidates for scalable big data processing, spatio-temporal systems in this domain are mostly dominated by specify-once-apply-continuously query model. Any modification in query state requires query restart limiting system responsiveness and producing outdated or in the worst case erroneous results. Furthermore, most of the contemporary spatio-temporal big data systems are designed to operate in a single execution environment limiting their applicability to users accustomed to other similar frameworks with different APIs. In this paper, we demonstrate SPEAR-Board;an interactive web-based interface integrated with cross-platform stream processing engine;SPEAR, capable of seamlessly handling spatio-temporal query state changes in real-time. We demonstrate working of SPEAR-Board with respect to spatio-temporal Range and Nearest Neighbor queries backed by Apache Spark and Apache Flink deployed over cloud resources.
With the advent of IoT and emerging 5G technology, real-time streaming data are being generated at unprecedented speed and volume having both temporal and spatial dimensions. Effective analysis at such scale and speed...
详细信息
ISBN:
(纸本)9781728191843
With the advent of IoT and emerging 5G technology, real-time streaming data are being generated at unprecedented speed and volume having both temporal and spatial dimensions. Effective analysis at such scale and speed require support for dynamically adjusting querying capabilities in real-time. In the spatio-temporal domain, this warrants data as well as query optimization strategies especially for objects with changing motion states. Contemporary spatio-temporal data stream management systems in the distributed domain are mostly dominated by specify-once-apply-continuously query model. Any modification in query state requires query restart limiting system responsiveness and producing outdated or in worst case erroneous results. In this paper, we propose adaptations of principles from streaming databases, spatial data management, and distributed computing to support dynamic spatio-temporal query processing over high-velocity big data streams. We first formulate a set of spatio-temporal data types and functions to seamlessly handle changes in distributed query states. We develop a comprehensive set of streaming spatio-temporal querying methods and propose geohash based dynamic spatial partitioning for effective parallel processing. We implement a prototype on top of Apache Flink, where the in-memory stream processing fits nicely with our spatio-temporal models. Comparative evaluation of our prototype demonstrates the effectiveness of our strategy by maintaining high consistent processing rates for both stationary as well as moving queries over high velocity spatio-temporal big data streams.
暂无评论