With the rapid development of network and mobile devices, the era of ten thousands households interconnection is gradually coming. The variety of applications is endless, putting higher demands on computing resources ...
详细信息
The rapid rise in spatial data volumes from diverse sources necessitate efficient spatial data processing capability. Although most relational databases support spatial extensions of SQL query features, they offer lim...
详细信息
Sysrepo is a YANG(Yet another next Generation)-based configuration and operational state data store for Unix/Linux based applications. Applications will be able to use sysrepo to store their configurations, provided b...
详细信息
With the development of machine learning technology in various fields, such as medical care, smart manufacturing, etc., the data has exploded. It is a challenge to train a deep learning model for different application...
详细信息
The proceedings contain 26 papers. The topics discussed include: a selective and biased choice of techniques for building a distributed data store;accelerating the performance of distributed stream processing systems ...
ISBN:
(纸本)9798400701221
The proceedings contain 26 papers. The topics discussed include: a selective and biased choice of techniques for building a distributed data store;accelerating the performance of distributed stream processing systems with in-network computing;secure distributed data and event processing at scale: where are we now?;adaptive distributed streaming similarity joins;I will survive: an event-driven conformance checking approach over process streams;on improving streaming system Autoscaler behavior using windowing and weighting methods;practical forecasting of cryptocoins timeseries using correlation patterns;an exploratory analysis of methods for real-time data deduplication in streaming processes;considerations for integrating virtual threads in a Java framework: a Quarkus example in a resource-constrained environment;and discovery of breakout patterns in financial tick data via parallel stream processing with in-order guarantees.
Enforcing scheduling policies at end-hosts with software schedulers suffers from high CPU consumption, low throughput, and inaccuracy. Offloading scheduling functions to the network interface card (NIC) provides a pro...
详细信息
ISBN:
(数字)9781665471770
ISBN:
(纸本)9781665471770
Enforcing scheduling policies at end-hosts with software schedulers suffers from high CPU consumption, low throughput, and inaccuracy. Offloading scheduling functions to the network interface card (NIC) provides a promising direction to address these problems. However, existing efforts in scheduling offloading suffer from inflexible on-NIC packet schedulers, which cannot execute complex hierarchies of network policies. In this paper, we present FlowValve, the first parallel packet scheduler for Network Processor (NP)-based SmartNlCs that offloads critical functions of Linux traffic control, including packet classifying and scheduling. The key insight behind FlowValve is to abstract inherent queues attached to the NIC interface (wire side) as a single FIFO queue and perform specialized tail drop to mix the FIFO queue with expected flow proportions. FlowValve takes advantage of on-chip multi-core parallelism and hardware accelerations to produce high throughput. Meanwhile, it substantially reduces CPU and memory burdens on endhosts. We prototype FlowValve on a Netronome Agilio SmartNIC and demonstrate its effectiveness against non-offloaded kernel schedulers and DPDK QoS Scheduler. We find that FlowValve outperforms both in accurately enforcing network policies while driving line rate performance (i.e., 40Gbps), which contributes to saving at least two CPU cores.
Recent advances in generative AI epitomized by Large Language Models (LLMs) demonstrated remarkable capabilities in generating human-like text and understanding contextual nuances. In parallel, advances in domestic ap...
详细信息
The proceedings contain 50 papers. The topics discussed include: cohort-based federated learning credit evaluation method in the metaverse;prediction of the number of postgraduate entrance examination applicants based...
ISBN:
(纸本)9781450399883
The proceedings contain 50 papers. The topics discussed include: cohort-based federated learning credit evaluation method in the metaverse;prediction of the number of postgraduate entrance examination applicants based on LSTM and statistical analysis method;a probe set determination method based on spanning tree algorithm;semantic deep mining method for recommendation based on small-world graphs;a novel distributed K-Means clustering algorithm for big text data;an optimized hybrid gauss-seidel smoother in AMG Solver of Hypre on Sunway many-core architecture;matrix multiplication with diagonals: structured sparse matrices and beyond;PEPC: parallel and extensible PC implementation for causal structure learning;and parallel optimization of SLIC algorithm for new-generation Sunway processors.
With the ever-increasing computational demand of DNN training workloads, distributed training has been widely adopted. A combination of data, model and pipeline parallelism strategy, called hybrid parallelism distribu...
详细信息
ISBN:
(纸本)9798400701405
With the ever-increasing computational demand of DNN training workloads, distributed training has been widely adopted. A combination of data, model and pipeline parallelism strategy, called hybrid parallelism distributed training, is imported to tackle the problem of deploying large-scale models. However, how to evaluate the hybrid strategy and the utilization of each device remains a challenge since existing works either profile on a real large-scale cluster with high time and money costs or only analyze a specific type of parallelism without considering the hybrid parallelism. In this work, we proposed DistSim, an event-based performance model to accurately analyze each device's computation and communication activities with low profiling costs. DistDim breaks down the model into events according to the given distributed strategy, which can be profiled on two nodes. Then DistSim leverages the hierarchy of different parallel strategies to generate the computation and communication event-flow from layer level to model level and finally the activity timeline of each device participating in training. Experiment shows that DistSim can reach <4% errors when predicting distributing training batch time and <5% errors when predicting a single device's activity time in various hybrid strategy settings. We also provide a use-case of DistSim, automatically evaluate and search the best distributed training strategy, and find a hybrid strategy with at most 7.37x throughput improvement.
Connected vehicles need to generate, store, process, and exchange a multitude of information with their environment. Much of this information is privacy-critical and thus regulated by privacy laws like the GDPR ha Eur...
详细信息
ISBN:
(纸本)9781665469586
Connected vehicles need to generate, store, process, and exchange a multitude of information with their environment. Much of this information is privacy-critical and thus regulated by privacy laws like the GDPR ha Europe. In this paper, we analyze and rate exemplary data (flows) of the electric driving domain with regard to their criticality based on a reference architecture. We classify the corresponding ECUs based on their processed privacy-critical data and propose technical mitigation measures and technologies in form of generic privacy-enhancing building blocks according to the classification and requirements derived from the GDPR.
暂无评论