On-device deep learning enables mobile devices to perform complex tasks, such as object detection and voice translation, regardless of the network condition. The advanced deep learning model gives an excellent perform...
详细信息
To deal with the adjustment difficulties caused by the access of photovoltaic (PV) in distribution network, this paper introduces a reactive power optimization system of distribution network based on edge calculation....
详细信息
The present paper provides a new algorithm for distributed optimization of Mixed-Integer Nonlinear Programs (MINLPs) with separable, affinely coupled decision variables. The algorithm is based on a standard branch and...
详细信息
ISBN:
(纸本)9781450377324
The present paper provides a new algorithm for distributed optimization of Mixed-Integer Nonlinear Programs (MINLPs) with separable, affinely coupled decision variables. The algorithm is based on a standard branch and bound method but extended to allow for distributed or parallelcomputing framework. For the subclass of Mixed-Integer Convex Programs, convergence and optimality guarantees are given. A suite of benchmark problems are used to evaluate the performance of the new distributed algorithm.
The CMS experiment requires vast amounts of computational capacity in order to generate, process and analyze the data coming from proton-proton collisions at the Large Hadron Collider, as well as Monte Carlo simulatio...
详细信息
As a NP-hard problem, it is always baffling to figure out a scheduling strategy to arrange the interconnected tasks of a workflow on the infinite number of resources in the cloud environment so that the workflow can b...
详细信息
ISBN:
(纸本)9781665426558
As a NP-hard problem, it is always baffling to figure out a scheduling strategy to arrange the interconnected tasks of a workflow on the infinite number of resources in the cloud environment so that the workflow can be addressed efficiently and robustly. This paper focuses on scheduling the workflow's tasks on the cloud resources with less rental cost of resources while the whole schedule length (makespan) will not exceed the given deadline. As one of the most popular evolutionary algorithms, particle swarm optimization (PSO) has been successfully applied for the workflow scheduling problem. Inspired by the idea of multiple groups and the distributedparallelcomputing, we develop an enhanced PSO algorithm for the workflow scheduling problem in clouds. Besides, a pretreatment strategy is adopted to simplify the workflow's structure. The experimental results demonstrate that our proposal has good performance on improving the algorithm's searching ability and finding better solutions.
Machine Learning (ML) is extensively employed for key functions of Connected and Autonomous Vehicles (CAVs), where many models are executed simultaneously within a vehicle to provide diverse applications, from percept...
详细信息
ISBN:
(数字)9798350354812
ISBN:
(纸本)9798350354829
Machine Learning (ML) is extensively employed for key functions of Connected and Autonomous Vehicles (CAVs), where many models are executed simultaneously within a vehicle to provide diverse applications, from perception to planning and control. One of the most appealing ML approaches for CAVs is Federated Learning (FL) due to its privacy-preserving nature and distributed learning capabilities. However, current FL approaches mostly focus on single-model training and are unsuitable for parallel training of multiple models. Multi-model FL involves training multiple ML models to perform different tasks, often simultaneously, to meet the demands of different applications within the same context. In this way, this work introduces MELRO, an FL model assignment algorithm based on link duration, training latency, and data entropy from CAVs. MELRO balances computing resources and addresses high vehicle mobility while considering the heterogeneity of data and availability of resources in CAVs. The assignment algorithm takes advantage of data transmitted periodically by CAVs, such as beacons, to calculate link duration and training latency, define the model assignment matrix for CAVs, and consider data entropy. Finally, MELRO increases accuracy for FL applications by at least 11.76% while reducing training latency by 25% and maintaining computational resource usage.
The distributed alternating direction method of mul-tipliers (ADMM) is an effective algorithm to solve large-scale op-timization problems. However, there are still massive computation and communication cost in distrib...
详细信息
The distributed alternating direction method of mul-tipliers (ADMM) is an effective algorithm to solve large-scale op-timization problems. However, there are still massive computation and communication cost in distributed ADMM when processing high-dimensional data. To solve this problem, we propose a distributed ADMM with sparse computation and Allreduce communication (SCAC-ADMM) which can process high-dimensional data effectively. In the algorithm, each node optimizes a sub-model of the target model in parallel. Then, the target model is obtained by aggregating all sub-models. The features in the sub-model are named associated features. In SCAC-ADMM, we first design a selecting method of associated features to determine the composition of each sub-model. This method can limit the dimension of the sub-model by setting appropriate parameters, so as to limit the computation cost. Secondly, to reduce the communication traffic caused by transmitting high-dimensional parameters, we propose a novel Allreduce communication model which can only aggregate associated parameters in sub-models. Experiments on high-dimensional datasets show that SCAC-ADMM has less computation cost and higher communication efficiency than traditional distributed ADMM. When solving large-scale logistic regression problem, SCAC-ADMM can reduce the system time by 73% compared with traditional distributed ADMM.
In the field of multi-core computing systems, optimizing the distribution of tasks across multiple cores is essential to harness maximum performance potential. Traditional static task allocation methods, often employe...
详细信息
ISBN:
(数字)9798331540364
ISBN:
(纸本)9798331540371
In the field of multi-core computing systems, optimizing the distribution of tasks across multiple cores is essential to harness maximum performance potential. Traditional static task allocation methods, often employed in lexical analysis-the initial and vital phase of compiling and interpreting source code-frequently result in imbalanced workloads and suboptimal resource utilization. This paper introduces an innovative and sophisticated mathematical model for dynamic task allocation specifically tailored to lexical analysis, aiming to revolutionize how tasks are distributed across multiple cores. By employing advanced dynamic allocation techniques, our proposed model not only ensures balanced workload distribution but also scales efficiently with varying computational demands. This novel approach significantly enhances the efficiency of lexical analysis in multi-core environments, leading to substantial improvements in processing times and resource utilization. Extensive experimental results validate the superiority of our method, showcasing dramatic reductions in processing latency and a more effective exploitation of multi-core capabilities. This dynamic model not only advances the field of lexical analysis but also sets a new benchmark for task allocation strategies in multi-core computing systems, promising broader applicability and future innovations in compiler design and resource management.
In Data-Intensive Scalable computing (DISC) Systems, data transformations are concealed by exposed APIs, and intermediate execution moments are masked under dataflow transitions. Consequently, many crucial features an...
详细信息
ISBN:
(纸本)9781728170022
In Data-Intensive Scalable computing (DISC) Systems, data transformations are concealed by exposed APIs, and intermediate execution moments are masked under dataflow transitions. Consequently, many crucial features and optimizations (e.g., debugging, data provenance, runtime skew detection) are not well-supported. Inspired by our experience in implementing features and optimizations over DISC systems, we present SEIZE, a unified framework that enables dataflow inspection wiretapping the data-path with listening logic -in MapReduce-style programming model. We generalize our lessons learned by providing a set of primitives defining dataflow inspection, orchestration options for different inspection granularities, and operator decomposition and dataflow puncutation strategy for dataflow intervention. We demonstrate the generality and flexibility of the approach by deploying SEIZE in both Apache Spark and Apache Flink. Our experiments show that, the overhead introduced by the inspection logic is most of the time negligible (less than 5% in Spark and 10% in Flink).
The distributed network control in software-defined networking is greatly challenged by the design demands of evolving a running distributed control plane (DCP), and by the complexity of handling the control state dyn...
详细信息
The distributed network control in software-defined networking is greatly challenged by the design demands of evolving a running distributed control plane (DCP), and by the complexity of handling the control state dynamics. In this paper, we propose RIFFLE to address the problem using a distributed network operating system approach. RIFFLE enables the evolvability of control states of DCP deployed on a global scale. RIFFLE handles spatial and temporal dynamics that occur when updating and migrating the distributed control states, which overcomes the critical barrier to the evolvability. It fills the gap by enabling temporal reconfigurability in DCP. Moreover, RIFFLE reduces the complexities of building and maintaining network control services in DCP by enabling componentization and abstracting the underlying network dynamics. We evaluated the prototype RIFFLE through experiments running in PlanetLab. The results show that the prototype scales well for 135 nodes. We also validate that RIFFLE ensures the continuity and low content request delay for the supported the information-centric network supported during cache update periods owing to the enabled evolvability.
暂无评论