The following topics are dealt with: parallel, distributed and network-based processing; performance analysis; Web computing; failure handling; Java and Jini; parallel and distributed programming tools for grids; unor...
详细信息
ISBN:
(纸本)0769514448
The following topics are dealt with: parallel, distributed and network-based processing; performance analysis; Web computing; failure handling; Java and Jini; parallel and distributed programming tools for grids; unorthodox computing architectures; systems and applications; message passing; scheduling; algorithms; and mobile ad hoc networks.
distributed lock services are extensively utilized in distributed systems to serialize concurrent accesses to shared resources. The need for fast and scalable lock services has become more pronounced with decreasing t...
ISBN:
(纸本)9781939133403
distributed lock services are extensively utilized in distributed systems to serialize concurrent accesses to shared resources. The need for fast and scalable lock services has become more pronounced with decreasing task execution times and expanding dataset scales. However, traditional lock managers, reliant on server CPUs to handle lock requests, experience significant queuing delays in lock grant latency. Advanced network hardware (e.g. programmable switches) presents an avenue to manage locks without queuing delays due to their high packet processing power. Nevertheless, their constrained memory capacity restricts the manageable lock scale, thereby limiting their effect in large-scale *** paper presents FISSLOCK, a fast and scalable distributed lock service that exploits the programmable switch to improve (tail) latency and peak throughput for millions of locks. The key idea behind FISSLOCK is the concept of lock fission, which decouples lock management into grant decision and participant maintenance. FISSLOCK leverages the programmable switch to decide lock grants synchronously and relies on servers to maintain participants (i.e., holders and waiters) asynchronously. By using the programmable switch for routing, FISSLOCK enables on-demand fine-grained lock migration, thereby reducing the lock grant and release delays. FISSLOCK carefully designs and implements grant decision procedure on the programmable switch, supporting over one million locks. Evaluation using various benchmarks and a real-world application shows the efficiency of FISSLOCK. Compared to the state-of-the-art switch-based approach (NetLock), FISSLOCK cuts up to 79.1% (from 43.0%) of median lock grant time in the microbenchmark and improves transaction throughput for TATP and TPC-C by 1.76× and 2.28×, respectively.
With the rise of AI-Generated Content (AIGC) services in wireless networks, efficient and high-quality distributed training of Large Language Models (LLMs) has become essential for enabling the large-scale application...
详细信息
ISBN:
(数字)9798350368369
ISBN:
(纸本)9798350368376
With the rise of AI-Generated Content (AIGC) services in wireless networks, efficient and high-quality distributed training of Large Language Models (LLMs) has become essential for enabling the large-scale application of next generation AI technologies. However, the extensive parameters of LLMs impose significant demands on memory, computing power and communication resources in heterogeneous networks. To efficiently utilize the dispersed network resources, this paper presents a First-Pipeline- Then-Federated Learning (FPTFL) approach with a hybrid parallel scheduling strategy to facilitate the training of Transformer-based LLMs. We propose a block-wise splitting mechanism to partition the Transformer's encoder into distinct segments, which are deployed cross individual devices. The encoder parameters and intermediate smashed data are uploaded to the edge server, where the whole model is updated through federated aggregation. Particularly, we develop a fine-grained computation-efficient method based on pipeline parallelism, enabling the segments to cooperatively train the entire encoder. An optimization problem is formulated to determine the LLM segments and the number of micro-batches under network resource constraints, with the goal of minimizing the total latency of LLM training services. Simulation results demonstrate that our approach enables Transformer-based model training on resource-constrained devices, preserves model performance, and reduces waiting time.
Renewable energy generation forecasting plays crucial roles in advanced smart grid and sustainable practices. Although many RNN related methods have been utilized to predict power generation time series data, they oft...
详细信息
The paper is devoted to the analysis of the network protection mechanism "nervous network system" based on the bio-inspired metaphor. Packet-level simulation is proposed in order to investigate the "ner...
详细信息
ISBN:
(纸本)9780769549392;9781467353212
The paper is devoted to the analysis of the network protection mechanism "nervous network system" based on the bio-inspired metaphor. Packet-level simulation is proposed in order to investigate the "nervous network system" protection mechanism. We describe the architecture of the protection system based on the given mechanism, its operation algorithms and present results of the experiments. Using obtained results the efficiency of the protection mechanism "nervous network system" against infrastructure attacks is analyzed.
Media spaces provide users with flexible support for easy interaction with technology and with each other, both at the same place and over distance. From a technological perspective the development of these environmen...
详细信息
ISBN:
(纸本)9780769539393
Media spaces provide users with flexible support for easy interaction with technology and with each other, both at the same place and over distance. From a technological perspective the development of these environments is often inefficient, since most environments are developed specifically, without any synergies or reuse of previous concepts and implementations. In this paper we present the cooperative media space PPPSpace that is based on powerful technical parallel and distributed software engineering concepts and at the same time easy to use for end-users.
We propose a method for the parallel execution of applications that process continuous streams of data. Unlike pipeline-based approaches, which are frequently employed to parallelize software for multi-core processors...
详细信息
ISBN:
(纸本)9780769543284
We propose a method for the parallel execution of applications that process continuous streams of data. Unlike pipeline-based approaches, which are frequently employed to parallelize software for multi-core processors, our method supports nonlinear structures that may contain conditionals. Nonlinear structures reduce the latency for processing an element from a stream, which is particularly important for embedded systems that are subject to real-time constraints.
The paper proposes a new approach to multicriteria optimization of the structure of secure information interaction in distributed computing systems. A distinguishing feature of the approach is the use of the sequentia...
详细信息
ISBN:
(纸本)9781728165820
The paper proposes a new approach to multicriteria optimization of the structure of secure information interaction in distributed computing systems. A distinguishing feature of the approach is the use of the sequential concessions method. This approach provides the acceptable flexibility in comparing the system construction variants by taking into account not only the priority of the system according to efficiency indicators, but also the relationships of the indicators. The mathematical basis of the proposed method is considered. The procedure of implementation of the proposed method for ensuring security of information interaction in a distributed hierarchical automated control system is demonstrated, and its experimental assessment is given.
parallel programmers mandate high-level parallel programming tools allowing to reduce the effort of the efficient parallelization of their applications. parallel programming leveraging parallel patterns has recently r...
详细信息
ISBN:
(纸本)9781728116440
parallel programmers mandate high-level parallel programming tools allowing to reduce the effort of the efficient parallelization of their applications. parallel programming leveraging parallel patterns has recently received renovated attention thanks to their clear functional and parallel semantics. In this work, we propose a synergy between the well-known Actors-based programming model and the pattern-basedparallelization methodology. We present our preliminary results in that direction, discussing and assessing the implementation of the Map parallel pattern by using an Actor-based software accelerator abstraction that seamlessly integrates within the C++ Actor Framework (ICAF). The results obtained on the Intel Xeon Phi KNL platform demonstrate good performance figures achieved with negligible programming efforts.
In this paper we use the Adapteva Epiphany manycore chip to demonstrate how the throughput and the latency of a baseband signal processing chain, typically found in LTE or WiFi, can be optimized by a combination of ta...
详细信息
ISBN:
(纸本)9781467387767
In this paper we use the Adapteva Epiphany manycore chip to demonstrate how the throughput and the latency of a baseband signal processing chain, typically found in LTE or WiFi, can be optimized by a combination of task- and data parallelization, and data pipelining. The parallelization and data pipelining are facilitated by the shared memory architecture of the Epiphany, and the fact that a processor on one core can write directly into the memory of any other core on the chip.
暂无评论