In current social computing systems, both hardware and software are simply discarded at the end of their service life, and this discard-only model can result in huge waste. The major challenge in green computing is the recyclability of the computing system. To address this challenge in the software field, this paper proposes a design idea for green software that embodies adaptability and recyclability. The adaptable and recyclable strategy may consist of two phases: the first compresses the increasingly deep software stack, and the second maintains functionality recycling and code reuse. Adaptability and recyclability mean automatically decomposing complex software into several parts that are easy to reuse, and automatically selecting the feasible parts for the software being written. We also explore the system software design path toward adaptability and recyclability, both in our previous work and in future work.
Thread migration is an effective technique for fault resilience and load balancing in high performance computing. However, flexible thread migration is not easy to achieve. In this paper, we present an approach to creating migratory threads dynamically at the application level. The targets must be executable source files. Compared with static schemes, the number of threads created during migration under the new scheme is independent of the context. Various thread cases are considered for migration. Thread state is handled at the application level, and variables are saved in C structures for fast state reconstruction. Because traditional methods cannot track the created threads, a tree of active function calls is used to record the callback. The preprocessor for the proposed approach is written in YACC for accuracy and programmability. With the help of the preprocessor, the migration-enabling process is transparent to programmers. The experimental results demonstrate the effectiveness of this approach.
As the semiconductor industry advances to nano-technology points, Network on Chip (NoC) components are becoming vulnerable to errors during the system operation. Consequently, fault-tolerant techniques for NoC are nee...
Mobile edge computing (MEC) has produced incredible outcomes in the context of computationally intensive mobile applications by offloading computation to a neighboring server to limit the energy usage of user equipmen...
ISBN: 9798350362244 (digital); 9798350362251 (print)
The need for precise timing is increasing with the development of the smart city, and intelligent transport is a key application that often relies on the distribution of accurate timing. Existing time synchronization solutions cannot adequately meet the precise timing needs of practical intelligent transportation applications. Global Navigation Satellite System (GNSS) signals, for instance, can be obstructed, weakened, or deflected in urban canyon environments. While certain sectors such as telecommunications and smart grids have successfully adopted IEEE 1588 for high-precision time synchronization, its dependence on standalone networks and its limited application within local area networks pose challenges when catering to the wide-ranging and mobile demands of intelligent transportation applications. This paper categorizes the time synchronization requirements of intelligent transport and proposes a time synchronization scheme over converged networks. The scheme separates time transmission from precision compensation based on software-defined networking (SDN) principles, making it possible to provide precise timing for intelligent transport devices, including on-board units (OBUs) and roadside units (RSUs), over wired and wireless communication networks. Experimental results indicate that the proposed scheme achieves a relative time error of less than 1.2 microseconds over the converged networks.
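For readers unfamiliar with IEEE 1588, the abstract's baseline can be illustrated with the standard delay request-response calculation. The sketch below is not the paper's converged-network scheme; it only shows how a slave clock's offset and the one-way path delay are conventionally derived from four timestamps, assuming a symmetric path. The function name and the example timestamps are hypothetical.

```python
def ptp_offset_and_delay(t1, t2, t3, t4):
    """Standard IEEE 1588 delay request-response arithmetic.

    t1: master sends Sync            (read on the master clock)
    t2: slave receives Sync          (read on the slave clock)
    t3: slave sends Delay_Req        (read on the slave clock)
    t4: master receives Delay_Req    (read on the master clock)

    Assumes the forward and reverse path delays are equal.
    Returns (offset of slave relative to master, one-way path delay).
    """
    master_to_slave = t2 - t1  # path delay + offset
    slave_to_master = t4 - t3  # path delay - offset
    offset = (master_to_slave - slave_to_master) / 2
    delay = (master_to_slave + slave_to_master) / 2
    return offset, delay


# Hypothetical timestamps in microseconds: the slave runs 5 us ahead of
# the master and the one-way path delay is 2 us.
offset, delay = ptp_offset_and_delay(t1=100, t2=107, t3=120, t4=117)
print(offset, delay)
```

Any asymmetry between the two directions appears directly as an offset error in this calculation, which is one reason path behavior matters when the exchange runs over mixed wired and wireless links.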
A cloud computing provider can dynamically allocate virtual machines (VMs) based on the needs of its customers, while maintaining privileged access to the management virtual machine that directly manages the hardware and supports the guest VMs. The customers must trust the cloud provider to protect the confidentiality and integrity of their applications and data. However, because VMs from different customers run on the same host, an attack on the management virtual machine can easily lead to the compromise of the guest VMs. It is therefore critical for a cloud computing system to ensure the trustworthiness of its management VMs. To this end, we propose VMGuard, an integrity monitoring and detection system for management virtual machines in a distributed environment. VMGuard uses a special VM, the Guard Domain, which runs on each physical node to monitor the co-resident management VMs. The integrity measurements collected by the Guard Domains are sent to the VMGuard server for secure storage and independent analysis. The experimental evaluation of a Xen-based prototype shows that VMGuard can quickly detect rootkit attacks with low performance overhead.
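The abstract does not say how VMGuard computes or compares its integrity measurements. As a generic illustration of the idea only, the sketch below assumes hash-based measurement: each monitored component is hashed with SHA-256 and compared against a trusted baseline held by a separate server. The function names and component names are hypothetical and are not taken from VMGuard.

```python
import hashlib


def measure(content: bytes) -> str:
    """Return a SHA-256 digest used as the integrity measurement."""
    return hashlib.sha256(content).hexdigest()


def find_violations(baseline: dict, current: dict) -> list:
    """Compare current component contents against baseline digests.

    baseline: component name -> trusted digest (hex string)
    current:  component name -> content observed now (bytes)
    Returns the sorted names of components that are missing or modified.
    """
    violations = []
    for name, expected in baseline.items():
        observed = current.get(name)
        if observed is None or measure(observed) != expected:
            violations.append(name)
    return sorted(violations)


# Hypothetical components of a management VM.
baseline = {
    "component_a": measure(b"trusted code a"),
    "component_b": measure(b"trusted code b"),
}
current = {
    "component_a": b"trusted code a",
    "component_b": b"tampered code b",
}
print(find_violations(baseline, current))
```

Keeping the baseline and the comparison on a separate server, as the abstract describes for the VMGuard server, means an attacker who compromises the monitored node cannot also rewrite the reference values.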
Heterogeneity is considered a solution for scaling supercomputers to petascale. Many systems composed of general-purpose CPUs and special processing units such as Cells, GPGPUs and FPGAs have been implemented. In these systems, the CPU needs to interact with the special processing units to process data together, so communication between these heterogeneous processing units becomes a key problem, and the communication subsystem should provide low latency and high bandwidth. In this paper, we propose HPP-Controller, which is designed to connect two different types of CPUs (AMD and Loongson) in one node. It connects heterogeneous CPUs on top of a non-coherent HyperTransport (HT) fabric and supports a Global Physical Address Space. We implement an FPGA-based prototype and evaluate it experimentally. Initial results show that HPP-Controller achieves a low latency of 0.75 us and a high bandwidth close to that of the HT links.
Heterogeneous multi-core processors have strong potential for performance improvement, energy efficiency and area efficiency, compared to the homogeneous multi-core processors. The present methods of execution migrati...
ISBN: (print) 9781538670125; 9781538670118
Orthogonal multiple access (OMA) is widely employed in traditional cellular wireless communication systems because it requires only a low-complexity receiver. However, future wireless communication systems call for a multiple access scheme with higher spectral efficiency and the ability to serve more users than OMA. In this work, we propose a non-orthogonal multiple access scheme called sparse superposition multiple access (SSMA), which superposes more users onto a limited number of time-frequency resources for uplink transmission in fifth generation (5G) wireless communication systems. Moreover, we design two sparse superposition matrices (SSMs) for 150% and 200% overloaded SSMA schemes. We then propose a low-complexity multiuser detection method based on a belief propagation detection and decoding algorithm. Furthermore, we use extrinsic information transfer charts to analyze the superiority of the proposed SSMs theoretically, and the numerical results show that the proposed SSM with an overload of 150% has a gain of about 1 dB over the existing scheme. Finally, simulation results demonstrate that the proposed scheme significantly outperforms both the traditional OMA scheme and the existing scheme.
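To make the overload figures concrete: an overload of 150% means 1.5 users per time-frequency resource. The sketch below uses a hypothetical 4 x 6 binary indicator matrix (6 users sharing 4 resources), chosen only for illustration. It is not one of the paper's SSMs, whose entries the abstract does not give.

```python
# Hypothetical sparse indicator matrix: rows are resources, columns are users.
# A 1 means the user transmits on that resource. Not taken from the paper.
EXAMPLE_MATRIX = [
    [1, 1, 1, 0, 0, 0],
    [1, 0, 0, 1, 1, 0],
    [0, 1, 0, 1, 0, 1],
    [0, 0, 1, 0, 1, 1],
]


def overload_factor(matrix):
    """Number of users divided by number of resources."""
    resources = len(matrix)
    users = len(matrix[0])
    return users / resources


def users_per_resource(matrix):
    """How many users are superposed on each resource (row sums)."""
    return [sum(row) for row in matrix]


def resources_per_user(matrix):
    """How many resources each user occupies (column sums)."""
    return [sum(col) for col in zip(*matrix)]


print(overload_factor(EXAMPLE_MATRIX))     # 6 users / 4 resources
print(users_per_resource(EXAMPLE_MATRIX))
print(resources_per_user(EXAMPLE_MATRIX))
```

In this example each resource carries only three of the six users. Sparsity of this kind is what keeps message passing tractable in belief-propagation-style detectors, since each resource node exchanges messages with only a few user nodes.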
Optimization-directed inlining is a promising direction for inlining, but it does not take the execution frequency and size of the function into account. A traditional inlining model does consider execution frequency and function size, but it does not consider the optimizations that become possible after inlining. In this paper, a new inline model, the loop-fusion-conscious inline model, is proposed to avoid these drawbacks of earlier inline models. It considers execution frequency, size, and optimization together. An inlining method that considers only loop fusion is implemented and added to ORC's original inline model. The new inline model is then built and tuned for high performance. The experiments show that temperature (execution frequency) is not effective in some cases, and the reason is analyzed. Experimental results show that the new model can greatly improve the performance of the compiler: the peak performance of some SPEC CPU 2000 benchmarks increases by as much as 6%, and by 1% on average.