检索结果-内蒙古大学图书馆

2012 International Conference on Mechanical and Electronic Engineering, ICMEE 2012

作者： Lin, Yufei Tang, Yuhua Zhang, Xin National Laboratory for Parallel and Distributed Processing School of Computer National University of Defense Technology Changsha China

ISBN: (纸本)9783642315152

As the electronic technology develops, the integration levels of CPUs and memories keep growing, and the speeds of communication devices are improved. The high-performance computing (HPC) systems consist of processing nodes and communication network, and their sizes are advanced by the development of electronic technology. Then the scalability of a large-scale parallel computing system, i.e. whether the computing performance is increased with the system size, becomes a major goal pursued by designers of parallel algorithms and high-performance parallel machines. parallel speedup is a popular way to measure the scalability. This paper proposes the definition of HPC system scalability based on speedup first, and then analyzes the influence of function G(P), which describes how the workload changes with processor number, on the system scalability. Through case studies, we analyze some typical programs based on our scalability theorems and the results show that our analysis approach is correct. © 2012 Springer-Verlag Berlin Heidelberg.

关键词： Scalability

来源：评论

学校读者我要写书评

暂无评论

Link scheduling in wireless networks with successive interference cancellation

Link scheduling in wireless networks with successive interfe...

引用

International Conference on Mobile Ad-hoc and Sensor Networks

作者： Lv, Shaohe Wang, Xiaodong Zhou, Xingming National Laboratory of Parallel and Distributed Processing National University of Defense Technology Changsha Hunan 410073 China

ISBN: (纸本)9780769543154

Successive interference cancellation (SIC) is an effective technique of multipacket reception to combat interference. As not all collision are resolvable, careful transmission coordination is required. We study link scheduling in wireless networks with SIC at the physical layer. A new model, simultaneity graph (SG), is proposed to characterize the link correlation introduced by SIC. Then two new scheduling schemes are presented: 1) a slotoriented scheme which assigns a maximal feasible link set to a time slot and 2) a link-oriented scheme which assigns each link a sufficient number of slots. The performance is evaluated by simulations and the results demonstrate that the throughput gain is on average 50% and up to 110% over IEEE 802.11. The complexity of SG is only a bit higher than that of the available widely-used models (e.g., conflict graph). © 2010 IEEE.

关键词： Wireless networks

来源：评论

学校读者我要写书评

暂无评论

Versionized process based on non-volatile random-access memory for fine-grained fault tolerance

引用

Frontiers of Information Technology & Electronic Engineering 2018年第2期19卷 192-205页

作者： Wen-zhe ZHANG Kai LU Xiao-ping WANG Science and Technology on Parallel and Distributed Processing Laboratory College of ComputerNational University of Defense Technology

Non-volatile random-access memory（NVRAM） technology is maturing rapidly and its byte-persistence feature allows the design of new and efficient fault tolerance mechanisms. In this paper we propose the versionized process（Ver P）, a new process model based on NVRAM that is natively non-volatile and fault tolerant. We introduce an intermediate software layer that allows us to run a process directly on NVRAM and to put all the process states into NVRAM, and then propose a mechanism to versionize all the process data. Each piece of the process data is given a special version number, which increases with the modification of that piece of data. The version number can effectively help us trace the modification of any data and recover it to a consistent state after a system *** with traditional checkpoint methods, our work can achieve fine-grained fault tolerance at very little cost.

关键词： Non-volatile memory Byte-persistence Versionized process Version number

来源：评论

学校读者我要写书评

暂无评论

HyperSpring: Accurate and stable latency estimation in the hyperbolic space

HyperSpring: Accurate and stable latency estimation in the h...

引用

15th International Conference on parallel and distributed Systems, ICPADS '09

作者： Fu, Yongquan Wang, Yijie National Key Laboratory for Parallel and Distributed Processing School of Computer National University of Defense Technology China

ISBN: (纸本)9780769539003

Predicting network latencies between Internet hosts can efficiently support large-scale Internet applications, e.g., file sharing service and the overlay construction. Several study use the Hyperbolic space to model the Internet densecore and many-tendril structure. However, existing Hyperbolic space based embedding approaches are not designed for accurate latency estimation in the distributed context. We present HyperSpring, which estimates latency by modelling a mass spring system in the Hyperbolic similar with Vivaldi. HyperSpring adopts coordinate initialization to speed up the convergence of coordinate computation, uses multiple-round symmetric updates to escape from bad local minima, and stabilizes coordinates by compensating RTT measurements to reduce the coordinate drifts. Evaluation results based on a network trace of 226 PlanetLab nodes indicate that, compared to Euclidean-space based Vivaldi, HyperSpring provides performance improvements for most nodes, and incurs slightly higher distortions for a small number of nodes. © 2009 IEEE.

关键词： Hyperbolic space Latency estimation Mass spring field

来源：评论

学校读者我要写书评

暂无评论

Storage allocation for redundancy scheme in reliability-aware cloud systems

Storage allocation for redundancy scheme in reliability-awar...

引用

2011 IEEE 3rd International Conference on Communication Software and Networks, ICCSN 2011

作者： Huang, Zhen Yuan, Yuan Peng, Yuxing National Laboratory of Parallel and Distributed Processing Computer Department National University of Defense Technology Changsha China

ISBN: (纸本)9781612844855

As the burst increasing of created and demand on information and data, the efficient solution on storage management is highly required in the cloud storage systems. As an important component of management, storage allocation scheme aims to use a low redundancy and also to achieve a high reliability. However, the two aims are hard to be unified. Considering the practical situation of Cloud systems, we propose a systematic storage allocation scheme to touch them both. And we also study the impact of many factors to the data reliability. © 2011 IEEE.

关键词： Information management

来源：评论

学校读者我要写书评

暂无评论

Automatic Data Distribution for Improving Data Locality on the Cell BE Architecture

Automatic Data Distribution for Improving Data Locality on t...

引用

22nd International Workshop on Languages and Compilers for parallel Computing

作者： Wang, Miao Bodin, Francois Matz, Sebastien National Laboratory for Parallel and Distributed Processing NUDT China CAPS Entreprise Rennes France University of Rennes 1 Rennes France

ISBN: (纸本)9783642133732

Multicore systems provide potential to improve the performance of the applications. However, substantial programming effort is required to exploit the power of the parallelism. This paper presents a single source compiler to map the data-parallel programs onto Cell Broadband Engine. Based on the distributed memory model, the compiler performs automatic data distribution and generates SPMD programs with message-passing primitives for Cell. We evaluate our compiler using a range of computation intensive benchmarks, high performance is achieved on Cell platform. In contrast to OpenMP, our method can fully exploit data locality through managing the shared data using inter-processor communication instead of accessing main memory, which significantly reduces the off-chip memory access overhead.

关键词： Message passing

来源：评论

学校读者我要写书评

暂无评论

Iaso： an autonomous fault-tolerant management system for supercomputers

引用

Frontiers of Computer Science 2014年第3期8卷 378-390页

作者： Kai LU Xiaoping WANG Gen LI Ruibo WANG Wanqing CHI Yongpeng LIU Hongwei TANG Hua FENG Yinghui GAO Science and Technology on Parallel and Distributed Processing Laboratory National University of Defense Technology Changsha 410073 China College of Computer National University of Defense Technology Changsha 410073 China ATR Laboratory National University of Defense Technology Changsha 410073 China

With the increase of system scale, the inherent reliability of supercomputers becomes lower and lower. The cost of fault handling and task recovery increases so rapidly that the reliability issue will soon harm the usability of supercomputers. This issue is referred to as the ＂reliability wall＂, which is regarded as a critical problem for current and future supercomputers. To address this problem, we propose an autonomous fault-tolerant system, named Iaso, in MilkyWay- 2 system. Iaso introduces the concept of autonomous management in supercomputers. By autonomous management, the computer itself, rather than manpower, takes charge of the fault management work. Iaso automatically manage the whole lifecycle of faults, including fault detection, fault diagnosis, fault isolation, and task recovery. Iaso endows the autonomous features with MilkyWay-2 system, such as self-awareness, self-diagnosis, self-healing, and self-protection. With the help of Iaso, the cost of fault handling in supercomputers reduces from several hours to a few seconds. Iaso greatly improves the usability and reliability of MilkyWay-2 system.

关键词： supercomputer autonomous management fault tolerant fault management MilkyWay-2 system

来源：评论

学校读者我要写书评

暂无评论

Automated WCET analysis based on program modes

Automated WCET analysis based on program modes

引用

1st International Workshop on Automation of Software Test, AST'06, Co-located with the 28th International Conference on Software Engineering, ICSE 2009

作者： Ji, Meng-Luo Wang, Ji Li, Shuhao Qi, Zhi-Chang School of Computer National University of Defense Technology China National Laboratory for Parallel and Distributed Processing China

ISBN: (纸本)1595934081

Program mode is a regular trajectory of the execution of a program that is determined by the values of its input variables. By exploiting program modes we may make Worst Case Execution Time (WCET) analysis more precise. This paper presents a novel method to automatically find program modes and calculate the WCET of programs. It consists of two phases. In phase one, we firstly automatically find the modes of a program by mode-relevant program slicing;then we compute the precondition for each mode using a path-wise test data generation method;after that, we can either conclude that it is an infeasible path, or get its precondition. In phase two, we calculate the WCET estimate of each given mode for modern RISC processors with caches and pipelines. The experiments are demonstrated to show the effectiveness of the method. Copyright 2006 ACM.

关键词： Real time systems

来源：评论

学校读者我要写书评

暂无评论

Automating software FMEA via formal analysis of dependence relations

Automating software FMEA via formal analysis of dependence r...

引用

32nd Annual IEEE International Computer Software and Applications Conference, COMPSAC 2008

作者： Dong, Wei Wang, Ji Zhao, Changzhi Zhang, Xian Tian, Jie School of Computer National University of Defense Technology China National Laboratory for Parallel and Distributed Processing China

ISBN: (纸本)9780769532622

The paper presents the ongoing work of studying FMEA method for embedded safety critical software via formal analysis of various dependence relations among software elements, which can fairly improve the automation and precision of both system level and detailed level FMEA. These dependence relations are depicted by the formal models abstracted from software design and implementation, and the FMEA processes for both structural and object-oriented software are proposed respectively. The initial result of case study shows the effectiveness of the approach. I 2008 IEEE.

关键词： Safety engineering

来源：评论

学校读者我要写书评

暂无评论

Towards a new methodology for estimating available bandwidth on network paths

引用

7th International Symposium on Advanced parallel processing Technologies, APPT 2007

作者： Lv, Shaohe Wang, Xiaodong Zhou, Xingming Yin, Jianping National Laboratory for Parallel and Distributed Processing National University of Defense Technology Changsha Hunan 410073 China

ISBN: (纸本)9783540768364

This paper presents a novel methodology, called COPP, to estimate available bandwidth over a given network path. COPP deploys a particular probe scheme, namely chirp of packet pairs, which is composed of several packet pairs with decremental inter-packet spacing. After each pair chirp is received, COPP discovers all turning points, e.g. such packet pair that is interfered more seriously in contrast to its adjacent pairs, and give each point a distinct weight according to the degree to which the pair and its consecutive neighbors are interfered by cross traffic to yield an estimate. The final estimate is the average of the results of all chirps in a measurement episode. Two decision rules are developed to determine whether a packet pair is turning point We test COPP in various simulations and find that COPP can provide accurate results with relatively less overhead while adapt to network variations rapidly. © Springer-Verlag Berlin Heidelberg 2007.

关键词： Bandwidth

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：