Search results - Inner Mongolia University Library

Proceedings of the 20th international conference companion on World wide web

Authors: Tang, Jintao; Wang, Ting; Wang, Ji; Lu, Qin; Li, Wenjie. School of Computer, National University of Defense Technology, Changsha, China; Department of Computing, Hong Kong Polytechnic University, Hong Kong, Hong Kong; National Laboratory for Parallel and Distributed Processing, Changsha, China

ISBN: (print) 9781450305181

Applying graph clustering algorithms to real-world networks requires overcoming two main challenges: the lack of prior knowledge and the scalability issue. This paper proposes a novel method based on the topological features of complex networks to optimize clustering algorithms in real-world networks. More specifically, the features are used for parameter estimation and performance optimization. The proposed method is evaluated on real-world networks extracted from the web. Experimental results show improvement in both Adjusted Rand index values and runtime efficiency. © 2011 Authors.

Keywords: Parameter estimation
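
The abstract above reports results as Adjusted Rand index values. As a minimal sketch of how that index could be computed for two flat clusterings of the same nodes, the function below uses the standard pair-counting definition; the function name and the label lists are illustrative assumptions and are not taken from the paper.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_true, labels_pred):
    """Adjusted Rand index between two flat clusterings of the same items."""
    if len(labels_true) != len(labels_pred):
        raise ValueError("label lists must have the same length")
    total_pairs = comb(len(labels_true), 2)
    if total_pairs == 0:
        return 1.0  # fewer than two items: the clusterings trivially agree

    # Contingency counts: how many items share each (true, predicted) label pair.
    cell_counts = Counter(zip(labels_true, labels_pred))
    row_counts = Counter(labels_true)
    col_counts = Counter(labels_pred)

    sum_cells = sum(comb(c, 2) for c in cell_counts.values())
    sum_rows = sum(comb(c, 2) for c in row_counts.values())
    sum_cols = sum(comb(c, 2) for c in col_counts.values())

    expected = sum_rows * sum_cols / total_pairs  # agreement expected by chance
    max_index = (sum_rows + sum_cols) / 2
    if max_index == expected:
        return 1.0  # degenerate case, e.g. both clusterings put everything together
    return (sum_cells - expected) / (max_index - expected)


if __name__ == "__main__":
    # Hypothetical ground-truth communities versus a clustering result.
    truth = [0, 0, 0, 1, 1, 1]
    found = [1, 1, 0, 0, 0, 0]
    print(adjusted_rand_index(truth, found))
```

The index is 1 when the two clusterings match up to a renaming of labels and is close to 0 for chance-level agreement.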

Heterogeneity-Aware Peak Power Management for Accelerator-Based Systems

International Conference on Parallel and Distributed Systems (ICPADS)

Authors: Guibin Wang; Yisong Lin. National Laboratory for Parallel and Distributed Processing, National University of Defense Technology, China

Power management has become one of the first-order considerations in the high-performance computing field. Many recent studies focus on optimizing the performance of a computer system within a given power budget. However, most existing solutions adopt a fixed-period control mechanism and are transparent to the running applications. Although the application-transparent control mechanism has relatively good portability, it exhibits low efficiency in accelerator-based heterogeneous parallel systems. In typical accelerator-based parallel systems, different processing units have largely different processing speeds and power consumption. Under a given power constraint, how to choose the processor to be slowed down and how to schedule a parallel task onto different processors for maximum performance differ from the homogeneous case and have not been well studied. The motivating example in this paper shows that, to harness heterogeneous parallel processing efficiently, one should not only perform dynamic voltage/frequency scaling (DVFS) to meet the power budget, but also tune the parallel task scheduling to adapt to the changes. In this paper, we propose a heterogeneity-aware peak power management method, which extends the existing application-transparent power controller with an application-aware power controller. First, we theoretically analyze the conditions for maximum performance given a power budget for heterogeneous systems. Based on this result, we provide a power-constrained parallel task partition algorithm, which coordinates parallel task partition and voltage scaling for heterogeneous processing units to achieve the optimal performance given a system power budget. Finally, we evaluate the proposed method on a typical CPU-GPU heterogeneous system and validate the superiority of the application-aware power controller over the existing method.

Keywords: Graphics processing unit; Power demand; Partitioning algorithms; Schedules; Kernel; Monitoring; Benchmark testing

Power Optimization for GPU Programs Based on Software Prefetching

IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom)

Authors: Yisong Lin; Tao Tang; Guibin Wang. Key Laboratory of Science and Technology for National Defence of Parallel and Distributed Processing, National University of Defense and Technology, Changsha, China

GPUs offer higher computing-unit density than contemporary CPUs and thus exhibit much higher power consumption despite their higher power efficiency. Power consumption has become an important issue that affects GPU applications, thereby necessitating low-power optimization technology for GPUs. Software prefetching is an efficient way to alleviate the memory wall problem by overlapping computation with memory access latency. However, software prefetching causes some power overhead because it increases the number and density of instructions. Thus, the performance gain must be balanced against the power overhead when applying the optimization. To address this problem, in this paper we first analyze the multi-thread execution model of the GPU and validate the potential of software prefetching optimization. Then we give a software prefetching method for GPU programs to improve performance. Targeting two different objectives, energy optimization under a performance constraint and performance optimization under a power constraint, we discuss optimization methods based on software prefetching and dynamic voltage scaling technologies. The experimental results show that our method can efficiently optimize the energy consumption (performance) under the performance (power) constraint.

Keywords: Prefetching; Graphics processing unit; Optimization; Power demand; Registers

A unique vertex deleting algorithm for graph isomorphism

2011 International Symposium on Image and Data Fusion, ISIDF 2011

Authors: Zhang, Baida; Tang, Yuhua; Wu, Junjie; Huang, Linqi. National Laboratory for Parallel and Distributed Processing, School of Computer, National University of Defense Technology, Changsha, Hunan 410073, China; School of Resources and Safety Engineering, Central South University, Changsha 410083, China

ISBN: (print) 9781457709692

The graph isomorphism problem has applications in many fields, such as chemistry, computer science, electronics, and network theory. However, the exponential complexity of the algorithms makes testing time-consuming. In this paper, a new algorithm named Unique Vertex Delete (UVD for short) is presented to speed up isomorphism testing. The main idea of the UVD algorithm is to delete vertices successively to reduce the scale of the problem, while a new field is added to each vertex to record information about its deleted neighbor vertices. Theoretical analysis and experiments show that the UVD algorithm consistently and significantly outperforms existing state-of-the-art approaches. © 2011 IEEE.

Keywords: Set theory

Component model supporting trustworthiness-oriented software evolution

Ruan Jian Xue Bao/Journal of Software, 2011, Vol. 22, No. 1, pp. 17-27

Authors: Ding, Bo; Wang, Huai-Min; Shi, Dian-Xi; Li, Xiao. School of Computer, National University of Defense Technology, Changsha 410073, China; National Defense Laboratory for Parallel and Distributed Processing, Changsha 410073, China

Environment-driven adaptation is an important means of ensuring software integrity. Confronted with scenarios not anticipated during development, the predefined adaptability of the software should be adjusted to ensure that its behavior agrees with the users' expectations. The premise of this kind of adjustment is efficient software engineering mechanisms. Based on the principle of Separation of Concerns and the Dynamic Software Architecture (DSA) technology, this paper proposes a component model named ACOE (adaptive component model for open environment) that supports online fine-grained adjustment of software adaptability. ACOE encapsulates adaptation concerns, such as sensing, decision, and execution, into components and connectors, and then supports their dynamic configuration with the DSA technology. As a result, a third party can adjust the adaptability by selectively upgrading it when necessary. An ACOE container prototype and experimental applications are implemented to validate this approach. © Copyright 2011, Institute of Software, the Chinese Academy of Sciences. All rights reserved.

Keywords: Memory architecture

Research on Online Failure Prediction Model and Status Pretreatment Method for Exascale System

International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC

Authors: Hao Zhou; Yanhuang Jiang. National Laboratory of Parallel and Distributed Processing, College of Computer Science, National University of Defense Technology, Changsha, China

The reliability issue of Exascale systems is extremely serious. Traditional passive fault-tolerant methods, such as rollback-recovery, can no longer fully guarantee system reliability because of their large execution overhead and long recovery time. Active fault tolerance is expected to become another important fault-tolerant approach for Exascale systems. Focusing on system failure prediction, which is one key step of active fault tolerance, we construct an online failure prediction model and study an effective method for system status pretreatment. To improve the accuracy and timeliness of current methods, the proposed Improved Adaptive Semantic Filter (IASF) method processes the latest system logs regularly, filtering out useless information according to its semantics. Adopting the main idea of the Vector Space Model (VSM), the IASF method creates an Event Vector corresponding to each log record. By calculating the cosine of the angle between vectors, it evaluates the semantic correlation between different log records, and then performs temporal and spatial redundancy filtering that takes the bursty nature of log records into account. The IASF method is insensitive to the type of system log and does not rely on any expert system or domain knowledge. The experimental results show that the system can eliminate about 99.6% of useless log records after executing the IASF method.

Keywords: Fault tolerance; Fault tolerant systems; Vectors; Correlation; Information filters; Predictive models
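
The abstract above describes building a vector per log record and comparing records by the cosine of the angle between their vectors before filtering redundant ones. The sketch below illustrates that general idea only. It is not the IASF method itself: the bag-of-words vectors, the 60-second window, the 0.8 threshold, and all function names are assumptions chosen for illustration, and it covers only temporal redundancy.

```python
from collections import Counter
from math import sqrt


def event_vector(message):
    """Bag-of-words term-frequency vector for one log message (assumed encoding)."""
    return Counter(message.lower().split())


def cosine(u, v):
    """Cosine of the angle between two sparse term-frequency vectors."""
    dot = sum(count * v[term] for term, count in u.items())
    norm_u = sqrt(sum(c * c for c in u.values()))
    norm_v = sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def filter_redundant(records, window=60.0, threshold=0.8):
    """Drop records that closely repeat a recently kept record.

    records: list of (timestamp_seconds, message), sorted by timestamp.
    A record is dropped if a kept record within `window` seconds has
    cosine similarity of at least `threshold`.
    """
    kept = []  # entries are (timestamp, message, vector)
    for ts, msg in records:
        vec = event_vector(msg)
        redundant = any(
            ts - kept_ts <= window and cosine(vec, kept_vec) >= threshold
            for kept_ts, _, kept_vec in kept
        )
        if not redundant:
            kept.append((ts, msg, vec))
    return [(ts, msg) for ts, msg, _ in kept]


if __name__ == "__main__":
    # Hypothetical log records: a burst of repeats, an unrelated event, a late repeat.
    logs = [
        (0.0, "node 12 memory ecc error"),
        (5.0, "node 12 memory ecc error"),
        (10.0, "fan speed warning on rack 3"),
        (200.0, "node 12 memory ecc error"),
    ]
    for ts, msg in filter_redundant(logs):
        print(ts, msg)
```

In this example the repeat at 5 seconds is dropped as part of a burst, while the repeat at 200 seconds is kept because it falls outside the window.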

Parallel Skyline Query Algorithm over Uncertain Data Streams

The 29th China Database Academic Conference

Authors: WANG Guangdong (王广东); WANG Yijie (王意洁); LI Xiaoyong (李小勇); WANG Yuan (王媛). National Key Laboratory for Parallel and Distributed Processing, College of Computer Science, National University of Defense Technology, Changsha 410073

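Skyline query techniques over uncertain data streams are attracting growing attention from researchers. Traditional centralized stream processing algorithms struggle to meet the query demands of massive data, while the massive computing resources and effective storage management model provided by cloud computing offer ample conditions for studying parallel Skyline query techniques. Based on these facts, this paper proposes a parallel Skyline query algorithm over uncertain data streams (PSUDS). By partitioning the sliding window in an interleaved manner, the algorithm converts the centralized stream query into parallel processing, addressing the insufficient processing performance of centralized algorithms through parallel execution. Extensive experimental results show that the algorithm has good parallel scalability.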

Keywords: uncertain data streams; cloud computing; parallel processing; query algorithm; performance testing
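
As a rough illustration of splitting a window across workers and merging local results, the sketch below computes a skyline by interleaved (round-robin) partitioning. It is a simplification under stated assumptions, not the PSUDS algorithm: it treats points as certain rather than uncertain, assumes smaller values are better in every dimension, runs the per-partition step sequentially, and uses hypothetical function names.

```python
def dominates(p, q):
    """p dominates q if p is no worse in every dimension and strictly better in one."""
    return all(a <= b for a, b in zip(p, q)) and any(a < b for a, b in zip(p, q))


def skyline(points):
    """Points that are not dominated by any other point in the list."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


def parallel_skyline(window, workers):
    """Skyline of `window` via interleaved partitioning across `workers` partitions.

    Each partition's local skyline could be computed concurrently; here the
    partitions are processed one after another for simplicity.
    """
    partitions = [window[i::workers] for i in range(workers)]
    local_skylines = [skyline(part) for part in partitions]
    # A globally non-dominated point is also non-dominated within its own
    # partition, so merging the local skylines loses no skyline point.
    candidates = [p for part in local_skylines for p in part]
    return skyline(candidates)


if __name__ == "__main__":
    # Hypothetical two-dimensional window contents.
    window = [(1, 5), (2, 2), (5, 1), (3, 3), (4, 4), (2, 6)]
    print(sorted(parallel_skyline(window, workers=3)))
```

The merge step is needed because a point can survive in its own partition while being dominated by a point held by another worker.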

LoGPX: A new communication model for message-passing programs

International Symposium on Performance Evaluation of Computer & Telecommunication Systems (SPECTS)

Authors: Yufei Lin; Yuhua Tang; Xiaowei Guo; Xinhai Xu. National Laboratory for Parallel and Distributed Processing, National University of Defense Technology, Changsha, China

Message Passing Interface (MPI) is a de facto standard for writing high-performance message-passing applications on distributed memory systems. To design effective applications and predict the performance of future systems, an accurate communication model is needed. In this paper, we discuss the characteristics of current systems and MPI implementations, then propose a more complete communication model, named LoGPX, which jointly captures the influences of the MPI communication protocol, the invoking time of communication primitives, hardware resources, and network contention. Based on the model, we obtain the condition under which the message transmission cost reaches its infimum, and we show that when some of its factors are ignored, the LoGPX model degenerates to several popular models such as LogP, LogGP, LoGPC and LogGPO.

Keywords: Protocols; Receivers; Message passing; Network interfaces; Hardware; Analytical models; Routing
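
For context on the simpler models that the abstract says LoGPX degenerates to, the sketch below evaluates the usual point-to-point cost expressions of LogP and LogGP. It does not reproduce the LoGPX model, whose formula is not given in this record. The parameter values are made up for illustration; L is latency, o is per-message overhead at sender and receiver, and G is the gap per byte for long messages.

```python
def logp_message_time(L, o):
    """LogP cost of one small message: send overhead, latency, receive overhead."""
    return o + L + o


def loggp_message_time(k_bytes, L, o, G):
    """LogGP cost of one k-byte message: o + (k - 1) * G + L + o."""
    if k_bytes < 1:
        raise ValueError("message must contain at least one byte")
    return o + (k_bytes - 1) * G + L + o


if __name__ == "__main__":
    # Hypothetical parameters in microseconds.
    L, o, G = 5.0, 1.0, 0.01
    print(logp_message_time(L, o))
    print(loggp_message_time(1, L, o, G))     # a one-byte message matches LogP
    print(loggp_message_time(1001, L, o, G))  # long message adds a per-byte term
```

With a one-byte message the per-byte term vanishes and the LogGP expression reduces to the LogP one, which is a small instance of one model degenerating to another when a factor is ignored.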

Research on a Large-Scale Context Stream Processing Framework Based on Mobile Cloud Computing

The 9th Annual Academic Conference of the China Institute of Communications

Authors: XIAO You (肖友); SHI Dian-xi (史殿习). National Key Laboratory for Parallel and Distributed Processing, School of Computer Science, National University of Defense Technology, Changsha 410073

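With the rise of mobile cloud computing, research on processing context information from large numbers of mobile devices has become a hot topic. A currently popular solution is the Hadoop framework based on the MapReduce computing model; although it addresses the throughput problem of large-scale data processing, it does not handle the timeliness of data processing well. To address this problem, this paper proposes a solution that combines mobile cloud computing with stream processing technology, and builds and implements a stream processing framework for real-time processing of large-scale context. Finally, the research is validated with this framework, and a road condition monitoring example verifies the framework's feasibility and effectiveness.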

Keywords: mobile cloud computing; context information; stream processing framework; feasibility analysis

BIIP: Application Behavior-Aware Insertion Policies for Managing Shared Cache in CMPs

2011 International Conference on Computers, Communications, Control and Automation

Authors: Xiaomin Jia; Ping Huang; Tianlei Zhao; Shubo Qi; Guitao Fu; Minxuan Zhang. National Laboratory for Parallel and Distributed Processing, School of Computer, National University of Defense Technology

The conventional LRU (Least Recently Used) replacement policy can significantly degrade the overall performance of the shared cache of Chip Multi-Processors (CMPs) when the aggregate working set of multiple co-scheduled applications cannot fit in the cache. Different applications have different inherent cache access behavior characteristics. A replacement policy should take this fact into account to derive more performance benefit. This paper proposes application cache Behavior Identification based Insertion Policy (BIIP) replacement policies for managing shared cache in CMPs. BIIP seeks to use the cache access behavior characteristics of each co-scheduled application to choose the replacement policy intelligently. Our evaluation using a full-system CMP simulator shows that BIIP improves overall throughput by 14.8%, 11.2%, 5.6% and 7.2% on average over the baseline LRU policy, the prevailing cache partitioning scheme UCP, and two other shared cache replacement policies, PIPP and TADIP, respectively, on a 4-core CMP with 16 SPEC CPU2006 workloads. Moreover, BIIP requires a total storage overhead of no more than several counters per core and does not require changes to the current cache structure.

Keywords: chip multi-processors (CMPs); application cache behaviors; last-level caches (LLC); insertion; replacement
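
To show why the insertion position matters when a working set does not fit in the cache, the sketch below simulates a tiny fully associative cache under two insertion choices: inserting new lines at the most recently used position (classic LRU) and inserting them at the least recently used position. It does not implement BIIP, whose behavior identification mechanism is not described in this record; the cache size, access pattern, and function name are illustrative assumptions.

```python
def simulate(accesses, capacity, insert_at_mru):
    """Count hits for a fully associative cache holding `capacity` lines.

    `stack` is ordered from the LRU position (index 0) to the MRU position
    (last index). Hits always promote the line to MRU; only the position
    used for newly inserted lines differs between the two policies.
    """
    stack = []
    hits = 0
    for block in accesses:
        if block in stack:
            hits += 1
            stack.remove(block)
            stack.append(block)  # promote to MRU on a hit
        else:
            if len(stack) == capacity:
                stack.pop(0)  # evict the line in the LRU position
            if insert_at_mru:
                stack.append(block)  # classic LRU insertion
            else:
                stack.insert(0, block)  # insert at the LRU position instead
    return hits


if __name__ == "__main__":
    # Hypothetical cyclic access pattern over 5 blocks with a 4-line cache.
    pattern = [0, 1, 2, 3, 4] * 10
    print("MRU insertion hits:", simulate(pattern, 4, insert_at_mru=True))
    print("LRU-position insertion hits:", simulate(pattern, 4, insert_at_mru=False))
```

On this cyclic pattern, MRU insertion evicts every block just before it is reused, while inserting at the LRU position keeps part of the working set resident and produces hits.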
