Search Results - Inner Mongolia University Library

IEEE International Conference on Field-Programmable Technology (FPT)

Authors: Guiming Wu, Yong Dou, Miao Wang. National Laboratory of Parallel and Distributed Processing, National University of Defense Technology, Changsha, China; Jiangnan Institute of Computing Technology, Wuxi, China

We present a high performance and memory efficient hardware implementation of matrix multiplication for dense matrices of any size on FPGA devices. By applying a series of transformations and optimizations to the original serial algorithm, we obtain an I/O and memory optimized block algorithm for matrix multiplication on FPGAs. A linear array of processing elements (PEs) is proposed to implement this block algorithm. We show a significant reduction in hardware resource consumption compared to related work, while increasing clock frequency. Moreover, the memory requirement can be reduced from O(S^2) to O(S), where S is the block size. Therefore, more PEs can be integrated into the same FPGA device.

Keywords: Field programmable gate arrays; Arrays; Random access memory; Algorithm design and analysis; Hardware; Memory management; Optimization

Blocking LU Decomposition for FPGAs

Annual IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM)

Authors: Guiming Wu, Yong Dou, Gregory D. Peterson. National Laboratory for Parallel & Distributed Processing, National University of Defense Technology, Changsha, China; Electrical Engineering and Computer Science, University of Tennessee, Knoxville, TN, USA

To efficiently perform large matrix LU decomposition on FPGAs with limited local memory, the original algorithm needs to be blocked. In this paper, we propose a block LU decomposition algorithm for FPGAs, which is applicable to matrices of arbitrary size. We introduce a high performance hardware design, which mainly consists of a linear array of processing elements (PEs), to implement our block LU decomposition algorithm. A total of 36 PEs can be integrated into a Xilinx Virtex-5 xc5vlx330 FPGA on our self-designed PCI-Express card, reaching a sustained performance of 8.50 GFLOPS at 133 MHz, which outperforms previous work.

Keywords: Field programmable gate arrays; Matrix decomposition; Hardware; Power engineering computing; Algorithm design and analysis; Linear algebra; Computer applications; Concurrent computing; Distributed computing; Laboratories

Proceedings - International Conference on Quality Software: Message from the Program Chairs

Proceedings - International Conference on Quality Software, 2010, p. xi

Authors: Wang, Ji; Chan, Wing Kwong; Kuo, Fei-Ching. National Laboratory for Parallel and Distributed Processing, Changsha, China; City University of Hong Kong, Hong Kong; Swinburne University of Technology, Australia

Routing with uncertainty in wireless mesh networks

International Workshop on Quality of Service

Authors: Fajun Chen, Jiangchuan Liu, Zongpeng Li, Yijie Wang. Key Laboratory of Science and Technology for National Defence of Parallel and Distributed Processing, National University of Defense Technology, China; School of Computing Science, Simon Fraser University; Department of Computer Science, University of Calgary

Existing routing protocols for Wireless Mesh Networks (WMNs) are generally optimized with statistical link measures and do not address the intrinsic uncertainty of wireless links. We show evidence that, with transient link uncertainties at the PHY and MAC layers, a pseudo-deterministic routing protocol that relies on average or historic statistics can hardly exploit the full potential of a multi-hop wireless mesh. We study optimal WMN routing using probing-based online anypath forwarding, with explicit consideration of transient link uncertainties. We show the underlying connection between WMN routing and the classic Canadian Traveller Problem (CTP). Inspired by a stochastic recoverable version of CTP (SRCTP), we develop a practical SRCTP-based online routing algorithm under link uncertainties. We study how dynamic next hop selection can be done at low cost, and derive a systematic selection order for minimizing transmission delay. We conduct simulation studies to verify the effectiveness of the SRCTP algorithms under diverse network configurations. In particular, compared to deterministic routing, a reduction in end-to-end delay (51.15% to 73.02%) and an improvement in packet delivery ratio (99.76%) are observed.

Keywords: Uncertainty; Wireless mesh networks; Delay; Routing protocols; Physical layer; Statistics; Stochastic processes; Relays; Computer networks; Concurrent computing

nSET: Novel simulation method for single-electron tunnel device with 1-Dimension multiple islands

IEEE International Nanoelectronics Conference (INEC)

Authors: Bingcai Sui, Yaqing Chi, Liang Fang. National Laboratory of Parallel and Distributed Processing, School of Computer, National University of Defense Technology, Hunan, China; National University of Defense Technology, China

ISBN: (print) 9781424435432

Single-electron transistors (SETs) are considered attractive candidates for post-CMOS VLSI due to their ultra-small size and low power consumption. Because SETs with a single island cannot normally work at room temperature, more and more researchers are studying SETs with one-dimensional multiple islands. A new simulation method, nSET, is introduced in this paper. Compared with other methods, nSET can simulate SET devices with one-dimensional multiple islands with high speed and accuracy. The comparison shows that nSET is accurate and fast relative to the classical Monte Carlo (MC) simulator, and is very useful for the ASIC design of SET devices.

Keywords: Tunneling; Programming profession; Capacitance; Circuit simulation; Electrostatics; Single electron transistors; Application specific integrated circuits; Discrete event simulation; Very large scale integration; Energy consumption

Towards Building Efficient Content-Based Publish/Subscribe Systems over Structured P2P Overlays

International Conference on Parallel Processing (ICPP)

Authors: Shengdong Zhang, Ji Wang, Rui Shen, Jie Xu. National Laboratory of Parallel and Distributed Processing, National University of Defense Technology, Changsha, China; School of Computing, University of Leeds, Leeds, UK

In this paper, we introduce a generic model to deal with the event matching problem of content-based publish/subscribe systems over structured P2P overlays. In this model, we claim that there are three methods (event-oriented, subscription-oriented and hybrid) to make all the matched (event, subscription) pairs meet in a system. By theoretically analyzing the inherent problems of both the event-oriented and subscription-oriented methods, we propose PEM (Popularity-based Event Matching), a variant of the hybrid method. PEM can achieve a better trade-off between the event processing load and the subscription storage load of a system. PEM has been verified through both mathematical and simulation-based evaluation.

Keywords: Subscriptions; Decision support systems; Publishing; Load modeling; Mathematical model; Bandwidth; Routing

A distributed approach to consistent order delivery of concurrent events in asynchronous DVEs

ICETC 2010 - 2010 2nd International Conference on Education Technology and Computer, 2010, vol. 4, pp. V495-V498

Authors: Zhou, Hangjun; Zhang, Wei; Peng, Yuxing; Li, Sikun. Key Laboratory of Science and Technology for National Defence of Parallel and Distributed Processing, School of Computer Science, National University of Defense Technology, Changsha, Hunan 410073, China; Department of Information Management, Hunan College of Finance and Economics, Changsha, Hunan 410205, China

ISBN: (print) 9781424463688

In large-scale asynchronous distributed virtual environments (DVEs), one of the difficult problems is to deliver concurrent events in a consistent order at each node. Generally, previous consistency control approaches can be classified into two categories: causal order and time-stamped order. However, causal order approaches can only preserve the cause-effect relation of events, and time-stamped order approaches seem intrinsically too complex to be used in serverless large-scale asynchronous DVEs. In this paper, we propose a novel distributed algorithm to identify concurrent events and preserve their consistent order of delivery at different nodes. Simulation studies are also carried out to compare the performance of this algorithm with that of the previous ones. The results show that the new algorithm can effectively deliver concurrent events in a consistent order at each node and is more efficient than the previous algorithms in large-scale asynchronous DVEs. © 2010 IEEE.

Keywords: Virtual reality

A compact model for multi-island single electron transistors

IEEE International Nanoelectronics Conference (INEC)

Authors: Yaqing Chi, Haiqin Zhong, Chao Zhang, Liang Fang. National Key Laboratory of Parallel and Distributed Processing, School of Computer, National University of Defense Technology, Hunan, China; National University of Defense Technology, Changsha, Hunan, CN

The multi-island single electron transistor (MISET) is a kind of single electron transistor (SET) that has the advantage of operating at room temperature. A novel semi-empirical compact model for the MISET is proposed. The new approach combines the orthodox theory of single electron tunneling for a single Coulomb island with a novel empirical analysis for a chain of Coulomb islands. The model is verified against the Monte Carlo method in the SIMON simulator and is much faster than the traditional multi-island SET simulator, which is advantageous for large-scale multi-island SET circuit simulation.

Keywords: Single electron transistors; Circuit simulation; Tunneling; Large-scale systems; Voltage; Temperature; Character generation; Electrodes; Distributed processing; Concurrent computing

Using Redundant Threads for Fault Tolerance of OpenMP Programs

International Conference on Information Science and Applications (ICISA)

Authors: Hongyi Fu, Yan Ding. Key Laboratory of Science and Technology for National Defence of Parallel and Distributed Processing, National University of Defense Technology, Changsha, Hunan, China; Section #620, School of Computer, National University of Defense Technology, Changsha, Hunan, China

ISBN: (print) 9781424459421

With the wide application of multi-core processor architectures in high performance computing, fault tolerance for shared memory parallel programs has become a research hot spot. For years, checkpointing has been the dominant fault tolerance technology in this field, and much recent research has addressed it. However, for programs that deal with large amounts of data, checkpointing may induce massive I/O transfer, which adversely affects scalability. To deal with this problem, this paper proposes a fault tolerance approach for shared memory parallel programs that makes use of redundancy. Our scheme avoids saving and restoring computational state during the program's execution and hence does not involve I/O operations, so it has a clear scalability advantage over checkpointing. In this paper, we introduce our approach and the related compiler tool in detail, and give the experimental evaluation results.

Keywords: Fault tolerance; Checkpointing; Scalability; Hardware; Application software; Multicore processing; High performance computing; Laboratories; Distributed processing

Personalized Reputation Model in Cooperative Distributed Systems

International Conference on Parallel and Distributed Systems (ICPADS)

Authors: Wei Liu, Yang-Bin Tang, Huai-Min Wang. School of Computer, National University of Defense Technology, Changsha, Hunan, China; National Laboratory for Parallel and Distributed Processing, Changsha, Hunan, China; Institute of Science, National University of Defense Technology, Changsha, Hunan, China

Reputation systems provide a promising way to build trust relationships between users in distributed cooperation systems, such as file sharing, streaming, distributed computing and social networks, through which a user can distinguish good services or users from malicious ones and cooperate with them. However, most reputation models focus on evaluating the quality of different services in one dimension and pay less attention to the preferences of different users. This paper proposes a personalized reputation model that gives each user a personalized trust view of others according to his or her preferences. In our approach, we aggregate the users' preferences with a collaborative filtering method and qualify them with user similarity, which is integrated into the computation of reputation values. The experimental results suggest that our model can efficiently resist possible kinds of malicious behavior.

Keywords: Computational modeling; Measurement; Indexes; Aggregates; Resists; Peer to peer computing; Electronic mail
