检索结果-内蒙古大学图书馆

6th International Conference on parallel and distributed Computing, Applications and Technologies (PDCAT 2005)

作者： Zeng, LF Yang, XJ Huangchun National Laboratory for Parallel and Distributed Processing China

ISBN: (纸本)0769524052

In order to improve the performance of applications on OpenMP/JIAJIA, we present a new abstraction, Array Relation Vector (ARV), to describe the relation between the data elements of two consistent shared arrays accessed in one computation phase. Based on ARV, we use array grouping to eliminate the pseudo data distributing of small shared data and improve the page locality. Experimental results show that ARV-based array grouping can greatly improve the performance of applications with non-continuous data access and strict access affinity on OpenMP/JIAJIA cluster. For applications with small shared arrays, array grouping can improve the performance obviously when the processor number is small.

关键词： distributed computer systems

来源：评论

学校读者我要写书评

暂无评论

Computing Must and May Alias to Detect Null Pointer Dereference

Computing Must and May Alias to Detect Null Pointer Derefere...

引用

3rd International Symposium on Leveraging Applications of Formal Methods, Verification and Validation

作者： Ma, Xiaodong Wang, Ji Dong, Wei National Laboratory for Parallel and Distributed Processing China

ISBN: (纸本)9783540884781

This paper presents a novel algorithm to detect null pointer dereference errors. The algorithm utilizes both of the must and may alias information in a compact way to improve the precision of the detection. Using may alias information obtained by a fast flow- and context- insensitive analysis algorithm, we compute the must alias generated by the assignment statements and the must alias information is also used to improve the precision of the may alias. We can strong update more expressions using the must alias information, which will reduce the false positives of the detection for null pointer dereference. We have implemented our algorithm in the SUIF2 compiler infrastructure and the experiments results are as expected.

关键词： Information use

来源：评论

学校读者我要写书评

暂无评论

Deep reinforcement learning:a survey

引用

Frontiers of Information Technology & Electronic Engineering 2020年第12期21卷 1726-1744页

作者： Hao-nan WANG Ning LIU Yi-yun ZHANG Da-wei FENG Feng HUANG Dong-sheng LI Yi-ming ZHANG Science and Technology on Parallel and Distributed Processing Laboratory National University of Defense TechnologyChangsha 41OOOOChina

Deep reinforcement learning(RL)has become one of the most popular topics in artificial intelligence *** has been widely used in various fields,such as end-to-end control,robotic control,recommendation systems,and natural language dialogue *** this survey,we systematically categorize the deep RL algorithms and applications,and provide a detailed review over existing deep RL algorithms by dividing them into modelbased methods,model-free methods,and advanced RL *** thoroughly analyze the advances including exploration,inverse RL,and transfer ***,we outline the current representative applications,and analyze four open problems for future research.

关键词： Reinforcement learning Deep reinforcement learning Reinforcement learning applications

来源：评论

学校读者我要写书评

暂无评论

An Efficient Broadcast Authentication Protocol in Wireless Sensor Networks

引用

电子学报(英文版) 2009年第2期18卷 368-372页

作者： ZHAO Xin WANG Xiaodong YU Wanrong ZHOU Xingming National Key Laboratory of Parallel and Distributed Processing National University of Defense Technology Changsha 410073 China

Broadcast authentication is a critical security service in wireless sensor networks. A protocol named μTESLA[1] has been proposed to provide efficient authentication service for such networks. However, when applied t... 详细信息

Broadcast authentication is a critical security service in wireless sensor networks. A protocol named μTESLA^[1] has been proposed to provide efficient authentication service for such networks. However, when applied to applications such as time synchronization and fire alarm in which broadcast messages are sent infrequently, μTESLA encounters problems of wasted key resources and slow message verification. This paper presents a new protocol named GBA (Generalized broadcast authentication), for efficient broadcast authentication in these applications. GBA utilises the one-way key chain mechanism of μTESLA, but modifies the keys and time intervals association, and changes the key disclosure mechanism according to the message transmission model in these applications. The proposed technique can take full use of key resources, and shorten the message verification time to an acceptable level. The analysis and experiments show that GBA is more efficient and practical than μESLA in appli ations with various message transmission models.

关键词：无线传感器网络认证协议广播 applications practical 安全服务身份验证时间同步

来源：评论

学校读者我要写书评

暂无评论

Embedded DHT overlays in virtual computing environments

引用

Science China(Information Sciences) 2010年第3期53卷 483-493页

作者： ZHANG YiMing, LU XiCheng & LI DongSheng 1national laboratory for parallel and distributed processing (PDL), Changsha 410073, China 2School of Computer, national University of Defense Technology, Changsha 410073, China 1. National Laboratory for Parallel and Distributed Processing (PDL) Changsha 410073 China

With the rapid development of computing and networking technologies, people propose to build harmonious, trusted and transparent Internet-based virtual computing environments (iVCE). The overlay-based organization of dynamic Internet resources is an important approach for iVCE to realizing efficient resource sharing. DHT-based overlays are scalable, low-latency and highly available; however, the current DHT overlay (SKY) in iVCE cannot satisfy the "trust" requirements of Internet applications. To address this problem, in this paper we modify SKY and propose TrustedSKY, an embedded DHT overlay technique in iVCE which supports applications to select trusted nodes to form a "trusted subgroup" in the base overlay and realize secure and trusted DHT routing.

关键词： Internet-based virtual computing environment (iVCE) DHT overlay TrustedSKY

来源：评论

学校读者我要写书评

暂无评论

A coarse-grained reconfigurable computing architecture with loop self-pipelining

引用

Science in China(Series F) 2009年第4期52卷 575-587页

作者： DOU Yong WU GuiMing XU dinHui ZHOU XingMing National Laboratory for Parallel & Distributed Processing National University of Defense Technology Changsha 410073 China

Reconfigurable computing tries to achieve the balance between high efficiency of custom computing and flexibility of general-purpose computing. This paper presents the implementation techniques in LEAP, a coarse-grained reconfigurable array, and proposes a speculative execution mechanism for dynamic loop scheduling with the goal of one iteration per cycle and implementation techniques to support decoupling synchronization between the token generator and the collector. This paper also in- troduces the techniques of exploiting both data dependences of intra- and inter-iteration, with the help of two instructions for special data reuses in the loop-carried dependences. The experimental results show that the number of memory accesses reaches on average 3% of an RISC processor simulator with no memory optimization. In a practical image matching application, LEAP architecture achieves about 34 times of speedup in execution cycles, compared with general-purpose processors.

关键词： reconfigurable computing loop pipelining data driven register promotion

来源：评论

学校读者我要写书评

暂无评论

SKY:Efficient peer-to-peer networks based on distributed Kautz graphs

引用

Science in China(Series F) 2009年第4期52卷 588-601页

作者： ZHANG YiMing LU XiCheng LI DongSheng National Laboratory for Parallel and Distributed Processing National University of Defense Technology Changsha 410073 China

Many proposed P2P networks are based on traditional interconnection topologies. Given a static topology, the maintenance mechanism for node join/departure is critical to designing an efficient P2P network. Kautz graphs have many good properties such as constant degree, low congestion and optimal diameter. Due to the complexity in topology maintenance, however, to date there have been no effective P2P networks that are proposed based on Kautz graphs with base ~ 2. To address this problem, this paper presents the ＂distributed Kautz （D-Kautz） graphs＂, which adapt Kautz graphs to the characteristics of P2P networks. Using the D-Kautz graphs we further propose SKY, the first effective P2P network based on Kautz graphs with arbitrary base. The effectiveness of SKY is demonstrated through analysis and simulations.

关键词： peer-to-peer network Kautz graph constant degree topology maintenance D-Kautz graph

来源：评论

学校读者我要写书评

暂无评论

Experimental verification of the parasitic bipolar amplification effect in PMOS single event transients

引用

Chinese Physics B 2014年第7期23卷 775-779页

作者：何益百陈书明 College of Computer National University of Defense Technology Science and Technology on Parallel and Distributed Processing Laboratory National University of Defense Technology

The contribution of parasitic bipolar amplification to SETs is experimentally verified using two P-hit target chains in the normal layout and in the special layout. For PMOSs in the normal layout, the single-event charge collection is composed of diffusion, drift, and the parasitic bipolar effect, while for PMOSs in the special layout, the parasitic bipolar junction transistor cannot turn on. Heavy ion experimental results show that PMOSs without parasitic bipolar amplification have a 21.4% decrease in the average SET pulse width and roughly a 40.2% reduction in the SET cross-section.

关键词： single event effect single event transient parasitic bipolar amplification heavy ion experiments

来源：评论

学校读者我要写书评

暂无评论

Scalability of 3D deterministic particle transport on the Intel MIC architecture

引用

Nuclear Science and Techniques 2015年第5期26卷 88-97页

作者：王庆林刘杰龚春叶邢座程 Science and Technology on Parallel and Distributed Processing Laboratory National University of Defense Technology Science and Technology on Space Physics Laboratory

The key to large-scale parallel solutions of deterministic particle transport problem is single-node computation performance. Hence, single-node computation is often parallelized on multi-core or many-core computer architectures. However, the number of on-chip cores grows quickly with the scale-down of feature size in semiconductor technology. In this paper, we present a scalability investigation of one energy group time-independent deterministic discrete ordinates neutron transport in 3D Cartesian geometry(Sweep3D) on Intel's Many Integrated Core(MIC) architecture, which can provide up to 62 cores with four hardware threads per core now and will own up to 72 in the future. The parallel programming model, Open MP, and vector intrinsic functions are used to exploit thread parallelism and vector parallelism for the discrete ordinates method, respectively. The results on a 57-core MIC coprocessor show that the implementation of Sweep3 D on MIC has good scalability in performance. In addition, the application of the Roofline model to assess the implementation and performance comparison between MIC and Tesla K20 C Graphics processing Unit(GPU) are also reported.

关键词：计算机体系结构可扩展性粒子输运三维几何英特尔麦克风离散坐标法计算性能

来源：评论

学校读者我要写书评

暂无评论

Exploiting a depth context model in visual tracking with correlation filter

引用

Frontiers of Information Technology & Electronic Engineering 2017年第5期18卷 667-679页

作者： Zhao-yun CHEN Lei LUO Da-fei HUANG Mei WEN Chun-yuan ZHANG College of Computer National University of Defense TechnologyChangsha 410073China National Key Laboratory of Parallel and Distributed Processing Changsha 410073China

Recently correlation filter based trackers have attracted considerable attention for their high computational efficiency. However, they cannot handle occlusion and scale variation well enough. This paper aims at preventing the tracker from failure in these two situations by integrating the depth information into a correlation filter based tracker. By using RGB-D data, we construct a depth context model to reveal the spatial correlation between the target and its surrounding regions. Furthermore, we adopt a region growing method to make our tracker robust to occlusion and scale variation. Additional optimizations such as a model updating scheme are applied to improve the performance for longer video sequences. Both qualitative and quantitative evaluations on challenging benchmark image sequences demonstrate that the proposed tracker performs favourably against state-of-the-art algorithms.

关键词： Visual tracking Depth context model Correlation filter Region growing

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：