检索结果-内蒙古大学图书馆

Fast arithmetic architectures for public-key algorithms over galois fields gf((2n)m) 15th

15th international conference on the theory and application of cryptographic techniques, EUROCRYPT 1997

作者： Paar, Christof Soria-Rodriguez, Pedro ECE Department Worcester Polytechnic Institute WorcesterMA01609 United States

ISBN: (纸本)3540629750

this contribution describes a new class of arithmetic architectures for Galois fields GF(2k). the main applications of the architecture are public-key systems which are based on the discrete logarithm problem for elliptic curves. the architectures use a representation of the field GF(2k) as GF((2n)m), where k = n · m. the approach explores bit parallel arithmetic in the subfield GF(2n), and serial processing for the extension field arithmetic. this mixed parallel-serial (hybrid) approach can lead to very fast implementations. the principle of these approach was initially suggested by Mastrovito. As the core module, a hybrid multiplier is introduced and several optimizations are discussed. We provide two different approaches to squaring which, in conjunction with the multiplier, yield fast exponentiation architectures. the hybrid architectures are capable of exploring the time-space tradeoff paradigm in a flexible manner. In particular, the number of clock cycles for one field multiplication, which is the atomic operation in most public-key schemes, can be reduced by a factor of n compared to all other known realizations. the acceleration is achieved at the cost of an increased computational complexity. We describe a proof-of-concept implementation of an ASIC for exponentiation in GF((2n)m), m variable. © Springer-Verlag Berlin Heidelberg 1997.

关键词： Cryptography

来源：评论

学校读者我要写书评

暂无评论

Memory-optimized visualization system for limited-bandwidth multiprocessing environments

Memory-optimized visualization system for limited-bandwidth ...

引用

Proceedings of the 1997 4th international conference on High Performance Computing, HiPC

作者： Law, Asish Yagel, Roni Nichimen Graphics Inc Los Angeles United States

Object dataflow is a popular approach used in parallel rendering. the data representing the 3D scene is statically distributed among processors and objects are fetched and cached only on demand. Most previous object dataflow methods were implemented on shared memory architectures and exploited spatial coherency to reduce hardware cache misses. In this paper, we propose an efficient model for object dataflow parallel volume rendering on message passing machines. the algorithm is introduced and its ray storage mechanism is used to support latency hiding by postponing computation on inactive rays. Memory usage is optimized by letting objects migrate and replicate at different processors rather than the common static assignments. Our cache-only-memory approach uses a distributed-directory scheme to trace the location of objects at other nodes. A mechanism to minimize network congestion was implemented which optimizes channel utilization. Unlike previous methods, our approach can benefit from temporal coherence and effectively minimizes communication costs during animation on limited-bandwidth multiprocessing environments. We report results of the algorithm's implementation on several platforms like Cray T3D, Convex SPP and DEC-alpha cluster of workstations (COWs), and achieved higher efficiency and scalability than existing algorithms.

关键词： parallel processing systems

来源：评论

学校读者我要写书评

暂无评论

DataScalar architectures 97

DataScalar architectures

引用

Proceedings of the 1997 24th Annual international Symposium on Computer Architecture

作者： Burger, Doug Kaxiras, Stefanos Goodman, James R. Univ of Wisconsin-Madison Madison WI United States

ISBN: (纸本)9780897919012

DataScalar architectures improve memory system performance by running computation redundantly across multiple processors, which are each tightly coupled with an associated memory. the program data set (and/or text) is distributed across these memories. In this execution model, each processor broadcasts operands it loads from its local memory to all other units. In this paper, we describe the benefits, costs, and problems associated with the DataScalar model. We also present simulation results of one possible implementation of a DataScalar system. In our simulated implementation, six unmodified SPEC95 binaries ran from 7% slower to 50% faster on two nodes, and from 9% to 100% faster on four nodes, than on a system with a comparable, more traditional memory system. Our intuition and results show that DataScalar architectures work best with codes for which traditional parallelization techniques fail. We conclude with a discussion of how DataScalar systems may accommodate traditional parallel processing, thus improving performance over a much wider range of applications than is currently possible with either model.

关键词： Computer architecture

来源：评论

学校读者我要写书评

暂无评论

Topology and shape preserving parallel thinning for 3D digital images — A new approach 9th

Topology and shape preserving parallel thinning for 3D digit...

引用

9th international conference on Image Analysis and processing, ICIAP 1997

作者： Saha, P.K. Dutta Majumder, D. Electronics and Communication Sciences Unit Indian Statistical Institute 203 Barrackpur Trunk Road Calcutta700035 India

ISBN: (纸本)3540635076

this paper is concerned with a new parallel thinning approach for three dimensional (3D) digital images that preserves the topology and maintains their shape. We introduce a new approach of selecting shape points and outer-layer used for erosion during each iteration. the approach produces good skeleton for different types of corners. the concept of using two image versions in thinning is introduced and its necessity in parallel thinning is justified. the robustness of the algorithm under pseudo random noise with respect to shape properties is studied and the results are found to be satisfactory. © Springer-Verlag Berlin Heidelberg 1997.

关键词： Topology

来源：评论

学校读者我要写书评

暂无评论

Cooperative vision in a multi-agent architecture 9th

Cooperative vision in a multi-agent architecture

引用

9th international conference on Image Analysis and processing, ICIAP 1997

作者： Oswald, Norbert Levi, Paul Institute of Parallel and Distributed High-Performance Systems Applied Computer Science - Image Understanding Stuttgart70565 Germany

ISBN: (纸本)3540635076

We present the concept of cooperative vision and its application to a multi-agent system with special attention to the integration of vision. Cooperative vision can be described as a type of distributed vision, where several agents working in a shared environment are involved. the object recognition task was distributed to several agents in order to demonstrate the concept of cooperative vision. this enables, on the one hand, a verification of objects by several agents and, on the other hand, a localization of spatial positions of other agents. A Bayesian approach is used for the combination of conclusions of several agents. Experiments done so far show significant results with regard to both tasks. © Springer-Verlag Berlin Heidelberg 1997.

关键词： Multi agent systems

来源：评论

学校读者我要写书评

暂无评论

Optimal automatic hardware synthesis for signal processing algorithms

Optimal automatic hardware synthesis for signal processing a...

引用

international conference on Digital Signal processing (DSP)

作者： N. Koziris G. Economakos T. Andronikos G. Papakonstantinou P. Tsanakas Department of Electrical and Computer Engineering Computer Science Division National and Technical University of Athens Zografou Greece

this paper presents a complete methodology for the automatic synthesis of VLSI architectures used in digital signal processing. Most signal processing algorithms have the form of an n-dimensional nested loop with unit uniform loop carried dependencies. We model such algorithms with generalized UET grids. We calculate the optimal makespan for the generalized UET grids and then we establish the minimum number of systolic cells required for achieving the optimal makespan. We present a complete methodology for the hardware synthesis of the resulting architecture, based on VHDL. this methodology automatically detects all necessary computation and communication elements and produces optimal layouts. the complexity of our proposed scheduling policy is completely independent of the size of the nested loop and depends only on its dimension, thus being the most efficient (in terms of complexity) known to us. All these methods were implemented and incorporated in an integrated software package which provides the designer with a powerful parallel design environment, from high level signal processing algorithmic specifications to low-level (i.e., actual layouts) optimal implementation. the evaluation was performed using well-known algorithms from signal processing.

关键词： Hardware Signal synthesis Signal processing Signal processing algorithms Signal design Algorithm design and analysis Very large scale integration Digital signal processing Computer architecture Processor scheduling

来源：评论

学校读者我要写书评

暂无评论

An interactive engineering tool for parallel HPC applications

引用

ADVANCES IN ENGINEERING SOFTWARE 1996年第2期26卷 121-131页

作者： Lenke, M LRR-TUM Lehrstuhl für Rechnertechnik und Rechnerorganisation Institut für Informatik Technische Universität München 80290 München Germany

Typical applications of the so-called Grand Challenges need massively parallel computer system architectures. Tools like parallel debuggers, performance analysers and visualizers help the code designer to develop efficient parallel algorithms. Such tools merely support the development cycle. But technical and scientific engineers who make use of parallel high-performance computing applications, e.g. numerical simulation algorithms in computational fluid dynamics (CFD), must be supported in their engineering work by another kind of tool. A tool for the application cycle is required because old, conventional suggestions regarding the arrangement for the application cycle rely on strictly sequential procedures. they are due to the heritage of traditional work on former vector computers. that formative influence is still felt in today's arrangements for the application cycle, prevents a more efficient engineering work and, therefore, must be overcome. New tool conceptions have to be introduced to enable on-line interaction between the technical and scientific engineers and their running parallel simulation. VIPER stands for VIsualization of parallel numerical simulation algorithms for Extended Research and offers physical parameters of the mathematical model and parameters of the numerical method as objects of a graphical user tool interface for online observation and online modification. A special client-server-client process architecture implementation enables technical and scientific engineers who are sitting at their graphic workstation to interact with their parallel simulation algorithms running on a remote parallel computer system. the VIPER prototype is applied on ParNsflex which is a parallel Navier-Stokes solver for real world aero-dynamic problems. A Paragon XP/S was selected as test parallel computer system. A first evaluation indicates the superiority of the VIPER conception against conventional procedures. Copyright (C) 1996 Published by Elsevier Science L

关键词： parallel processing systems

来源：评论

学校读者我要写书评

暂无评论

On the scalability of 2-d wavelet transform algorithms on fine-grained parallel machines 25

On the scalability of 2-d wavelet transform algorithms on fi...

引用

25th international conference on parallel processing, ICPP 1996

作者： Patel, J.N. Khokhar, A.A. Jamieson, L.H. School of Electrical and Computer Engineering Purdue University West LafayetteIN47907 United States Department of Electrical Engineering University of Delaware NewarkDE19716 United States

ISBN: (纸本)081867623X

We study the scalability of 2-D discrete wavelet transform algorithms on fine-grained parallel architectures. the principal operation in the 2-D DWT is the filtering operation used to implement the filter banks of the 2-D subband decomposition. We demonstrate that there exist combinations of the machine size, image size, and wavelet size for which the time-domain algorithms outperform the frequency domain algorithms, and vice-versa. We, therefore, demonstrate that a hybrid approach which combines time- A nd frequency-domain approaches can yield optimal performance for a broad range of problem and machine sizes. Furthermore, we show the effect of processor speed and the use of separable versus nonseparable wavelets on the crossover points between the algorithm approaches. © 1996 IEEE.

关键词： Discrete wavelet transforms

来源：评论

学校读者我要写书评

暂无评论

VLSI/WSI designs for folded cube-connected cycles architectures

VLSI/WSI designs for folded cube-connected cycles architectu...

引用

Proceedings of the 1996 9th international conference on VLSI Design

作者： Sebastian, M.P. Nagendra Rao, P.S. Jenkins, Lawrence Indian Inst of Science Bangalore India

this paper presents VLSI/WSI designs for a recently introduced parallel architecture known as the folded cube-connected cycles (FCCC). We first discuss two layouts for the FCCC, in which there is no component redundancy. then we incorporate redundancy, and present locally and globally reconfigurable FCCCs. We also discuss the design of universal building blocks for the construction of fault-tolerant FCCCs of various dimensions.

关键词： Integrated circuit layout

来源：评论

学校读者我要写书评

暂无评论

thin Si oxide films for MIS tunnel emitter by hollow cathode enhanced plasma oxidation

引用

thIN SOLID FILMS 1996年第1-2期281卷 412-414页

作者： Usami, K Takahashi, I Miyake, E Moriya, M Cai, XY Kobayashi, T Goto, T The faculty of Electrical Communications The University of Electro-Communications 1-5-1 Chofugaoka Chofu-Shi Tokyo 182 Japan

A DC plasma oxidation system with a hollow cathode which consists of a pair of parallel Si plates was developed. Using this system, thin Si oxide films of less than 40 nm thickness were grown on n-type Si(100) substrates, for the application to the tunnel devices. the film quality and the oxide stoichiometry were estimated by XPS measurements. On the oxide films, the MIS (Metal-Insulator-Semiconductor) diode type tunnel emitters were fabricated. the electrical properties of the diodes, such as I-V characteristics and electron emission into the vacuum were measured. For a typical sample, an electron emission current density of 800 pA/mm(2) into the vacuum was obtained.

关键词： electron emission plasma processing and deposition silicon oxide tunneling

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：