检索结果-内蒙古大学图书馆

Copyright and Reprint Permissions: Abstracting is permitted with credit to the source. Libraries may photocopy beyond the limits of US copyright law, for private use of patrons, those articles in this volume that carry a code at the bottom of the first page, provided that the per-copy fee indicated in the code is paid through the Copyright Clearance Center. the papers in this book comprise the proceedings of the meeting mentioned on the cover and title page. they reflect the authors' opinions and, in the interests of timely dissemination, are published as presented and without change. their inclusion in this publication does not necessarily constitute endorsement by the editors or the Institute of Electrical and Electronics Engineers, Inc.

关键词：

来源：评论

学校读者我要写书评

暂无评论

MLFormer: a high performance MPC linear inference framework for transformers

引用

JOURNAL OF CRYPTOGRAPHIC ENGINEERING 2025年第1期15卷 1-20页

作者： Liu, Siqi Liu, Zhusen Chen, Donglong Dai, Wangchen Zhou, Lu Liu, Zhe Cheung, Ray C. C. Koc, Cetin Kaya BNU HKBU United Int Coll Guangdong Prov Key Lab IRADS Zhuhai 519000 Peoples R China Hangzhou Innovat Inst Beihang Univ Hangzhou 311121 Peoples R China Zhejiang Lab Hangzhou 310000 Peoples R China Sun Yat Sen Univ Shenzhen 518107 Peoples R China Nanjing Univ Aeronaut & Astronaut Nanjing 210000 Peoples R China City Univ Hong Kong Hong Kong 310000 Peoples R China Igdir Univ Igdir Turkiye Univ Calif Santa Barbara St Barbara CA USA

Transformer-based models are widely used in natural language processing tasks, and their application has been further extended to computer vision as well. In their usage, data security has become a crucial concern when deploying deep learning services on cloud platforms. To address these security concerns, Multi-party computation (MPC) is employed to prevent data and model leakage during the inference process. However, Transformer model introduces several challenges for MPC computation, including the time overhead of the Softmax (normalized exponential) function, the accuracy issue caused by the "dynamic range" of approximated division and exponential, and the high memory overhead when processing long sequences. To overcome these challenges, we propose MLformer, an MPC-based inference framework for transformer models based on Crypten Knott et al. (Adv Neural Inf Process Syst 34: 4961-4973, 2021), a secure machine learning framework suggested by Facebook AI Research group, in the semi-honest adversary model. In this framework, we replace the softmax attention with linear attention, which has linear time and memory complexity with input length. the modification eliminates the softmax function entirely, resulting in lower time and memory overhead. To ensure the accuracy of linear attention, we propose the scaled linear attention to address the dynamic range issue caused by the MPC division used and a new approximate division function is proposed to reduce the computational time of the attention block. Furthermore, to improve the efficiency and accuracy of MPC exponential and reciprocal which are commonly used in transformer model, we propose a novel MPC exponential protocol and first integrate the efficient reciprocal protocol Bar-Ilan and Beaver (in Proceedings of the 8th annual acm symposium on principles of distributed computing, pp. 201-209, 1989) to our framework. Additionally, we optimize the computation of causal linear attention, which is utilized in private in

关键词： Multi-party computation Linear transformer Private inference parallel processing GPU

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms for personalized communication and sorting with an experimental study

Parallel algorithms for personalized communication and sorti...

引用

Proceedings of the 1996 8th annual acm symposium on parallel algorithms and architectures

作者： Helman, David R. Bader, David A. Jaja, Joseph Univ of Maryland College Park United States

Two novel variations on sample sort, one using only two rounds of regular all-to-all personalized communication in a scheme that yields very good load balancing with virtually no overhead and another using regular sampling for choosing splitters, were studied. the two were coded in Split-C and were run on a variety of platforms. Results were consistent with theoretical analysis and illustrated the scalability and efficiency of the algorithms.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Resource scheduling for parallel database and scientific applications 96

Resource scheduling for parallel database and scientific app...

引用

Proceedings of the 1996 8th annual acm symposium on parallel algorithms and architectures

作者： Chakrabarti, Soumen Muthukrishnan, S. Univ of California at Berkeley Berkeley CA United States

ISBN: (纸本)9780897918091

Scheduling problems that are critical and prevalent in practical parallel computing are computed. A polynomial time makespan algorithm that produces a schedule of length O(V+Φ log T), which is therefore an O(log T) approximation is presented to solve these problems. the makespan algorithm can be extended to minimize the weighted average completion time over all the jobs to the same approximation factor of O(log T).

关键词： parallel processing systems

来源：评论

学校读者我要写书评

暂无评论

Library of basic PRAM algorithms and its implementation in FORK

Library of basic PRAM algorithms and its implementation in F...

引用

Proceedings of the 1996 8th annual acm symposium on parallel algorithms and architectures

作者： Kessler, Christoph W. Traeff, Jesper Larsson Universitaet Trier Trier Germany

A library, called PAD, of basic parallel algorithms and data structures for the PRAM is currently being implemented using the PRAM programming language Fork95. Main motivations of the PAD project is to study the PRAM as a practical programming model, and to provide an organized collection of basic PRAM algorithms for the SB-PRAM under completion at the University of Saarbruecken. We give a brief survey of Fork95, and describe the main components of PAD. Finally we report on the status of the language and library and discuss further developments.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Finding minimum spanning forests in logarithmic time and linear work using random sampling 96

Finding minimum spanning forests in logarithmic time and lin...

引用

Proceedings of the 1996 8th annual acm symposium on parallel algorithms and architectures

作者： Cole, Richard Klein, Philip N. Tarjan, Robert E. New York Univ New York NY United States

ISBN: (纸本)9780897918091

We describe a randomized CRCW PRAM algorithm that finds a minimum spanning forest of an n-vertex graph in O(log n) time and linear work. this shaves a factor of 2log* n off the best previous running time for a linear-work algorithm. the novelty in our approach is to divide the computation into two phases, the first of which finds only a partial solution. this idea has been used previously in parallel connected components algorithms.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Universal continuous routing strategies 96

Universal continuous routing strategies

引用

8th annual acm symposium on parallel algorithms and architectures

作者： Scheideler, C Vocking, B Univ Gesamthsch Paderborn Dept Math & Comp Sci D-33095 Paderborn Germany Univ Gesamthsch Paderborn Heinz Nixdorf Inst D-33095 Paderborn Germany

ISBN: (纸本)9780897918091

We analyze universal routing protocols, that is, protocols that can be used for any communication pattern in any network, under a stochastic model of continuous message generation, In particular, we present two universal protocols, a store-and-forward and a wormhole routing protocol, and characterize their performance by the following three parameters: the maximum message generation rate for which the protocol is stable, the expected delay of a message from generation to service, and the time the protocol needs to recover from worst-case scenarios. Both protocols yield significant performance improvements over all previously known continuous routing protocols. In addition, we present adaptations of our protocols to continuous routing in node-symmetric networks, butterflies, and meshes.

关键词： parallel processing systems

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：