检索结果-内蒙古大学图书馆

Fast Search of the Optimal Contraction Sequence in Tensor Networks

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING 2021年第3期15卷 574-586页

作者： Liang, Ling Xu, Jianyu Deng, Lei Yan, Mingyu Hu, Xing Zhang, Zheng Li, Guoqi Xie, Yuan Univ Calif Santa Barbara Dept Elect & Comp Engn Santa Barbara CA 93106 USA Tsinghua Univ Ctr Brain Inspired Comp Res Dept Precis Instrument Beijing 100084 Peoples R China

Tensor network and tensor computation are widely applied in scientific and engineering domains like quantum physics, electronic design automation, and machine learning. As one of the most fundamental operations for tensor networks, a tensor contraction eliminates the sharing orders among tensors and produces a compact sub-network. Different contraction sequence usually yields distinct storage and compute costs, and searching the optimal sequence is known as a hard problem. Prior work have designed heuristic and fast algorithms to solve this problem, however, several issues still remain unsolved. For example, the data format and data structure are not efficient, the constraints during modeling are impractical, the search of the optimal solution might fail, and the search cost is very high. In this paper, we first introduce a log(k) order representation and design an adjacency matrix-based data structure to efficiently accelerate the search of the optimal contraction sequence. Then, we propose an outer product pruning method with acceptable overhead to reduce the search space. Finally, we use a multithread optimization in our implementation to further improve the execution performance. We also present indepth analysis of factors that influence the search time. This work provides a full-stack solution for optimal contraction sequence search from both high-level data structure and search algorithm to low-level execution parallelism, and it will benefit a broad range of tensor-related applications.

关键词： Tensor contraction adjacency matrix BFS algorithm search space reduction multithread optimization

来源：评论

学校读者我要写书评

暂无评论

optimization of multithread for Long Digit Multiplier by using Ancient India Vedic Mathematic 14

Optimization of Multithread for Long Digit Multiplier by usi...

引用

2017 14th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON)

作者： Thongbai, Nopphagaw Tuwanuti, Panwit King Mongkuts Inst Technol Ladkrabang Fac Informat Technol Bangkok Thailand

ISBN: (纸本)9781538604496

Any processor's performance is dependent on three important factor speed, area and power. The better tread-off between factors, an effective once. Multiplier are common used in computation process. In this paper, the proposed multiplier by design based on the sutra "Urdhva Tiryakbhyam and Nikhilam" of Vedic are analyzed and the performance results of multiplier are compare with conventional multipliers and karatsuba once of the most popular. In conclusion of experiment Vedic mathematics can be improvement computation effective. Specifically, in accurate computation, in many digit mathematics or very long digit, that want power of computation. By helping of special opcode instruction that bundle in processor such as SSE, AVX and etc. in modern processor to increment parallel process level in single core by SIMD and vector register.

关键词： Vedic Multiplier multithread optimization

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：