Search Results - Inner Mongolia University Library

Fast, General Parallel Computation for Machine Learning

47th International Conference on Parallel Processing (ICPP) / International Workshop on Embedded Multicore Systems (EMS)

Authors: Yancey, Robin Elizabeth; Matloff, Norman (Univ Calif Davis, Davis, CA 95616, USA)

ISBN: (print) 9781450365239

Today the terms machine learning (ML) and Big Data are closely correlated. This, and the complexity of many ML algorithms, motivates a search for fast parallel computation methods. A further motivating factor is a need to deal with memory size limitations, especially for the moderately-sized machines common in many ML applications. In addition, it is desirable to develop generally applicable methods, rather than needing to develop a different parallel approach for every ML algorithm. In this work, we apply a technique we call Software Alchemy (SA) to ML. We are particularly interested in ML for recommender systems, and explore the feasibility of SA in that context.

Keywords: parallel computation; machine learning; superlinear speedup; recommender systems


Fast Parallel Computation of Polynomials Using Few Processors

SIAM Journal on Computing, 1983, Vol. 12, No. 4, pp. 641-644

Authors: Valiant, LG; Skyum, S; Berkowitz, S; Rackoff, C (Aarhus Univ, Dept Comp Sci, DK-8000 Aarhus C, Denmark; Univ Toronto, Dept Comp Sci, Toronto M5S 1A7, Ontario, Canada)

It is shown that any multivariate polynomial of degree $d$ that can be computed sequentially in $C$ steps can be computed in parallel in $O((\log d)(\log C + \log d))$ steps using only $(Cd)^{O(1)}$ processors.

Keywords: parallel computation; polynomials; complexity theory
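
As a rough numeric illustration of the bound stated in this abstract, the following sketch ignores the constant factors hidden in the $O$-notation and assumes base-2 logarithms. The values $d = 2^{10}$ and $C = 2^{20}$ are chosen only for illustration and do not come from the paper.

$$
(\log_2 d)(\log_2 C + \log_2 d) = 10 \times (20 + 10) = 300
$$

Under these assumptions the parallel step count is on the order of a few hundred, compared with $C = 2^{20} \approx 10^6$ sequential steps, while the number of processors is bounded by a polynomial in $Cd = 2^{30}$.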


Acceleration of Parallel Computation for Derived Micro-Modeling Circuit by Exploiting GPU Memory Bandwidth Limit

IEEE-MTT-S International Conference on Numerical Electromagnetic and Multiphysics Modeling and Optimization for RF, Microwave, and Terahertz Applications (NEMO)

Authors: Dou, Yuhang; Wu, Ke-Li (Chinese Univ Hong Kong, Dept Elect & Engn, Hong Kong, Hong Kong, Peoples R China)

ISBN: (print) 9781509048373

It has been shown that a newly proposed micro-modeling method for deriving a concise passive circuit of a large-scale EM problem is highly suitable for GPU parallel computation. However, due to the memory bandwidth limit of the GPU, GPU utilization is far from peak performance because more than 97% of the processing time is occupied by frequent data transactions. This paper proposes an effective strategy for GPU acceleration of the micro-modeling algorithm, which can significantly reduce data transactions between the off-chip and on-chip memory of GPUs. A practical numerical example of a large-scale interconnection and packaging problem shows that the proposed strategy is effective and that the parallel computation of the micro-modeling circuit using GPUs will be further accelerated by one order of magnitude if 4 or more iterative derivation processes can be conducted in one run.

Keywords: Graphics processing unit (GPU); Model order reduction (MOR); micro-modeling circuit; parallel computation


The Parallel Computation of Green Function Based On the Characteristic Length of Ship

13th International Symposium on Distributed Computing and Applications to Business, Engineering and Science (DCABES)

Authors: Yu, Zixiang; Li, Dan; Jin Shengping; Gui, Yufeng; Zhang Shesheng (Wuhan Univ Technol, Assoc Math Modeling, Wuhan 430070, Peoples R China)

ISBN: (print) 9781479941698

A change in a ship's characteristic length can change the scientific research method. The paper obtains the calculation parameters of the Green function and their relational expression with a vessel's characteristic length by establishing an analysis theory of the dimension of ships' characteristic length based on the governing equation of the Green function, and gives a statistical expression for ships' characteristic length. Moreover, it constructs a parallel algorithm for the Green function. The numerical results show that the algorithm has a high parallel computing ratio.

Keywords: Green function; shipping; probability; parallel computation


On an Orthogonal Method of Finding Approximate Solutions of Ill-Conditioned Algebraic Systems and Parallel Computation

World Congress on Engineering (WCE 2013)

Authors: Otelbaev, M.; Tuleuov, B. I.; Zhussupova, D. (LN Gumilyov Eurasian Natl Univ, Dept Mech & Math, Astana 010008, Kazakhstan)

A new method of finding approximate solutions of linear algebraic systems with ill-conditioned or singular matrices, using Schmidt orthogonalization, is presented. This method can be effectively used for arranging par...

ISBN: (print) 9789881925107

Keywords: ill conditioned matrices; eigenvalues; approximate solutions; parallel computation; Schmidt orthogonalization


Data Compression and Parallel Computation Model Research under Big Data Environment

7th International Conference on Computer Communication and Informatics (ICCCI)

Authors: Sun, Yueqiu; Gong, Xian; Yang, Yihe (Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai 200433, Peoples R China; Shanghai Univ Finance & Econ, Sch Informat Management & Engn, Shanghai 200433, Peoples R China; Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100086, Peoples R China)

ISBN: (print) 9781467388566; 9781467388559

In a big data environment, data loss is a crucial issue that is likely to occur due to high network traffic, transmission delay, and limited bandwidth. This problem could be solved by adopting data compression schemes. These schemes can be classified into two types based on their behavior: lossless compression and lossy compression. With lossy compression, the output is not the same as the input. With lossless compression, the output is the same as the input data, so the network overhead could be increased. The existing fixed and variable length coding techniques have high robustness but poor efficiency. The efficiency problem can be solved by using the proposed scheme called "Data compression and parallel computation research model". This proposed model uses a more sophisticated coding technique for data compression and increases efficiency while reducing delay. Simulation results show that the proposed data compression and parallel computation research model has a better signal-to-noise ratio, increases efficiency, and reduces delay compared with the existing models.

Keywords: Data Compression; parallel computation; Big Data Environment; Storage Space; Network Performance; Network Delay
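
The distinction this abstract draws between lossless and lossy schemes can be sketched in a few lines. This is a minimal illustration only, not the coding technique proposed in the paper: it assumes zlib (DEFLATE) as a stand-in lossless codec and simple quantization as a stand-in lossy scheme, and the function names and sample values are hypothetical.

```python
import zlib


def lossless_round_trip(data):
    """Compress then decompress with zlib (DEFLATE), a lossless codec."""
    return zlib.decompress(zlib.compress(data))


def lossy_round_trip(samples, step=16):
    """Quantize each sample down to a multiple of `step` (illustrative lossy scheme)."""
    return [(s // step) * step for s in samples]


# Hypothetical sample values, chosen only for illustration.
samples = [3, 18, 37, 200, 255]
raw = bytes(samples)

restored = lossless_round_trip(raw)   # byte-for-byte identical to the input
approx = lossy_round_trip(samples)    # fine detail is discarded

print(restored == raw)  # True
print(approx)           # [0, 16, 32, 192, 240]
```

The lossless round trip reproduces the input exactly, while the quantized values differ from the originals and cannot be recovered.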


Faster Explicit Formulae for Computing Pairings via Elliptic Nets and Their Parallel Computation

11th International Workshop on Security (IWSEC)

Authors: Onuki, Hiroshi; Teruya, Tadanori; Kanayama, Naoki; Uchiyama, Shigenori (Tokyo Metropolitan Univ, 1-1 Minami Ohsawa, Hachioji, Tokyo 1920372, Japan; Natl Inst Adv Ind Sci & Technol, Koto Ku, 2-4-7 Aomi, Tokyo 1350064, Japan; Univ Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki 3058573, Japan)

ISBN: (print) 9783319445243; 9783319445236

In this paper, we discuss computations of optimal pairings over some pairing-friendly curves and a symmetric pairing over supersingular curves via elliptic nets. We show that optimal pairings can be computed more efficiently if we use twists of elliptic curves and give formulae for computing optimal pairings via elliptic nets of these twist curves. Furthermore, we propose parallel algorithms for these pairings and estimate the costs of these algorithms under certain reasonable assumptions.

Keywords: Optimal pairing; Symmetric pairing; Tate pairing; Elliptic net; parallel computation


High Performance Multicore SHA-256 Accelerator using Fully Parallel Computation and Local Memory

IEEE Symposium on Low-Power and High-Speed Chips (IEEE COOL CHIPS)

Authors: Van Dai Phan; Hoai Luan Pham; Thi Hong Tran; Nakashima, Yasuhiko (Nara Inst Sci & Technol, Grad Sch Informat Sci, 8916-5 Takayama Cho, Ikoma, Nara 6300192, Japan)

ISBN: (print) 9781665415033

Integrity checking is indispensable in the current technological age. One of the most popular algorithms for integrity checking is SHA-256. To achieve high performance, many applications implement SHA-256 in hardware. However, the processing rate of SHA-256 is often low due to the large number of computations. Besides, data must pass through many loop iterations to generate a hash, which requires transferring data multiple times between the accelerator and off-chip memory if local memory is not used. In this paper, an ALU combining fully parallel computation and pipeline layers is proposed to increase the SHA-256 processing rate. Moreover, local memory is attached near the ALU to reduce off-chip memory access during the iterations of computing. To achieve a high hash rate, we design a SoC-based multicore SHA-256 accelerator. As a result, our proposed accelerator enhances throughput by more than 40% and achieves 2x higher hardware efficiency compared with the state-of-the-art design.

Keywords: SHA-256; parallel computation; Local Memory; FPGA
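
As a loose software analogy to the multicore idea in this abstract, independent messages can be hashed concurrently, one message per worker. This sketch is not the paper's hardware design (the ALU, pipeline layers, and local memory are not modeled); it only assumes Python's standard `hashlib` and `concurrent.futures` modules, and the helper names `sha256_hex` and `hash_many` are hypothetical.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor


def sha256_hex(message):
    """Return the SHA-256 digest of `message` as a hex string."""
    return hashlib.sha256(message).hexdigest()


def hash_many(messages, workers=4):
    """Hash independent messages concurrently; results keep the input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(sha256_hex, messages))


# Hypothetical inputs, chosen only for illustration.
messages = [b"abc", b"", b"parallel computation"]
digests = hash_many(messages)

for message, digest in zip(messages, digests):
    print(message, digest)
```

The rounds inside a single SHA-256 computation depend on one another, so this sketch parallelizes only across messages, not within one hash.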


Research on Tool Path Planning Method for Five-axis NC Machining Based on Parallel Computation

IEEE International Conference on Automation and Logistics

Authors: Yu Wujia; Ning Fanghua (Hangzhou DianZi Univ, Sch Automat, Hangzhou, Zhejiang, Peoples R China; Zhejiang Sci Tech Univ, Coll Mech Engn & Automat, Hangzhou, Zhejiang, Peoples R China)

ISBN: (print) 9781424425020

Parallel computation is an effective technology for improving the execution performance of computer programs. In this paper, a new parallel machining-region planning method is presented to generate optimal tool paths for 5-axis sculptured surface machining. Building on existing machining region planning methods, the improved method adopts a data decomposition mode and OpenMP to implement the parallel computing strategy. By dividing the part surface into two or more areas, this method can generate tool paths with higher performance and shorter total length, which can reduce the cost of 5-axis machining. A computer implementation and examples are presented to demonstrate the validity of the new method.

Keywords: parallel computation; tool paths planning; five-axis


DroMPI: Parallel Computation Over Drop Computing

24th IEEE/ACM International Symposium on Cluster, Cloud, and Internet Computing (CCGrid)

Authors: Grosu, George-Mircea; Nistor, Silvia-Elena; Ciobanu, Radu-Ioan; Dobre, Ciprian; Pop, Florin (Natl Univ Sci & Technol Politehn Bucharest, Bucharest, Romania; Natl Inst Res & Dev Informat ICI Bucharest, Romania; Acad Romanian Scientists, Bucharest, Romania)

ISBN: (print) 9798350377521; 9798350377514

With the advancement of technology and the spread of multi-core systems, the need for parallelization arises and the interest in programming models is growing. At the same time, new distributed computing models have been proposed, being in fierce competition to obtain the highest possible performance. The Drop Computing Paradigm proposes the idea of decentralized computing over ad-hoc opportunistic networks of mobile and Edge devices. In this respect, the Drop Computing model does not only aim to achieve a minimum turnaround time but also to optimize other characteristics related to mobile devices, such as limited resources and opportunistic communication. Therefore, it is necessary to define a new programming model called DroMPI that intends to extend the capabilities of current parallel and distributed programming models, based on the Drop Computing paradigm. Therefore, the solution aims to develop a library that takes advantage of hardware capabilities in the interest of the Drop Computing paradigm and also provides programmers with a high-level programming interface. The library's features will be based on the Message Passing Interface (MPI) standard, which will be responsible for inter-node parallelization. The name of the library, DroMPI, is an acronym for Drop Computing and MPI. The implementation of the model will be responsible for the management of communication between nodes and for providing an Application Programming Interface (API) for the development of parallel applications in the Drop Computing paradigm.

Keywords: Decentralized Computing; parallel computation; Drop Computing; Application Programming Interface; MPI
