检索结果-内蒙古大学图书馆

The Heat Equation: High-Performance Scientific Computing Case Study

COMPUTING IN SCIENCE & ENGINEERING 2018年第5期20卷 114-127页

作者： Schuster, Micah D. Wentworth Inst Technol Dept Comp Sci & Networking Boston MA 02115 USA

In recent years, high performance computing and powerful supercomputers are becoming a staple in many areas of academia and industry. The author introduces the concepts of shared memory programming in the context of solving the heat equation, which will allow the exploration of several finite difference and parallelization schemes.

关键词： Diffusion Finite Difference Methods Parabolic Equations Parallel Programming Partial Differential Equations Physics Computing Shared Memory Systems Powerful Supercomputers Academia Industry Shared Memory Programming Heat Equation High Performance Scientific Computing Case Study Finite Difference scheme parallelization scheme Mathematical Model Instruction Sets Heating Systems Parallel Processing Runtime Task Analysis Your Homework Assignment Heat Equation High Performance Computing HPC Scientific Computing Shared Memory Programming Open MP

来源：评论

学校读者我要写书评

暂无评论

The Potential of the Intel® Xeon Phi™ for Supervised Deep Learning 17

The Potential of the Intel® Xeon Phi™ for Supervised Deep ...

引用

2015 IEEE 17th International Conference on High Performance Computing and Communications (HPCC)

作者： Viebke, Andre Pllana, Sabri Linnaeus Univ Dept Comp Sci S-35195 Vaxjo Sweden

ISBN: (纸本)9781479989379

Supervised learning of Convolutional Neural Networks (CNNs), also known as supervised Deep Learning, is a computationally demanding process. To find the most suitable parameters of a network for a given application, numerous training sessions are required. Therefore, reducing the training time per session is essential to fully utilize CNNs in practice. While numerous research groups have addressed the training of CNNs using GPUs, so far not much attention has been paid to the Intel Xeon Phi coprocessor. In this paper we investigate empirically and theoretically the potential of the Intel Xeon Phi for supervised learning of CNNs. We design and implement a parallelization scheme named CHAOS that exploits both the thread-and SIMD-parallelism of the coprocessor. Our approach is evaluated on the Intel Xeon Phi 7120P using the MNIST dataset of handwritten digits for various thread counts and CNN architectures. Results show a 103.5x speed up when training our large network for 15 epochs using 244 threads, compared to one thread on the coprocessor. Moreover, we develop a performance model and use it to assess our implementation and answer what-if questions.

关键词： graphics processing units learning (artificial intelligence) neural nets parallel processing CHAOS CNN GPU Intel Xeon Phi 7120P Intel Xeon Phi coprocessor SIMD-parallelism convolutional neural network parallelization scheme supervised deep learning thread-parallelism Biological neural networks Chaos Coprocessors Instruction sets Machine learning Neurons Training Convolutional Neural Networks Intel Xeon Phi Many-core Parallel Computing Supervised Deep Learning

来源：评论

学校读者我要写书评

暂无评论

DOVIS 2.0: an efficient and easy to use parallel virtual screening tool based on AutoDock 4.0

引用

CHEMISTRY CENTRAL JOURNAL 2008年第1期2卷 18-18页

作者： Jiang, Xiaohui Kumar, Kamal Hu, Xin Wallqvist, Anders Reifman, Jaques USA Med Res & Mat Command Biotechnol HPC Software Applicat Inst Telemed & Adv Technol Res Ctr Ft Detrick MD 21702 USA

Background: Small-molecule docking is an important tool in studying receptor-ligand interactions and in identifying potential drug candidates. Previously, we developed a software tool (DOVIS) to perform large-scale virtual screening of small molecules in parallel on Linux clusters, using AutoDock 3.05 as the docking engine. DOVIS enables the seamless screening of millions of compounds on high-performance computing platforms. In this paper, we report significant advances in the software implementation of DOVIS 2.0, including enhanced screening capability, improved file system efficiency, and extended usability. Implementation: To keep DOVIS up-to-date, we upgraded the software's docking engine to the more accurate AutoDock 4.0 code. We developed a new parallelization scheme to improve runtime efficiency and modified the AutoDock code to reduce excessive file operations during large-scale virtual screening jobs. We also implemented an algorithm to output docked ligands in an industry standard format, sd-file format, which can be easily interfaced with other modeling programs. Finally, we constructed a wrapper-script interface to enable automatic rescoring of docked ligands by arbitrarily selected third-party scoring programs. Conclusion: The significance of the new DOVIS 2.0 software compared with the previous version lies in its improved performance and usability. The new version makes the computation highly efficient by automating load balancing, significantly reducing excessive file operations by more than 95%, providing outputs that conform to industry standard sd-file format, and providing a general wrapper-script interface for rescoring of docked ligands. The new DOVIS 2.0 package is freely available to the public under the GNU General Public License.

关键词： Dock Root Mean Square Deviation Virtual Screening parallelization scheme Energy Grid

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：