检索结果-内蒙古大学图书馆

arXiv 2019年

作者： Somnath, Suhas Smith, Chris R. Laanait, Nouamane Vasudevan, Rama K. Ievlev, Anton Belianinov, Alex Lupini, Andrew R. Shankar, Mallikarjun Kalinin, Sergei V. Jesse, Stephen Advanced Data and Workflows Group National Center for Computational Sciences Center for Nanophase Materials Sciences Computational Chemical and Materials Sciences Computational Science and Engineering Division Oak Ridge National Laboratory Oak RidgeTN37831 United States

Materials science is undergoing profound changes due to advances in characterization instrumentation that have resulted in an explosion of data in terms of volume, velocity, variety and complexity. Harnessing these data for scientific research requires an evolution of the associated computing and data infrastructure, bridging scientific instrumentation with super- and cloud-computing. Here, we describe Universal Spectroscopy and Imaging data (USID), a data model capable of representing data from most common instruments, modalities, dimensionalities, and sizes. We pair this schema with the hierarchical data file format (HDF5) to maximize compatibility, exchangeability, traceability, and reproducibility. We discuss a family of community-driven, open-source, and free python software packages for storing, processing and visualizing data. The first is pyUSID which provides the tools to read and write USID HDF5 files in addition to a scalable framework for parallelizing data analysis. The second is Pycroscopy, which provides algorithms for scientific analysis of nanoscale imaging and spectroscopy modalities and is built on top of pyUSID and USID. The instrument-agnostic nature of USID facilitates the development of analysis code independent of instrumentation and task in Pycroscopy which in turn can bring scientific communities together and break down barriers in the age of open-science. The interested reader is encouraged to be a part of this ongoing community-driven effort to collectively accelerate materials research and discovery through the realms of big data. Copyright © 2019, The Authors. All rights reserved.

关键词： Open systems

来源：评论

学校读者我要写书评

暂无评论

Big data analytics on HPC architectures: Performance and cost

Big data analytics on HPC architectures: Performance and cos...

引用

IEEE International Conference on Big data

作者： Peter Xenopoulos Jamison Daniel Michael Matheson Sreenivas Sukumar Pomona College Claremont CA Advanced Data and Workflows Group Oak Ridge National Laboratory Oak Ridge TN

ISBN: (纸本)9781467390064

data driven science, accompanied by the explosion of petabytes of data, has called into need dedicated analytics computing resources. Dedicated analytics clusters require large capital outlays due to their expensive hardware requirements. Additionally, if such resources are located far from the data they analyze, they also incur substantial data transfer, which has both cost and latency implications. In this paper, we benchmark a variety of high-performance computing (HPC) architectures for classic data science algorithms, as well as conduct a cost analysis of these architectures. Additionally, we compare algorithms across analytic frameworks, as well as explore hidden costs in the form of queuing mechanisms. We observe that node architectures with large memory and high memory bandwidth are better suited for big data analytics on HPC hardware. We also conclude that cloud computing is more cost effective for small or experimental data workloads, but HPC is more cost effective at scale. Additionally, we quantify the hidden costs of queuing and how it relates to data science workloads. Finally, we observe that software developed for the cloud, such as Spark, performs significantly worse than pbdR when run in HPC environments.

关键词： Computer architecture Cloud computing Hardware Benchmark testing Sparks Matrix decomposition data science

来源：评论

学校读者我要写书评

暂无评论

data Mining for better material synthesis: the case of pulsed laser deposition of complex oxides

arXiv

引用

arXiv 2017年

作者： Young, Steven R. Maksov, Artem Ziatdinov, Maxim Cao, Ye Burch, Matthew Balachandran, Janakiraman Li, Linglong Somnath, Suhas Patton, Robert M. Kalinin, Sergei V. Vasudevan, Rama K. Computational Sciences and Engineering Division Institute for Functional Imaging of Materials Center for Nanophase Materials Sciences Oak Ridge National Laboratory Oak Ridge TN37831 United States Bredesen Center for Interdisciplinary Research University of Tennessee KnoxvilleTN37996 United States Multi-disciplinary Materials Research Center Frontier Institute of Science and Technology Xi’an Jiaotong University Xi’anShaanxi710049 China Advanced Data and Workflows Group National Center for Computational Sciences Oak Ridge National Laboratory Oak RidgeTN37831 United States

The pursuit of more advanced electronics, and finding solutions to energy needs often hinges upon the discovery and optimization of new functional materials. However, the discovery rate of these materials is alarmingly low. Much of the information that could drive this rate higher is scattered across tens of thousands of papers in the extant literature published over several decades but is not in an indexed form, and cannot be used in entirety without substantial effort. Many of these limitations can be circumvented if the experimentalist has access to systematized collections of prior experimental procedures and results. Here, we investigate the property-processing relationship during growth of oxide films by pulsed laser deposition. To do so, we develop an enabling software tool to (1) mine the literature of relevant papers for synthesis parameters and functional properties of previously studied materials, (2) enhance the accuracy of this mining through crowd sourcing approaches, (3) create a searchable repository that will be a community-wide resource enabling material scientists to leverage this information, and (4) provide through the Jupyter notebook platform, simple machine-learning-based analysis to learn the complex interactions between growth parameters and functional properties (all data/codes available on https://***/ORNL-dataMatls). The results allow visualization of growth windows, trends and outliers, and which can serve as a template for analyzing the distribution of growth conditions, provide starting points for related compounds and act as feedback for first-principles calculations. Such tools will comprise an integral part of the materials design schema in the coming decade. Copyright © 2017, The Authors. All rights reserved.

关键词： Pulsed laser deposition

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：