检索结果-内蒙古大学图书馆

DOCKER UNIFIED UIMA INTERFACE: New perspectives for nlp on big data

SOFTWAREX 2025年 29卷

作者： Abrami, Giuseppe Genios, Markos Fitzermann, Filip Baumartz, Daniel Mehler, Alexander Goethe Univ Frankfurt Texttechnol Frankfurt Germany

Processing large amounts of natural language text using machine learning-based models is becoming important in many disciplines. This demand is being met by a variety of approaches, resulting in the heterogeneous deployment of separate, partly incompatible, not natively scalable applications. To overcome the technological bottleneck involved, we have developed DOCKER UNIFIED UIMA INTERFACE, a system for the standardized, parallel, platform-independent, distributed and microservices-based solution for processing large and extensive text corpora with any nlp method. We present DUUI as a framework that enables automated orchestration of GPU-based nlp processes beyond the existing Docker Swarm cluster variant, and in addition to the adaptation to new runtime environments such as Kubernetes. Therefore, anew driver for DUUI is introduced, which enables the lightweight orchestration of DUUI processes within a Kubernetes environment in a scalable setup. In this way, the paper opens up novel text-technological perspectives for existing practices indisciplines that deal with the scientific analysis of large amounts of data based on nlp.

关键词： Docker Kubernetes UIMA distributed nlp

来源：评论

学校读者我要写书评

暂无评论

In-storage Processing of I/O Intensive Applications on Computational Storage Drives 23

In-storage Processing of I/O Intensive Applications on Compu...

引用

23rd International Symposium on Quality Electronic Design (ISQED)

作者： HeydariGorji, Ali Torabzadehkashi, Mahdi Rezaei, Siavash Bobarshad, Hossein Alves, Vladimir Chou, Pai H. Univ Calif Irvine Irvine CA 92697 USA NGD Syst Inc Irvine CA USA

ISBN: (纸本)9781665494663

Computational storage drives (CSD) are solid-state drives (SSD) empowered by general-purpose processors that can perform in-storage processing. They have the potential to improve both performance and energy significantly for big-data analytics by bringing compute to data, thereby eliminating costly data transfer while offering better privacy. In this work, we introduce Solana, the first-ever high-capacity(12-TB) CSD in El.S form factor, and present an actual prototype for evaluation. To demonstrate the benefits of in-storage processing on CSD, we deploy several natural language processing (nlp) applications on datacenter-grade storage servers comprised of clusters of the Solana. Experimental results show up to 3.1x speedup in processing while reducing the energy consumption and data transfer by 67% and 68%, respectively, compared to regular enterprise SSDs.

关键词： Computational Storage Drives In-Storage Processing Near-data processing Natural Language Processing distributed nlp

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：