ISBN (digital): 9798331533991
ISBN (print): 9798331534004
Cloud storage is a vital component of cloud architecture, often utilizing distributed key-value stores like Amazon S3 and Google Cloud Storage for managing data and metadata. These systems distribute data across nodes using key-range partitioning or consistent hashing, but they face challenges such as load imbalance and limited parallelism due to uneven data distribution and varying node performance. Current implementations, such as MongoDB, address these imbalances by migrating data between nodes but often neglect the characteristics of the underlying data structures, leading to increased overhead from costly delete and insert operations. To address these issues, this design leverages the properties of the LSM tree, a commonly used storage engine, to optimize data migration. The approach introduces hot-zone prediction using nonlinear regression to accurately identify data hotspots based on key characteristics, insertion time, and TTL. A storage-engine-aware migration system is developed to migrate grouped SSTable files rather than individual key-value pairs, significantly reducing migration overhead. Additionally, the data-migration I/O process is offloaded using the NVMe-oF protocol, minimizing CPU involvement and preserving node performance. Implemented on mongo-rocks, this solution improves load balancing by directly moving SSTable files across nodes, enhancing efficiency and reducing performance degradation in distributed key-value stores.
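The core idea above, moving whole SSTable files whose key ranges fall inside a predicted hot zone rather than re-inserting individual key-value pairs, can be illustrated with a minimal sketch. This is not the paper's implementation: the SSTableMeta fields, the lexicographic key comparison, and the hot-zone boundaries are assumptions made for illustration.

```cpp
// Minimal sketch (not the paper's code): pick whole SSTable files whose key
// ranges overlap a predicted hot zone and hand them off as one migration unit,
// instead of deleting/re-inserting individual key-value pairs.
#include <iostream>
#include <string>
#include <vector>

struct SSTableMeta {            // hypothetical per-file metadata
    std::string path;           // on-disk SSTable file
    std::string min_key, max_key;
};

// Key ranges are compared lexicographically here (an assumption).
static bool overlaps(const SSTableMeta& f,
                     const std::string& hot_lo, const std::string& hot_hi) {
    return !(f.max_key < hot_lo || hot_hi < f.min_key);
}

// Group every SSTable that intersects the predicted hot zone [hot_lo, hot_hi].
std::vector<SSTableMeta> select_migration_group(
        const std::vector<SSTableMeta>& files,
        const std::string& hot_lo, const std::string& hot_hi) {
    std::vector<SSTableMeta> group;
    for (const auto& f : files)
        if (overlaps(f, hot_lo, hot_hi)) group.push_back(f);
    return group;
}

int main() {
    std::vector<SSTableMeta> files = {
        {"000012.sst", "a", "f"}, {"000013.sst", "g", "m"}, {"000014.sst", "n", "z"}};
    // Suppose the regression model predicts keys "h".."p" are becoming hot.
    for (const auto& f : select_migration_group(files, "h", "p"))
        std::cout << "migrate whole file: " << f.path << '\n';   // e.g. via NVMe-oF
}
```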
The binning of metagenomic sequences is one of the crucial steps in metagenomic projects, which allow the study of uncultured organisms. Although these projects need to analyze a huge amount of data, most available binning m...
ISBN (print): 9783030602451; 9783030602444
The B+-tree is an important index in the fields of data warehousing and database management systems. With the development of new hardware technologies, the B+-tree needs to be revisited to fully take advantage of hardware resources. In this paper, we focus on optimization techniques to increase the search performance of B+-trees on the coupled CPU-GPU architecture. First, we propose a hierarchical searching approach on the single coupled GPU to efficiently deal with the leaf nodes of B+-trees. It adopts a flexible strategy to determine the number of work items in a work group used to search one key, in order to reduce irregular memory accesses and divergent branches in the work group. Second, we present a co-processing pipeline method on the coupled architecture. The CPU and the integrated GPU process the sorting and searching tasks simultaneously to hide sorting and partial searching latencies. A distribution model is designed to support the workload-balance strategy based on real-time performance. Our performance study shows that the hierarchical searching scheme provides an improvement of up to 36% on the GPU compared to the baseline algorithm with a fixed number of work items, and the co-processing pipeline method further increases the throughput by a factor of 1.8. To the best of our knowledge, this paper is the first study to consider both the CPU and the coupled GPU to optimize B+-tree searches.
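The distribution model for workload balance is described only at a high level. The minimal sketch below, assuming the split is simply proportional to the throughputs measured in the previous round, shows the kind of calculation such a model might perform; the function and parameter names are placeholders, not the paper's API.

```cpp
// Minimal sketch (assumed, not the paper's code): split a batch of search keys
// between the CPU and the integrated GPU in proportion to their most recently
// measured throughputs, in the spirit of the distribution model described above.
#include <cstddef>
#include <iostream>

struct Split { std::size_t cpu_keys, gpu_keys; };

Split distribute(std::size_t batch, double cpu_tp, double gpu_tp) {
    // cpu_tp / gpu_tp: keys per second observed in the previous round.
    double total = cpu_tp + gpu_tp;
    std::size_t cpu = static_cast<std::size_t>(batch * (cpu_tp / total) + 0.5);
    if (cpu > batch) cpu = batch;
    return {cpu, batch - cpu};
}

int main() {
    // Example: the GPU searched 1.8x faster last round, so it gets ~64% of the keys.
    Split s = distribute(100000, 1.0e6, 1.8e6);
    std::cout << "CPU: " << s.cpu_keys << "  GPU: " << s.gpu_keys << '\n';
}
```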
The paper is devoted to the investigation of the generic complexity of the algorithmic problem of solving systems of equations in finite groups, finite semigroups, and finite fields. We show that if this problem is intracta...
Parallel column models were first proposed in the 1950s for investigating the sensitivity of the HETP in a packed column to maldistribution. Although it is possible to develop a parallel column model using a process simulator, it may take considerable effort to arrange and specify the columns. Recently, we described what we call a Parallel Column Model (PCM) with the aim of making it easier to model Dividing Wall Columns (DWCs); we realized that it should also be possible to use the PCM to model maldistribution in packed columns. Both equilibrium-stage and rate-based column models can easily be used within this framework. In this paper we review the literature on simulating packed columns with maldistribution. We also show how easily our PCM may be used to describe maldistribution in packed columns and how our results match those obtained in earlier papers. We propose a simple bed-effectiveness approximation that can use the results from a regular column simulation and assign stages to specific packed beds such that the resulting column suffers less from liquid maldistribution. We illustrate the use of this approximation with two practical examples involving the design of commercial-scale columns.
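The bed-effectiveness approximation itself is not given in the abstract, so the sketch below only illustrates the general idea of assigning equilibrium stages to individual packed beds, assuming each bed's HETP is degraded by an effectiveness factor below one when liquid maldistribution is present. All names and numbers are illustrative assumptions, not the authors' method.

```cpp
// Illustrative sketch only (the paper's bed-effectiveness approximation is not
// reproduced here): assign equilibrium stages to each packed bed from its
// height, base HETP, and an assumed effectiveness factor < 1 that accounts for
// liquid maldistribution.
#include <cmath>
#include <iostream>
#include <vector>

struct Bed { double height_m, hetp_m, effectiveness; };

int stages_for(const Bed& b) {
    // The effective HETP grows as effectiveness drops, so fewer stages per bed.
    double effective_hetp = b.hetp_m / b.effectiveness;
    return static_cast<int>(std::floor(b.height_m / effective_hetp));
}

int main() {
    std::vector<Bed> beds = {{6.0, 0.5, 0.95}, {8.0, 0.5, 0.85}};  // assumed values
    for (std::size_t i = 0; i < beds.size(); ++i)
        std::cout << "bed " << i + 1 << ": " << stages_for(beds[i]) << " stages\n";
}
```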
This paper aims to study the efficiency of various seq2seq deep learning architectures for toxic speech classification and for performing efficient sentiment analysis using unilingual, publicly available da...
ISBN (print): 9781665423144
With its strong floating-point operation capability and high memory bandwidth for data parallelism, the graphics processing unit (GPU) has been widely used in general-purpose computing. GPU-based computations have been extensively applied in the field of computational fluid dynamics (CFD). This paper aims to design a highly efficient double-precision GPU-accelerated parallel algorithm for supersonic flow computations on hybrid grids. Compute Unified Device Architecture (CUDA) is used as a general-purpose parallel computing platform and programming model to run the parallel computing code on GPUs. The cell-centered finite volume method based on unstructured grids is used for the spatial discretization of the governing equations, whereas the three-stage explicit Runge-Kutta scheme with second-order accuracy is used for temporal discretization. Turbulence is modeled using the k-omega SST two-equation model. Three test cases are studied to validate the computational accuracy of the proposed algorithm. The numerical results agree well with the experimental data, suggesting that the GPU-accelerated parallel algorithm has good accuracy.
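For readers unfamiliar with multistage explicit schemes, the sketch below shows the shape of a three-stage explicit Runge-Kutta update of the kind referred to above; the residual function is a toy placeholder for the finite-volume flux sum, and the stage coefficients are assumed, not taken from the paper.

```cpp
// Minimal sketch of a three-stage explicit Runge-Kutta time step; the residual
// and the stage coefficients are illustrative placeholders, not the paper's scheme.
#include <iostream>
#include <vector>

using State = std::vector<double>;   // one conserved value per cell, for brevity

// Placeholder residual R(U): in a real solver this is the finite-volume flux sum.
State residual(const State& u) {
    State r(u.size());
    for (std::size_t i = 0; i < u.size(); ++i) r[i] = -u[i];   // toy decay model
    return r;
}

// U^(k) = U^n + alpha_k * dt * R(U^(k-1)), k = 1..3 (alpha values assumed).
State rk3_step(const State& un, double dt) {
    const double alpha[3] = {1.0 / 3.0, 0.5, 1.0};
    State u = un;
    for (double a : alpha) {
        State r = residual(u);
        for (std::size_t i = 0; i < u.size(); ++i) u[i] = un[i] + a * dt * r[i];
    }
    return u;
}

int main() {
    State u = {1.0, 2.0, 3.0};
    u = rk3_step(u, 0.01);
    for (double v : u) std::cout << v << ' ';
    std::cout << '\n';
}
```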
ISBN (print): 9783030602451; 9783030602444
The recent prevalence of positioning sensors and mobile devices generates a massive amount of spatial-temporal data from moving objects in real time. As one of the fundamental processes in data analysis, clustering of spatial-temporal data enables various applications, such as event detection and travel-pattern extraction. However, most existing works focus only on the offline scenario, which is not applicable to online, time-sensitive applications due to their low efficiency and ignorance of temporal features. In this paper, we propose a distributed streaming framework for spatial-temporal data clustering, which accepts various clustering algorithms while ensuring low resource consumption and result correctness. The framework includes a dynamic partitioning strategy for continuous load balancing and a cluster-merging algorithm based on convex hulls [10], which guarantees result correctness. Extensive experiments on a real dataset demonstrate the effectiveness of our proposed framework and its advantage over existing solutions.
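A convex-hull-based merge of the kind mentioned above can be sketched as follows: when two partial clusters from neighbouring partitions are combined, only their hull vertices need to be exchanged and re-hulled. The code uses Andrew's monotone chain algorithm; it is an illustrative sketch under that assumption, not the framework's implementation.

```cpp
// Minimal sketch (assumed, not the framework's code): merge two partial
// clusters by recomputing the convex hull of their combined hull points.
#include <algorithm>
#include <iostream>
#include <vector>

struct Pt { double x, y; };

static double cross(const Pt& o, const Pt& a, const Pt& b) {
    return (a.x - o.x) * (b.y - o.y) - (a.y - o.y) * (b.x - o.x);
}

// Andrew's monotone chain: returns hull vertices in counter-clockwise order.
std::vector<Pt> convex_hull(std::vector<Pt> p) {
    std::sort(p.begin(), p.end(), [](const Pt& a, const Pt& b) {
        return a.x < b.x || (a.x == b.x && a.y < b.y);
    });
    if (p.size() < 3) return p;
    std::vector<Pt> h(2 * p.size());
    std::size_t k = 0;
    for (std::size_t i = 0; i < p.size(); ++i) {                  // lower hull
        while (k >= 2 && cross(h[k - 2], h[k - 1], p[i]) <= 0) --k;
        h[k++] = p[i];
    }
    for (std::size_t i = p.size() - 1, t = k + 1; i-- > 0;) {     // upper hull
        while (k >= t && cross(h[k - 2], h[k - 1], p[i]) <= 0) --k;
        h[k++] = p[i];
    }
    h.resize(k - 1);
    return h;
}

int main() {
    std::vector<Pt> a = {{0, 0}, {1, 2}, {2, 0}};      // hull of cluster A
    std::vector<Pt> b = {{1.5, 1}, {3, 1}, {2.5, 3}};  // hull of cluster B
    a.insert(a.end(), b.begin(), b.end());
    for (const Pt& p : convex_hull(a)) std::cout << '(' << p.x << ',' << p.y << ") ";
    std::cout << '\n';
}
```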
ISBN (print): 9783030389611; 9783030389604
Here, we describe a method for handling large graphs with data sizes exceeding memory capacity using minimal hardware resources. This method (called Pimiento) is a vertex-centric graph-processing framework on a single machine and represents a semi-external graph-computing system, where all vertices are stored in memory and all edges are stored externally in the compressed sparse row (CSR) storage format. Pimiento uses a multi-core CPU, memory, and multi-threaded data preprocessing to optimize disk I/O in order to reduce random-access overhead during graph-algorithm execution. An on-the-fly update-accumulation mechanism was designed to reduce the time the graph algorithm spends accessing disks during execution. Our experiments compared Pimiento with other graph-processing systems, including GraphChi, X-Stream, and FlashGraph, revealing that Pimiento achieved 7.5x, 4x, and 1.6x better performance, respectively, on large real-world and synthetic graphs in the same experimental environment.
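The semi-external layout, with vertex state and CSR offsets in memory and the edge array streamed from disk, can be sketched as follows. This is an assumed, self-contained illustration, not Pimiento's code; the file name, data types, and the toy vertex-centric pass are placeholders.

```cpp
// Minimal sketch of a semi-external CSR layout: per-vertex state and CSR
// offsets stay in memory, while the edge array is read from a binary file on
// disk, one vertex's adjacency slice at a time.
#include <cstdint>
#include <fstream>
#include <iostream>
#include <vector>

int main() {
    // CSR offsets for 4 vertices; offsets[v]..offsets[v+1] index into edges.bin.
    std::vector<std::uint64_t> offsets = {0, 2, 3, 5, 6};
    std::vector<std::uint32_t> value(4, 0);            // in-memory vertex state

    // Write a tiny edge file so the example is self-contained.
    {
        std::vector<std::uint32_t> edges = {1, 2, 2, 0, 3, 0};
        std::ofstream out("edges.bin", std::ios::binary);
        out.write(reinterpret_cast<const char*>(edges.data()),
                  edges.size() * sizeof(std::uint32_t));
    }

    // One vertex-centric pass: each vertex adds 1 to every out-neighbour,
    // reading only its own adjacency slice from disk.
    std::ifstream in("edges.bin", std::ios::binary);
    for (std::size_t v = 0; v + 1 < offsets.size(); ++v) {
        std::uint64_t deg = offsets[v + 1] - offsets[v];
        std::vector<std::uint32_t> nbrs(deg);
        in.seekg(static_cast<std::streamoff>(offsets[v] * sizeof(std::uint32_t)));
        in.read(reinterpret_cast<char*>(nbrs.data()), deg * sizeof(std::uint32_t));
        for (std::uint32_t u : nbrs) ++value[u];
    }
    for (std::size_t v = 0; v < value.size(); ++v)
        std::cout << "vertex " << v << " received " << value[v] << " messages\n";
}
```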
ISBN (print): 9781728160443
Manycore architectures are mainly composed of a very large number of computing nodes interconnected by a multiplicity of links, usually forming a NoC-like mesh architecture. High-speed links provide higher throughput but are much more expensive than normal links, making the interconnection of the system a cost/performance trade-off. Simulating such architectures is very important in order to characterise the optimal network topology for a given problem. In this work we introduce SCALPsim, a simulation framework for evaluating routing algorithms and network properties in 1-D, 2-D and 3-D regular mesh topologies that simultaneously use links with different latency and throughput characteristics. These features are particularly interesting in large-scale systems with processing elements grouped into clusters, where communication properties differ greatly within and between clusters. This paper presents the framework and an application based on Cellular Self-Organizing Maps (CSOM).
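A minimal sketch of the kind of evaluation such a simulator performs is given below: estimating the latency of an XY-routed packet on a 2-D mesh in which links inside a cluster are fast and links crossing a cluster boundary are slower. The routing rule, cluster tiling, and latency values are assumptions for illustration, not SCALPsim's actual model.

```cpp
// Minimal sketch (assumed, not SCALPsim itself): latency of an XY-routed packet
// on a 2-D mesh with fast intra-cluster links and slower inter-cluster links.
#include <iostream>

struct Node { int x, y; };

// Cluster index of a coordinate when the mesh is tiled into fixed-size blocks.
static int cluster_of(int coord, int cluster_dim) { return coord / cluster_dim; }

// XY routing: move along X first, then along Y; sum per-hop link latencies.
double xy_latency(Node src, Node dst, int cluster_w, int cluster_h,
                  double intra_lat, double inter_lat) {
    double total = 0.0;
    Node cur = src;
    while (cur.x != dst.x || cur.y != dst.y) {
        Node next = cur;
        if (cur.x != dst.x) next.x += (dst.x > cur.x) ? 1 : -1;
        else                next.y += (dst.y > cur.y) ? 1 : -1;
        bool crosses =
            cluster_of(cur.x, cluster_w) != cluster_of(next.x, cluster_w) ||
            cluster_of(cur.y, cluster_h) != cluster_of(next.y, cluster_h);
        total += crosses ? inter_lat : intra_lat;
        cur = next;
    }
    return total;
}

int main() {
    // 8x8 mesh tiled into 4x4 clusters; inter-cluster links are 4x slower.
    std::cout << xy_latency({1, 1}, {6, 5}, 4, 4, 1.0, 4.0) << " cycles\n";
}
```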