ISBN:
(Print) 9781665418379
The proceedings contain 7 papers. The topics discussed include: new challenges of benchmarking all-flash storage for HPC; understanding the I/O impact on the performance of high-throughput molecular docking; I/O bottleneck detection and tuning: connecting the dots using interactive log analysis; data-aware storage tiering for deep learning; SCTuner: an autotuner addressing dynamic I/O needs on supercomputer I/O subsystems; user-centric system fault identification using IO500 benchmark; and verifying IO synchronization from MPI traces.
ISBN:
(Print) 9781728171326
The proceedings contain 111 papers. The topics discussed include: a review of machine learning based recommendation approaches for cricket; operating of a drone using human intent recognition and characteristics of an EEG signal; role of Indian IT laws in smart healthcare devices in the intensive care unit in India; comparative analysis of different symmetric encryption techniques based on computation time; a study on analyzing the impact of feature selection on predictive machine learning algorithms; TCB minimization towards secured and lightweight IoT end device architecture using virtualization at fog node; a novel perspective to threat modelling using design thinking and agile principles; android malware detection using chi feature selection and ensemble learning method; and prediction and monitoring of air pollution using Internet of Things.
Preface: the 6th International Conference on Computing and Applied Informatics, AIP Conference Proceedings, Volume 2987, Issue 1, 19 April 2024, 010001, ht
Distributed databases are often used when scalability, fault tolerance, and high availability are crucial. They excel in scenarios where traditional, centralized databases may struggle to handle the increasing volume ...
The exponential growth in the amount of data generated by genomic studies of genetic diseases reflects the rapid development of this field. The limitations of traditional on-premises computing resources, in terms of c...
This paper proposes a novel parallel neighborhood-expansion-based algorithm for graph edge partitioning, aimed at addressing computational efficiency and scalability issues in large-scale graph data processing. The alg...
ISBN:
(Print) 9798350383638; 9798350383645
The inherent high sparsity of spiking neural networks (SNNs) and the low power consumption of their event-driven computing characteristics suit edge devices with extremely high energy-efficiency requirements. On resource-constrained mobile devices, memory savings are also required. Unlike conventional artificial neural networks, SNNs are suitable for processing complex temporal data. However, computing in the time dimension requires repeated access to the data across multiple time steps, resulting in high energy consumption. We propose Temporally Parallel Weight-Friendly (TPWF) dataflow, which reduces energy consumption through parallel computing across time steps. At the same time, considering the high sparsity of spike events, this paper proposes a sparse-aware strategy that realizes energy-efficient membrane-potential accumulation through a neuron burst weight-search circuit. Furthermore, this paper proposes an efficient synaptic memory structure to reduce hardware resource usage while maintaining performance and network size. Run-length encoding is used to record weights, realizing synaptic connections that can support different configurations, such as sparsely connected networks, and saving substantial memory. Using a fully connected 256-128-128-10 network to classify 16x16 MNIST training images achieves energy per synaptic operation (SOP) of 0.2 pJ, up to 1.9x speedup, and a 2x reduction in memory accesses.
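The run-length-encoded weight storage described in this abstract can be illustrated with a minimal sketch. The (zero-run, value) pair layout and the `rle_encode`/`rle_decode` helpers below are assumptions for illustration, not the paper's actual on-chip format:

```python
# Sketch of run-length encoding for a sparse weight row: zeros are not
# stored; each nonzero weight is recorded with the count of zeros before it.

def rle_encode(weights):
    """Encode a weight row as (zero_run_length, value) pairs."""
    encoded, run = [], 0
    for w in weights:
        if w == 0:
            run += 1
        else:
            encoded.append((run, w))
            run = 0
    return encoded

def rle_decode(encoded, length):
    """Reconstruct the dense row from (zero_run_length, value) pairs."""
    row, i = [0] * length, 0
    for run, w in encoded:
        i += run
        row[i] = w
        i += 1
    return row

row = [0, 0, 3, 0, 0, 0, -2, 1]
enc = rle_encode(row)          # [(2, 3), (3, -2), (0, 1)]
assert rle_decode(enc, len(row)) == row
```

For the highly sparse connectivity the paper targets, storing only the nonzero weights plus short run counts is what yields the memory savings: the denser the zeros, the fewer pairs are kept.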
In the current heterogeneous computing environment, different types of accelerated computing resources such as GPU (Graphics Processing Unit) and NPU (Neural Processing Unit) coexist in the cluster. However, because the...
ISBN:
(Print) 9798350383638; 9798350383645
Deep learning hardware accelerators commonly incorporate a substantial quantity of multiplier units. Yet the considerable complexity of multiplier circuits renders them a bottleneck, contributing to increased cost and latency. Approximate computing proves to be an effective strategy for mitigating the overhead associated with multipliers. This paper introduces an original approximation technique for signed multiplication on FPGAs. The approach involves a novel segmentation method applied to the Baugh-Wooley multiplication algorithm. Each segment is optimally accommodated within the look-up table resources of modern AMD-Xilinx FPGA families. The paper details the design of an INT8 multiplier using the proposed approach, presenting implementation results and accuracy assessments for the inference of benchmark deep learning models. The implementation results reveal significant savings of 53.6% in LUT utilization compared to the standard INT8 Xilinx multiplier. Accuracy measurements conducted on four popular deep learning benchmarks show an average accuracy degradation of 4.8% in post-training deployment and 0.7% after retraining. The source code for this work is available on GitHub(1).
This paper proposes a parallel random number generator (RNG) that uses a single linear feedback shift register (LFSR) to generate two distinct random numbers, achieving twice the operational speed of a traditional serial ...
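A minimal sketch of the two-steps-per-clock idea, assuming a 16-bit Fibonacci LFSR. The tap polynomial and the `step`/`step2` helpers are hypothetical, since the paper's actual design is not detailed in the abstract:

```python
# A 16-bit Fibonacci LFSR. In hardware, the "parallel" variant computes two
# successive feedback bits combinationally, so one clock of the parallel
# design advances the state as far as two clocks of the serial design.

TAPS = (15, 13, 12, 10)  # feedback taps for a maximal-length 16-bit LFSR

def step(state):
    """One serial LFSR step: shift left, feed XOR of tap bits into bit 0."""
    fb = 0
    for t in TAPS:
        fb ^= (state >> t) & 1
    return ((state << 1) | fb) & 0xFFFF

def step2(state):
    """Two LFSR steps per call: the second feedback bit is derived from the
    intermediate state, mirroring a two-steps-per-clock datapath."""
    return step(step(state))

state = 0xACE1
assert step2(state) == step(step(state))
```

The equivalence checked at the end is the core of the scheme: unrolling the feedback logic doubles the state advance per cycle, which is how a single LFSR can emit two random values per clock.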