ISBN: (Print) 9781665435741
The proceedings contain 222 papers. The topics discussed include: DRL-deploy: adaptive service function chains deployment with deep reinforcement learning; accuracy vs. efficiency: achieving both through hardware-aware quantization and reconfigurable architecture with mixed precision; cmss: collaborative modeling of safety and security requirements for network protocols; FGPA: fine-grained pipelined acceleration for depthwise separable CNN in resource constraint scenarios; Dyacon: JointCloud dynamic access control model of data security based on verifiable credentials; understanding the runtime overheads of deep learning inference on edge devices; and alleviating imbalance in synchronous distributed training of deep neural networks.
ISBN: (Print) 9781665464970
The proceedings contain 117 papers. The topics discussed include: detection of a novel dual attack in named data networking; fair DMA scheduler for low-latency accelerator offloading; multi-attribute decision-making method based on interval intuitionistic trapezoidal fuzzy number to determine the expert weight; binary-level directed symbolic execution through pattern learning; an efficient metric-based approach for static use-after-free detection; a graph convolution neural network based method for insider threat detection; maintenance worker scheduling for charging pile fault: a multi-agent RL approach; towards secure bilateral friend query with conjunctive policy matching in social networks; structure-noise-aware anchor link prediction across social networks; file system to support secure cloud-based sharing; discovering agent models using process mining: initial approach and a case study; and towards agent-based simulation of the parallel trading market of pharmaceuticals.
Sparse general matrix-matrix multiplication (SpGEMM) is a core primitive for numerous scientific applications. Traditional hash-based approaches fail to strike a balance between reducing hash collisions and efficiently...
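To make the hash-based accumulation step concrete, the following minimal sketch (illustrative only, not this paper's implementation; the function and variable names are assumed) shows how a hash table accumulates the partial products for one output row of C = A x B with both matrices in CSR form, and where the collision cost mentioned above arises.

# Minimal sketch of hash-based SpGEMM accumulation for a single output row.
# A and B are given in CSR form as (indptr, indices, data) tuples; names are illustrative.
def spgemm_row_hash(row, A, B):
    a_indptr, a_indices, a_data = A
    b_indptr, b_indices, b_data = B
    acc = {}  # hash table: output column index -> accumulated value
    for k in range(a_indptr[row], a_indptr[row + 1]):
        col_a, val_a = a_indices[k], a_data[k]
        # Multiply the A(row, col_a) entry against the corresponding row of B
        for j in range(b_indptr[col_a], b_indptr[col_a + 1]):
            col_b = b_indices[j]
            # Each insertion/update may collide with earlier keys; resolving
            # those collisions is the cost the abstract refers to.
            acc[col_b] = acc.get(col_b, 0.0) + val_a * b_data[j]
    # Return the nonzeros of row 'row' of C, sorted by column index
    return sorted(acc.items())

# Example: the 2x2 identity matrix times itself, both operands in CSR form
I = ([0, 1, 2], [0, 1], [1.0, 1.0])
print(spgemm_row_hash(0, I, I))  # [(0, 1.0)]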
In the era of big data, efficiently processing and retrieving insights from unstructured data presents a critical challenge. This paper introduces a scalable leader-worker distributed data pipeline designed to handle ...
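As a rough illustration of the leader-worker pattern described above (a sketch under assumed details, not the paper's distributed system; all names are hypothetical), a leader process can partition incoming documents and hand them to worker processes over a queue.

# Illustrative leader-worker pipeline using Python's multiprocessing;
# the real system runs across machines, this only shows the control pattern.
from multiprocessing import Process, Queue

def worker(task_queue, result_queue):
    while True:
        doc = task_queue.get()
        if doc is None:                      # sentinel: leader signals shutdown
            break
        result_queue.put(len(doc.split()))   # stand-in for real unstructured-data processing

def leader(documents, num_workers=4):
    tasks, results = Queue(), Queue()
    workers = [Process(target=worker, args=(tasks, results)) for _ in range(num_workers)]
    for w in workers:
        w.start()
    for doc in documents:                    # leader partitions and dispatches the workload
        tasks.put(doc)
    for _ in workers:                        # one shutdown sentinel per worker
        tasks.put(None)
    outputs = [results.get() for _ in documents]   # drain results before joining
    for w in workers:
        w.join()
    return outputs

if __name__ == "__main__":
    print(leader(["some unstructured text", "another record"]))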
ISBN: (Print) 9798331506476
Near DRAM processing (NDP) architectures have emerged as a promising solution for commercializing in-memory computing and addressing the 'memory wall' problem, especially for memory-intensive machine learning (ML) workloads. In NDP architectures, the processing units (PUs) are distributed next to different memory units to exploit the high internal bandwidth. Therefore, in order to fully utilize the bandwidth advantage of NDP architectures for ML applications, meticulous evaluation and optimization of data placement in DRAM and workload scheduling among different PUs are required. However, existing simulation and compilation tools face two insuperable obstacles to achieving these targets. On the one hand, tools for traditional von Neumann architectures focus only on the data access behavior between the host and DRAM and treat DRAM as a single monolithic unit, so they cannot support NDP architectures with multiple independent processing and memory units working simultaneously. On the other hand, existing NDP simulators and compilers are designed for a specific DRAM technology and NDP architecture, lacking compatibility with various NDP architectures. In order to overcome these challenges and optimize data mapping and workload scheduling for different NDP architectures, we propose UniNDP, a unified NDP compilation and simulation tool for ML applications. Firstly, we propose a unified tree-based NDP hardware abstraction and the corresponding instruction set, enabling support for various NDP architectures based on different DRAM technologies. Secondly, we design a cycle-accurate, instruction-driven NDP simulator that evaluates hardware performance by accurately tracking the working status of memory elements and PUs; this accurate simulation provides effective guidance for compilation. Thirdly, we design an NDP compiler that optimizes data partitioning, mapping, and workload scheduling across the DRAM hierarchy. Furthermore, to enhance compilation efficiency, we propose...
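The tree-based hardware abstraction can be pictured as nested hardware levels (e.g. channel, rank, bank) with PUs attached at the leaves. The sketch below is an assumed illustration of such a hierarchy, not UniNDP's actual data structures or instruction set; all class and field names are hypothetical.

# Assumed illustration of a tree-based NDP hardware abstraction:
# each node is one hardware level; leaves hold the attached processing units (PUs).
class HWNode:
    def __init__(self, level, index, children=None, pu=None):
        self.level = level        # e.g. "channel", "rank", "bank"
        self.index = index
        self.children = children or []
        self.pu = pu              # a PU placed next to this memory unit, if any

    def leaves(self):
        if not self.children:
            return [self]
        return [leaf for child in self.children for leaf in child.leaves()]

def build_tree(channels=2, ranks=2, banks=4):
    # One PU per bank, mirroring PUs distributed next to the memory units.
    return HWNode("device", 0, [
        HWNode("channel", c, [
            HWNode("rank", r, [
                HWNode("bank", b, pu=f"pu_{c}_{r}_{b}")
                for b in range(banks)])
            for r in range(ranks)])
        for c in range(channels)])

tree = build_tree()
print(len(tree.leaves()))  # 16 banks, each with an attached PU

A compiler walking such a tree can enumerate the PUs per level and decide, per level, how to partition tensors and schedule work, which is the kind of mapping/scheduling decision the abstract describes.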
ISBN: (Print) 9798331506476
Fully homomorphic encryption (FHE) is a powerful cryptographic technique that enables computation on encrypted data without needing to decrypt it. It has broad applications in scenarios where sensitive data must be processed in the cloud or in other untrusted environments. FHE applications are both compute- and memory-intensive, owing to expensive operations on large data. While prior works address the challenge of efficient computation using dedicated hardware, expensive memory transfers remain a major limiting factor. In this work, we propose a hierarchical near-DRAM processing (NDP) solution for FHE applications, called FHENDI, that harnesses the massive DRAM bank bandwidth. We observe various data access patterns in FHE that reveal distinct levels of parallelism: element-wise, limb-wise, coefficient-wise, and ciphertext-wise. FHENDI exploits these levels of parallelism to map FHE operations and data onto different hierarchies of our design, while addressing three major challenges with NDP for FHE: (i) the lack of bank-to-bank communication support, (ii) limited die-to-die bandwidth, and (iii) large memory access latencies. We resolve the first problem through a novel, conflict-free mapping algorithm built atop localized permutation networks that enables efficient element-wise and butterfly operations in FHE. The second problem is addressed by pipelining the execution of parallel bootstrap operations observed in compiled FHE workloads. Finally, we hide the memory access latency behind computation latency by exploiting a dual-banking scheme and subarray-level parallelism (SLP) of the DRAM banks. We evaluate FHENDI using representative workloads in the domains of privacy-preserving machine learning inference on CNNs and Transformers, database range queries, and sorting, obtained using a compiler framework called HElayers. We compare FHENDI with a server-class CPU and GPU running the state-of-the-art HEaaN library, and with an FHE accelerator ASIC, and report...
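The conflict-free mapping idea can be sketched as placing the two operands of every butterfly in different banks via a skewed (permuted) coefficient-to-bank layout. The code below is only an assumed illustration of bank-conflict-free placement using a simple XOR-skew, not FHENDI's actual permutation networks or mapping algorithm; all names and parameters are hypothetical.

# Assumed sketch: a skewed coefficient-to-bank layout so that the two operands
# of every butterfly in a stage land in different banks (no bank conflict).
NUM_BANKS = 8

def bank_of(coeff_index):
    # Skewed placement: XOR-fold higher bits into the low bits before taking
    # the bank number, a common trick to spread strided accesses over banks.
    return (coeff_index ^ (coeff_index >> 3)) % NUM_BANKS

def butterfly_pairs(n, stride):
    # Index pairs touched by one NTT/FFT-style butterfly stage with the given stride.
    return [(i, i + stride) for i in range(n) if (i // stride) % 2 == 0]

# For a stride equal to the bank count, a naive (i % NUM_BANKS) layout puts both
# operands of every pair in the same bank, while the skewed layout separates them.
pairs = butterfly_pairs(64, NUM_BANKS)
naive_conflicts = sum(a % NUM_BANKS == b % NUM_BANKS for a, b in pairs)
skewed_conflicts = sum(bank_of(a) == bank_of(b) for a, b in pairs)
print(naive_conflicts, skewed_conflicts)  # 32 conflicting pairs vs. 0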
ISBN: (Print) 9781424416943
The proceedings contain 461 papers. The topics discussed include: how to make discretionary access control secure against trojan horses; random number generation for serial, parallel, distributed, and grid-based financial computations; mobility control schemes with quick convergence in wireless sensor networks; design and implementation of a tool for modeling and programming deadlock free meta-pipeline applications; analytic performance models for bounded queuing systems; on the construction of paired many-to-many disjoint path covers in hypercube-like interconnection networks with faulty elements; a scalable configurable architecture for the massively parallel GCA model; state management for distributed python applications; a fault-tolerant system for Java/CORBA objects; and improving data availability for a cluster file system through replication.
ISBN: (Print) 9781424437504
The proceedings contain 362 papers. The topics discussed include: uniform scattering of autonomous mobile robots in a grid; performance study of interference on sharing GPU and CPU resources with multiple applications; resource allocation strategies for constructive in-network stream processing; deciding model of population size in time-constrained task scheduling; improving accuracy of host load predictions on computational grids by artificial neural networks; combining multiple heuristics on discrete resources; predictive analysis and optimization of pipelined wavefront computations; RSA encryption and decryption using the redundant number system on the FPGA; computation with a constant number of steps in membrane computing; analytical model of inter-node communication under multi-versioned coherence mechanisms; and a distributed approach for the problem of routing and wavelength assignment in WDM networks.
ISBN: (Print) 9781665435772
The proceedings contain 117 papers. The topics discussed include: resource elasticity at task-level; evaluation of vertex reordering for graph applications; on the predictability of quantum circuit fidelity using machine learning; improving the operational capability of automated empirical performance modeling; development of a middleware to create an efficient unified programming model for heterogeneous computing; task-level checkpointing for nested fork-join programs; verifiable coded computing: towards fast and secure distributed computing; hierarchical cost analysis for distributed deep learning; pattern-aware vectorization for sparse matrix computations; and heterogeneity-aware deep learning workload deployments on the computing continuum.