The proceedings contain 32 papers from the 16th Symposium on Computer Architecture and High Performance Computing. The topics discussed include: self-monitored adaptive cache warm up for microprocessor simulation; the eDRAM based L3-cache of the BlueGene/L supercomputer processor node; multi-profile instruction based compression; a study of errant pipeline flushes caused by value misspeculation; design space exploration using T&D-bench; value predictors for reuse through speculation on traces; optimizations for compiled simulation using instruction type information; and high performance communication system based on generic programming.
ISBN: (print) 9781509048441
The proceedings contain 16 papers. The topics discussed include: outline of a thick control flow architecture; a dynamic load balance algorithm for the S4 parallel stream processing engine; a processor workload distribution algorithm for massively parallel applications; parallelism and scalability: a solution focused on the cloud computing processing service billing; task scheduling in Sucuri dataflow library; synchronization-free automatic parallelization for arbitrarily nested affine loops; thread footprint analysis for the design of multithreaded applications and multicore systems; a hybrid parallel algorithm for the auction algorithm in multicore systems; and dataflow to hardware synthesis framework on FPGAs.
ISBN: (print) 9783031785405; 9783031785412
Community detection is the problem of finding naturally forming clusters in networks. It is an important problem in mining and analyzing social and other complex networks. Community detection can be used to analyze complex systems in the real world and has applications in many areas, including network science, data mining, and computational biology. Label propagation is a community detection method that is simpler and faster than other methods such as Louvain, InfoMap, and spectral-based approaches. Some real-world networks can be very large and have billions of nodes and edges. Sequential algorithms might not be suitable for dealing with such large networks. This paper presents distributed-memory and hybrid parallel community detection algorithms based on the label propagation method. We incorporated novel optimizations and communication schemes, leading to very efficient and scalable algorithms. We also discuss various load-balancing schemes and present their comparative performances. These algorithms have been implemented and evaluated using large high-performance computing systems. Our hybrid algorithm is scalable to thousands of processors and has the capability to process massive networks. This algorithm was able to detect communities in the Metaclust50 network, a massive network with 282 million nodes and 42 billion edges, in 654 s using 4096 processors.
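For readers unfamiliar with the label propagation method named in this abstract, the following is a minimal sequential sketch in Python. It is not the distributed-memory or hybrid algorithm the paper presents, and it includes none of the paper's optimizations, communication schemes, or load-balancing schemes. The function name `label_propagation`, its parameters, and the adjacency-dictionary input format are assumptions chosen for illustration.

```python
import random
from collections import Counter


def label_propagation(adj, max_iters=100, seed=0):
    """Sequential label propagation (illustrative sketch only).

    adj: dict mapping each node to an iterable of its neighbours
         (hypothetical input format, undirected graph assumed).
    Returns a dict mapping each node to its community label.
    """
    rng = random.Random(seed)
    labels = {v: v for v in adj}      # every node starts in its own community
    nodes = list(adj)
    for _ in range(max_iters):
        rng.shuffle(nodes)            # visit nodes in random order each sweep
        changed = False
        for v in nodes:
            if not adj[v]:
                continue              # isolated node keeps its own label
            counts = Counter(labels[u] for u in adj[v])
            best = max(counts.values())
            candidates = sorted(l for l, c in counts.items() if c == best)
            if labels[v] in candidates:
                continue              # current label is already a most frequent one
            labels[v] = rng.choice(candidates)  # break ties at random
            changed = True
        if not changed:               # stable: no node changed in a full sweep
            break
    return labels


# Toy graph: two disconnected triangles, {0, 1, 2} and {3, 4, 5}.
graph = {
    0: [1, 2], 1: [0, 2], 2: [0, 1],
    3: [4, 5], 4: [3, 5], 5: [3, 4],
}
communities = label_propagation(graph)
print(communities)
```

On this toy graph each triangle ends up with a single shared label, and the two triangles receive different labels. Each sweep only reads the labels of a node's neighbours, which is what makes the method simple and cheap per iteration compared with the other approaches the abstract mentions.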
The article discusses various reports published within the issue, including one on a dual-thread speculation system and another on a parallel version of the Tricluster algorithm.
ISBN: (print) 076952446X
The proceedings contain 31 papers. The topics discussed include: high performance computing in science and engineering; towards grid implementations of metaheuristics for hard combinatorial optimization problems; a new multi-processor architecture for parallel lazy cyclic reference counting; reconfigurable optical interconnection system supporting concurrent application-specific parallel computing; managing the execution of large scale MPI applications on computational grids; function outlining and partial inlining; reusing traces in a dynamic conditional execution architecture; cooperation of neighboring PEs in clustered architectures; a new parallel environment for interactive simulations implementing safe multithreading with MPI; a time Petri-net-based approach for software synthesis considering overheads; and analyzing and improving clustering based sampling for microprocessor simulation.
ISBN: (print) 0769527043
The proceedings contain 22 papers. The topics discussed include: towards production code effective portability among vector machines and microprocessors-based architectures; data segmentation management infrastructure in a database grid; detecting malicious manipulation in grid environments; policy-based resource allocation in hierarchical virtual organizations for global grids; a speculative trace reuse architecture with reduced hardware requirements; controlling the power and area of neural branch predictors for practical implementation in high-performance processors; a run-time system for efficient execution of scientific workflows on distributed environments; dual-thread speculation: two threads in the machine are worth eight in the bush; characterizing the performance of data management systems on hyper-threaded architectures; and ultra-fast CPU performance prediction: extending the Monte Carlo approach.
ISBN: (print) 9798350381603
The proceedings contain 18 papers. The topics discussed include: exploring federated learning to trace depression in social media with language models; computing seismic attributes with deep-learning models; DASS: dynamic adaptive sub-target specialization; optimizing microservices performance and resource utilization through containerized grouping: an experimental study; assessing the performance of an architecture-aware optimization tool for neural networks; an exploratory study of deep learning for predicting computational tasks behavior in HPC systems; and energy consumption analysis of instruction cache prefetching methods.
ISBN: (print) 9798350305487
The proceedings contain 23 papers. The topics discussed include: performance modeling and estimation of a configurable output stationary neural network accelerator; NeurOPar, a neural network-driven EDP optimization strategy for parallel workloads; exploiting the potential of flexible processing units; reverse time migration with lossy and lossless wavefield compression; performance tuning for GPU-embedded systems: machine-learning-based and analytical model-driven tuning methodologies; WCSim: a cloud computing simulator with support for bag of tasks workflows; performance modeling of MARE2DEM’s adaptive mesh refinement for makespan estimation; and comparing performance and portability between CUDA and SYCL for protein database search on NVIDIA, AMD, and Intel GPUs.
The Nested Neutral Point Clamped (NNPC) converter, functioning as a Voltage Source Converter (VSC), provides an effective solution for applications requiring Medium-Voltage and High-Power (MVHP). Earlier implementatio...