检索结果-内蒙古大学图书馆

International Conference on Image Processing and Vision Engineering (IMPROVE)

作者： Ibrahim, Nahla M. Abou ElFarag, Ahmed Kadry, Rania Arab Acad Sci & Technol & Maritime Transport Dept Comp Engn Alexandria Egypt

ISBN: (纸本)9789897585111

Two dimensional 2D convolution is one of the most complex calculations and memory intensive algorithms used in image processing. In our paper, we present the 2D convolution algorithm used in the Gaussian blur which is a filter widely used for noise reduction and has high computational requirements. Since, single threaded solutions cannot keep up with the performance and speed needed for image processing techniques. Therefore, parallelizing the image convolution on parallel systems enhances the performance and reduces the processing time. This paper aims to give an overview on the performance enhancement of the parallel systems on image convolution using Gaussian blur algorithm. We compare the speed up of the algorithm on two parallel systems: multi-core central processing unit CPU and graphics processing unit GPU using Google Colaboratory or "colab".

关键词： CUDA parallel computing Image Convolution Gaussian Blur Google Colaboratory

来源：评论

学校读者我要写书评

暂无评论

parallel computing with R: A brief review

引用

WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS 2021年第2期13卷 e1515-e1515页

作者： Eddelbuettel, Dirk Univ Illinois Urbana IL 61801 USA

parallel computing has established itself as another standard method for applied research and data analysis. The R system, being internally constrained to mostly singly-threaded operations, can nevertheless be used along with different parallel computing approaches. This brief review covers OpenMP and Intel TBB at the CPU- and compiler level, moves to process-parallel approaches before discussing message-passing parallelism and big data technologies for parallel processing such as Spark, Docker and Kubernetes before concluding with a focus on the future package integrating many of these approaches. This article is categorized under: Algorithms and Computational Methods > Methods for High Performance computing Software for Computational Statistics > Software/Statistical Software Software for Computational Statistics > High Performance Software

关键词： Kubernetes R OpenMP OpenMPI parallel computing r spark

来源：评论

学校读者我要写书评

暂无评论

Management of Wind Power Variations in Electricity System Investment Models: A parallel computing Strategy

引用

Operations Research Forum 2021年第2期2卷 25页

作者： Göransson, Lisa Granfeldt, Caroline Strömberg, Ann-Brith Department of Space Earth and Environment Chalmers University of Technology Gothenburg Sweden Department of Mathematical Sciences Chalmers University of Technology and University of Gothenburg Gothenburg Sweden

Accounting for variability in generation and load and strategies to tackle variability cost-efficiently are key components of investment models for modern electricity systems. This work presents and evaluates the Hours-to-Decades (H2D) model, which builds upon a novel approach to account for strategies to manage variations in the electricity system covering several days, the variation management which is of particular relevance to wind power integration. The model discretizes the time dimension of the capacity expansion problem into 2-week segments, thereby exploiting the parallel processing capabilities of modern computers. Information between these segments is then exchanged in a consensus loop. The method is evaluated with regard to its ability to account for the impacts of strategies to manage variations in generation and load, regional resources and trade, and inter-annual linkages. Compared to a method with fully connected time, the proposed method provides solutions with an increase in total system cost of no more than 1.12%, while reducing memory requirements to 1/26’th of those of the original problem. For capacity expansion problems concerning two regions or more, it is found that the H2D model requires 1–2% of the calculation time relative to a model with fully connected time when solved on a computer with parallel processing capability. © 2021, The Author(s).

关键词： Capacity expansion model Consensus algorithm Electricity system model Flexibility measures Hours-to-Decades model parallel computing Variation management Wind power integration

来源：评论

学校读者我要写书评

暂无评论

Fault-Tolerant Computation Meets Network Coding: Optimal Scheduling in parallel computing

Fault-Tolerant Computation Meets Network Coding: Optimal Sch...

引用

IEEE Global Communications Conference (GLOBECOM)

作者： Li, Congduan Tan, Chee Wei Li, Jingting Chen, Siya Sun Yat Sen Univ Sch Elect & Commun Engn Shenzhen Peoples R China City Univ Hong Kong Dept Comp Sci Hong Kong Peoples R China

ISBN: (纸本)9781728181042

We propose an optimal scheduling strategy to enable fault-tolerant reliable computation to protect the integrity of computation. Specifically, we determine the optimal redundancy-failure rate tradeoff to incorporate redundancy into parallel computing units running multiple-precision arithmetic that are useful for applications such as asymmetric cryptography and fast integer multiplication. Inspired by network coding, we propose coding matrices to strategically map partial computation to available computing units, so that the central unit can reliably reconstruct the results of any failed machine without recalculations to yield the final correct computation output. We propose optimization-based algorithms to efficiently construct the optimal coding matrices subject to fault tolerance specifications. Performance evaluation demonstrates that the optimal scheduling effectively reduces the overall running time of parallel computing while resisting wide-ranging failure rates.

关键词： Fault Tolerant Control parallel computing Network Coding

来源：评论

学校读者我要写书评

暂无评论

Queuing parallel computing CAD Tasks in the Design and Optimization of IC Topography 28

Queuing Parallel Computing CAD Tasks in the Design and Optim...

引用

28th International Conference on Mixed Design of Integrated Circuits and System (MIXDES)

作者： Wojtasik, Adam Warsaw Univ Technol Inst Microelect & Optoelect Warsaw Poland

ISBN: (纸本)9788363578190

In recent years, the development of personal computers hardware has been aimed at increasing the number of processor cores. At the same time, the efficiency and reliability of computer interconnecting networks is increased. This enables the introduction and development of parallel and distributed processing methods also in CAD systems. The paper presents the problems related to the parallelization of the computational process and the methods of solving them, based on the example of typical computational experiments used in the design and optimization of integrated circuits topography.

关键词： parallel computing distributed computing CAD integrated circuit design

来源：评论

学校读者我要写书评

暂无评论

K-way spectral graph partitioning for load balancing in parallel computing

引用

International Journal of Information Technology (Singapore) 2021年第5期13卷 1893-1900页

作者： Patil, S.V. Kulkarni, D.B. Research Scholar Walchand College of Engineering (ADCET Ashta) Sangli India Professor Walchand College of Engineering Sangli India

A domain of problem-solving models the problems using graphs, for the graphs are effective representation of such problems, leading to their efficient solutions. The nodes in a graph represent a division of unit work—the computation, and the connecting edges represent communication required among the nodes to accomplish that unit work. The weight is assigned to the nodes and connecting edges for the cost incurred to compute and to collaborate, respectively. Graph partitioning exploits the concurrency in the problem being modeled and maps the problem onto parallel processors to guarantee efficient and load-balanced execution. The objective is to—(i) equally distribute the computations on available computing power (parallel processors) and (ii) minimize the cost of collaboration. To achieve the said objectives for any complex problem, the spectral graph partitioning is demonstrated here—that uses eigenvectors of the graph’s laplacian matrix. The results are tested via the realization of the stochastic block model. The quality of graph partitioning is tested by comparing it with ground truth results. Further, for a large-scale graph, the parallel implementation of spectral graph partitioning on GPGPU is presented. The GPGPU implementation provides better speedup with scalability. © 2021, Bharati Vidyapeeth's Institute of Computer Applications and Management.

关键词： General purpose graphics processing unit (GPGPU) Graph partitioning Load balancing parallel computing Spectral clustering

来源：评论

学校读者我要写书评

暂无评论

parallel computing for Fast Spatiotemporal Weighted Regression

引用

COMPUTERS & GEOSCIENCES 2021年 150卷 104723-104723页

作者： Que, Xiang Ma, Chao Ma, Xiaogang Chen, Qiyu Fujian Agr & Forestry Univ Comp & Informat Coll Fuzhou Fujian Peoples R China Univ Idaho Dept Comp Sci 875 Perimeter Dr MS 1010 Moscow ID 83844 USA Chengdu Univ Technol State Key Lab Oil & Gas Reservoir Geol & Exploita Chengdu 610059 Peoples R China China Univ Geosci Wuhan Sch Comp Sci 388 Lumo Rd Wuhan 430074 Peoples R China

The Spatiotemporal Weighted Regression (STWR) model is an extension of the Geographically Weighted Regression (GWR) model for exploring the heterogeneity of spatiotemporal processes. A key feature of STWR is that it utilizes the data points observed at previous time stages to make better fit and prediction at the latest time stage. Because the temporal bandwidths and a few other parameters need to be optimized in STWR, the model calibration is computationally intensive. In particular, when the data amount is large, the calibration of STWR becomes heavily time-consuming. For example, with 10,000 points in 10 time stages, it takes about 2307 s for a single-core PC to process the calibration of STWR. Both the distance and the weighted matrix in STWR are memory intensive, which may easily cause memory insufficiency as data amount increases. To improve the efficiency of computing, we developed a parallel computing method for STWR by employing the Message Passing Interface (MPI). A cache in the MPI processing approach was proposed for the calibration routine. Also, a matrix splitting strategy was designed to address the problem of memory insufficiency. We named the overall design as Fast STWR (F-STWR). In the experiment, we tested F-STWR in a High-Performance computing (HPC) environment with a total number of 204,611 observations in 19 years. The results show that F-STWR can significantly improve STWR's capability of processing large-scale spatiotemporal data.

关键词： Spatiotemporal weighted regression parallel computing Geographically weighted regression Spatial analysis Spatiotemporal non-stationarity

来源：评论

学校读者我要写书评

暂无评论

MiniCAR: Minimal Congestion-aware Routing Method in Fine-grained Circuit-switched Networks for parallel computing Systems 26

MiniCAR: Minimal Congestion-aware Routing Method in Fine-gra...

引用

26th IEEE Symposium on Computers and Communications (IEEE ISCC)

作者： Hu, Yao Natl Inst Informat Informat Syst Architecture Sci Res Div Tokyo Japan

ISBN: (纸本)9781665427449

In parallel high-performance computing (HPC) systems, network congestion is one of the main factors to degrade communication performance, because it may lead to increased end-to-end latency and power consumption. In this work, we address this issue by the means of a simple routing algorithm on a target fine-grained circuit-switched (FGCS) network. The number of allocated slots for each FGCS switch in the network is a direct factor to affect the end-to-end latency. Our proposed approach employs a minimal congestion-aware routing (MiniCAR) method to perform better routing decisions and alleviate the network congestion so that the minimum necessary number of slots can be reduced in a target FGCS network. Evaluation results show that, compared to the traditional dimension order routing algorithm, MiniCAR occupies a smaller number of time slots by up to 50.8% on a 2-D torus interconnection network.

关键词： parallel computing circuit-switched network congestion-aware routing

来源：评论

学校读者我要写书评

暂无评论

Metaphor-less Rao-3 and artificial neural network with parallel computing-based wheeling pricing in competitive power market

引用

COGENT ENGINEERING 2024年第1期11卷

作者： Saxena, Abhishek Pandey, Seema N. Dixit, Shishir Madhav Inst Sci & Technol Dept Elect Engn Gwalior India Dr Bhim Rao Ambedkar Polytech Coll Dept Elect Engn Gwalior India

Fast and accurate wheeling pricing has emerged as an important issue in the recent competitive power market. Embedded cost-based wheeling pricing is well accepted by power market, because it is based on actual flow of power wheeled by them. It also recovers fully the fixed cost of wheeling facility installation and operation. In this article, metaphor-less Rao-3-based ACOPF, MVA-mile method and Bialek tracing has been employed to compute wheeling prices across various generators and loads. In actual power market due to continuously varying load conditions, the computation of wheeling prices is quite a time taking process. Because for computing wheeling prices, the optimal power flow (OPF) program has to be run each time for every loading condition. In this scenario, the artificial neural network (ANN) approach has been found to be very useful, to estimate wheeling prices instantly and accurately for any unseen loading scenario. Here, a number of ANNs have been developed under parallel computing environment. This article presents a metaphor-less Rao-3-based approach to project wheeling prices in the competitive power market by developing a new radial basis function neural network (RBFNN). The present work of wheeling pricing has been demonstrated and examined on IEEE 30-bus system.

关键词： Bialek tracing radial basis function neural network (RBFNN) MVA-mile method parallel computing wheeling pricing metaphor-Less rao-3 algorithm Qingsong Ai, Senior Editor, Wuhan University of Technology, CHINA Technology Engineering & Technology Electrical & Electronic Engineering Engineering Economics Industrial Engineering & Manufacturing

来源：评论

学校读者我要写书评

暂无评论

A parallel Volunteer computing System Based on Server Assisted Communication

引用

IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING 2025年第0期

作者： Inohara, Keiichi Kurokawa, Yota Fukushi, Masaru Yamaguchi Univ Grad Sch Sci & Technol Innovat Ube Yamaguchi 7550097 Japan

Volunteer computing (VC) is one of the distributed computing paradigms, which exploits idle computing resources provided by vast amount of users on the Internet. In VC, individual nodes are usually unable to communicate with each other directly;therefore, current VC supports only bag-of-tasks computation, and this prevents widespread use of VC. Toward the realization of parallel VC, this paper proposes a parallel VC system based on the concept of server assisted communication. The proposed method replaces inter-node communication with a pair of two request-driven communication between sender/server and server/receiver. In the proposed parallel VC system, a VC server consists of an Apache web server and a MySQL database server, to ease the implementation of multi-threaded communication and stable and efficient data-management functions. A software tool is also developed to convert a parallel program written with a common MPI communication library into a program with a standard socket library with HTTP protocol. To demonstrate the feasibility of the proposed system, we have implemented the parallel VC system and evaluated the execution time of basic communication functions and parallel programs in NAS parallel benchmarks. The results show that the execution time of basic communication functions is acceptable for the practical use of VC and benchmark programs are successfully executed on the proposed systems, demonstrating the feasibility of parallel computation in VC environments. (c) 2025 Institute of Electrical Engineers of Japan and Wiley Periodicals LLC.

关键词： volunteer computing parallel computing message passing interface (MPI) server assisted communication

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：