检索结果-内蒙古大学图书馆

Supercomputing-Enabled First-Principles Analysis of Radio Wave Propagation in Urban Environments

IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION 2018年第12期66卷 6606-6617页

作者： MacKie-Mason, Brian Shao, Yang Greenwood, Andrew Peng, Zhen Univ New Mexico Dept Elect & Comp Engn Appl Electromagnet Grp Albuquerque NM 87131 USA Argonne Natl Lab Argonne Leadership Comp Facil Lemont IL 60439 USA US Air Force Res Lab Kirtland AFB Albuquerque NM 87123 USA

Wireless communications are expected to take place in increasingly complicated scenarios, such as dense urban, forest, tunnel, and other cluttered environments. A key emerging challenge is to understand the physics and characteristics of wave propagation in these environments, which is critical for the analysis, design, and application of advanced mobile and wireless communication systems. In this paper, we present a full-wave field-based computational methodology for radio wave propagation in complex urban environments. Both transmitting/receiving antennas and propagation environments are modeled by first-principles calculations. A system-level, large scene analysis is enabled by the scalable, ultraparallel algorithms on the emerging high-performance computing platforms. The proposed computational framework is verified and validated with semianalytical models and representative measurements.

关键词： Communication channel domain decomposition (DD) parallel algorithms propagation

来源：评论

学校读者我要写书评

暂无评论

DISTRIBUTED ONE-STAGE HESSENBERG-TRIANGULAR REDUCTION WITH WAVEFRONT SCHEDULING

引用

SIAM JOURNAL ON SCIENTIFIC COMPUTING 2018年第2期40卷 C157-C180页

作者： Adlerborn, Bjorn Karlsson, Lars Kagstrom, Bo Umea Univ Dept Comp Sci SE-90187 Umea Sweden Umea Univ HPC2N SE-90187 Umea Sweden

A novel parallel formulation of Hessenberg-triangular reduction of a regular matrix pair on distributed memory computers is presented. The formulation is based on a sequential cacheblocked algorithm by K degrees agstrom et al. [BIT, 48 (2008), pp. 563 584]. A static scheduling algorithm is proposed that addresses the problem of underutilized processes caused by two-sided updates of matrix pairs based on sequences of rotations. Experiments using up to 961 processes demonstrate that the new formulation is an improvement of the state of the art and also identify factors that limit its scalability.

关键词： generalized eigenvalue problem Hessenberg-triangular reduction parallel algorithms wavefront scheduling

来源：评论

学校读者我要写书评

暂无评论

Automated electron-optical system optimization through switching Levenberg-Marquardt algorithms

引用

JOURNAL OF ELECTRON SPECTROSCOPY AND RELATED PHENOMENA 2018年 227卷 31-39页

作者： Koh, Jin Ming Cheong, Kang Hao Singapore Inst Technol Engn Cluster 10 Dover Dr S-138683 Singapore Singapore

In the current study, we demonstrate an automated optimization method based on the Levenberg-Marquardt algorithm for electron-optical systems, incorporating an adaptive merit function switching process that enhances minimization convergence. The algorithm is successfully applied to three energy spectrometer designs-a radial mirror analyzer, a parallel radial mirror analyzer, and a parallel magnetic sector analyzer-by first implementing practical modifications to the device geometries. We then optimize the key design parameters to yield good focusing optics. The robustness of the method towards starting configuration is also demonstrated. The procedure can greatly enhance efficiency in the design process of electron-optical systems.

关键词： ELECTRON optics STOCHASTIC convergence ROBUST statistics parallel computers parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Convergence of sequential and parallel one-node CMFD accelerations for neutron transport analysis

Convergence of sequential and parallel one-node CMFD acceler...

引用

AISTech 2018 Iron and Steel Technology Conference and Exposition

作者： Kim, HyeonTae Kim, Yonghee Daejeon34141 Korea Republic of

ISBN: (纸本)9781935117728

In this study, the convergence rates of the two one-node CMFDs were evaluated for the sequential and the parallel algorithm. The results from the Fourier analyses and the numerical simulations showed good agreement, except for the parallel algorithm in optically thin region as an effect of boundary condition is appeared. The one-node CMFD showed instabilities in both sequential and the parallel algorithm with large scattering ratio. On the other hand, the one-node pCMFD was everywhere stable according to this study. From this result, the one-node pCMFD was found to be a better option for the parallel algorithm. © 2018 by AIST.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

On the design of hardware-software architectures for frequent itemsets mining on data streams

引用

JOURNAL OF INTELLIGENT INFORMATION SYSTEMS 2018年第3期50卷 415-440页

作者： Bustio-Martinez, Lazaro Cumplido, Rene Hernandez-Leon, Raudel Bande-Serrano, Jose M. Feregrino-Uribe, Claudia Natl Inst Astrophys Opt & Elect Dept Comp Sci Luis Enrique Erro 1 Puebla 72840 Mexico Adv Technol Applicat Ctr 7a 21406 Havana 12200 Cuba

Frequent Itemsets Mining has been applied in many data processing applications with remarkable results. Recently, data streams processing is gaining a lot of attention due to its practical applications. Data in data streams are transmitted at high rates and cannot be stored for offline processing making impractical to use traditional data mining approaches (such as Frequent Itemsets Mining) straightforwardly on data streams. In this paper, two single-pass parallel algorithms based on a tree data structure for Frequent Itemsets Mining on data streams are proposed. The presented algorithms employ Landmark and Sliding Window Models for windows handling. In the presented paper, as in other revised papers, if the number of frequent items on data streams is low then the proposed algorithms perform an exact mining process. On the contrary, if the number of frequent patterns is large the mining process is approximate with no false positives produced. Experiments conducted demonstrate that the presented algorithms outperform the processing time of the hardware architectures reported in the state-of-the-art.

关键词： Data Mining Frequent Itemsets Mining Data streams Reconfigurable Hardware parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Efficient Scalable Median Filtering Using Histogram-Based Operations

引用

IEEE TRANSACTIONS ON IMAGE PROCESSING 2018年第5期27卷 2217-2228页

作者： Green, Oded Georgia Inst Technol Coll Comp Atlanta GA 30332 USA

Median filtering is a smoothing technique for noise removal in images. While there are various implementations of median filtering for a single-core CPU, there are few implementations for accelerators and multi-core systems. Many parallel implementations of median filtering use a sorting algorithm for rearranging the values within a filtering window and taking the median of the sorted value. While using sorting algorithms allows for simple parallel implementations, the cost of the sorting becomes prohibitive as the filtering windows grow. This makes such algorithms, sequential and parallel alike, inefficient. In this work, we introduce the first software parallel median filtering that is non-sorting-based. The new algorithm uses efficient histogram-based operations. These reduce the computational requirements of the new algorithm while also accessing the image fewer times. We show an implementation of our algorithm for both the CPU and NVIDIA's CUDA supported graphics processing unit (GPU). The new algorithm is compared with several other leading CPU and GPU implementations. The CPU implementation has near perfect linear scaling with a 3.7x speedup on a quad-core system. The GPU implementation is several orders of magnitude faster than the other GPU implementations for mid-size median filters. For small kernels, 3 x 3 and 5 x 5, comparison-based approaches are preferable as fewer operations are required. Lastly, the new algorithm is open-source and can be found in the OpenCV library.

关键词： Median filtering parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

GPU-accelerated algorithms for compressed signals recovery with application to astronomical imagery deblurring

引用

INTERNATIONAL JOURNAL OF REMOTE SENSING 2018年第7期39卷 2043-2065页

作者： Fiandrotti, Attilio Fosson, Sophie M. Ravazzi, Chiara Magli, Enrico Politecn Torino Dipartimento Elettron & Telecomunicaz Turin Italy

Compressive sensing promises to enable bandwidth-efficient on-board compression of astronomical data by lifting the encoding complexity from the source to the receiver. The signal is recovered off-line, exploiting graphical processing unit (GPU)'s parallel computation capabilities to speedup the reconstruction process. However, inherent GPU hardware constraints limit the size of the recoverable signal and the speedup practically achievable. In this work, we design parallel algorithms that exploit the properties of circulant matrices for efficient GPU-accelerated sparse signals recovery. Our approach reduces the memory requirements, allowing us to recover very large signals with limited memory. In addition, it achieves a 10-fold signal recovery speedup, thanks to adhoc parallelization of matrix-vector multiplications and matrix inversions. Finally, we practically demonstrate our algorithms in a typical application of circulant matrices: deblurring a sparse astronomical image in the compressed domain.

关键词： Signal restoration Speeding compressed signal algorithms large-signal signals images memory requirements circulation parallel algorithms deblurring matrix inversion

来源：评论

学校读者我要写书评

暂无评论

A heuristic relaxed extrapolated algorithm for accelerating PageRank

引用

ADVANCES IN ENGINEERING SOFTWARE 2018年 120卷 88-95页

作者： Migallon, Hector Migallon, Violeta Palomino, Juan A. Penades, Jose Univ Miguel Hernandez Dept Phys & Comp Architectures E-03202 Alicante Spain Univ Alicante Dept Comp Sci & Artificial Intelligence E-03071 Alicante Spain

The PageRank algorithm for determining the importance of Web pages has become a central technique in Web search. This algorithm uses the Power method to compute successive iterates that converge to the principal eigenvector of the Markov chain representing the Web link graph. In this work we present an effective heuristic Relaxed and Extrapolated algorithm based on the Power method that accelerates its convergence. A hybrid parallel implementation of this algorithm has been designed by combining various OpenMP threads for each MPI process and several strategies of data distribution among nodes have been analyzed. The results show that the proposed algorithm can significantly speed up the convergence time with respect to the parallel Power algorithm. (C) 2016 Civil-Comp Ltd. and Elsevier Ltd. All rights reserved.

关键词： PageRank parallel algorithms Power method Relaxation and extrapolation Shared memory Distributed memory

来源：评论

学校读者我要写书评

暂无评论

An efficient data exchange mechanism for chained network functions

引用

JOURNAL OF parallel AND DISTRIBUTED COMPUTING 2018年 114卷 1-15页

作者： Cerrato, Ivan Marchetto, Guido Risso, Fulvio Sisto, Riccardo Virgilio, Matteo Bonafiglia, Roberto Politecn Torino Dept Control & Comp Engn I-10129 Turin Italy

Thanks to the increasing success of virtualization technologies and processing capabilities of computing devices, the deployment of virtual network functions is evolving towards a unified approach aiming at concentrating a huge amount of such functions within a limited number of commodity servers. To keep pace with this trend, a key issue to address is the definition of a secure and efficient way to move data between the different virtualized environments hosting the functions and a centralized component that builds the function chains within a single server. This paper proposes an efficient algorithm that realizes this vision and that, by exploiting the peculiarities of this application domain, is more efficient than classical solutions. The algorithm that manages the data exchanges is validated by performing a formal verification of its main safety and security properties, and an extensive functional and performance evaluation is presented. (C) 2017 Elsevier Inc. All rights reserved.

关键词： parallel algorithms High speed packet processing Data exchange mechanism Network function virtualization

来源：评论

学校读者我要写书评

暂无评论

Numerical Stiffness Analysis for Solid Oxide Fuel Cell Real-Time Simulation Applications

引用

IEEE TRANSACTIONS ON ENERGY CONVERSION 2018年第4期33卷 1917-1928页

作者： Ma, Rui Li, Zhongliang Breaz, Elena Pascal, Briois Gao, Fei Northwestern Polytech Univ Xian 710072 Shaanxi Peoples R China CNRS Fuel Cell Lab FR 3539 F-90010 Belfort France Univ Bourgogne Franche Comte Dept Energy UMR 6174 FEMTO STCNRS F-90010 Belfort France Aix Marseille Univ CNRS UMR 7296 LSIS Lab F-13397 Marseille France CNRS FR 3539 Fuel Cell Lab FCLAB F-90010 Belfort France Tech Univ Cluj Napoca Cluj Napoca 400114 Romania Univ Bourgogne Franche Comte CNRS UMR 6174 FEMTO STDept MN2S F-25200 Montbeliard France

Real-time simulation is important for the fuel cell online diagnostics and hardware-in-the-loop tests before industrial applications. However, it is hard to implement real-time multidimensional, multiphysical fuel cell models due to the model numerical stiffness issues. In this paper, the numerical stiffness of a tubular solid oxide fuel cell real-time model is first analyzed to identify the perturbation ranges related to the fuel cell electrochemical, fluidic, and thermal domains. Some of the commonly used ordinary differential equation (ODE) solvers are then tested for the real-time simulation purpose. At last, a novel two-stage third-order parallel stiff ODE solver is proposed to improve the stability and reduce the multidimensional real-time fuel cell model execution time. To verify the proposed model and the ODE solver, real-time simulation experiments are carried out in a common embedded real-time platform. The experimental results show that the execution speed satisfies the requirement of real-time simulation. The solver stability under strong stiffness and the high model accuracy are also validated. The proposed real-time fuel cell model and the stiff ODE solver can also help to design the online diagnostic control method.

关键词： Fuel cell parallel algorithms real-time system stiffness

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：