检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

39 篇 会议
1 册 图书

馆藏范围

40 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

37 篇 工学
- 37 篇 计算机科学与技术...
- 32 篇 软件工程
- 1 篇 电气工程
- 1 篇 信息与通信工程
- 1 篇 生物医学工程（可授...
- 1 篇 生物工程
3 篇 理学
- 2 篇 数学
- 1 篇 生物学

主题

5 篇 parallel computi...
5 篇 parallel algorit...
4 篇 parallel program...
4 篇 high performance...
3 篇 scalability
2 篇 parallelization
2 篇 apache spark
2 篇 mpi
2 篇 data compression
2 篇 parallel encodin...
2 篇 hevc
2 篇 gpu
1 篇 multi-frontal me...
1 篇 biology computin...
1 篇 datalog
1 篇 parallel algorit...
1 篇 mumps
1 篇 pattern assembly
1 篇 performance mode...
1 篇 co-design

机构

1 篇 inst immunol & p...
1 篇 univ mostaganem ...
1 篇 department of sy...
1 篇 tsinghua univ de...
1 篇 barcelona superc...
1 篇 moe key lab mach...
1 篇 univ valladolid ...
1 篇 univ paris 13 la...
1 篇 artificial intel...
1 篇 tsinghua univ de...
1 篇 ural fed univ ek...
1 篇 texas tech univ ...
1 篇 univ tunis el ma...
1 篇 higher sch techn...
1 篇 univ houston hou...
1 篇 icar-cnr and uni...
1 篇 krasovskii inst ...
1 篇 qingdao natl lab...
1 篇 guangdong prov k...
1 篇 charles univ pra...

作者

2 篇 dos santos rodri...
2 篇 pinol pablo
2 篇 lopez-granado ot...
2 篇 migallon hector
2 篇 lobosco marcelo
1 篇 huang tao
1 篇 sozykin andrey
1 篇 garcia ana-barba...
1 篇 luis martinez jo...
1 篇 zhang jianlei
1 篇 ayres daniel l.
1 篇 cebrian-marquez ...
1 篇 gergel victor
1 篇 ito yasuaki
1 篇 santander-jimene...
1 篇 alfredo cuzzocre...
1 篇 zavoral filip
1 篇 zhang shuang
1 篇 khamzin svyatosl...
1 篇 llanos diego r.

语言

40 篇 英文

检索条件"任意字段=16th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2016"

共 40 条记录，以下是31-40 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

Improving the Performance of Cardiac Simulations in a Multi-GPU Architecture Using a Coalesced Data and Kernel Scheme 16th

Improving the Performance of Cardiac Simulations in a Multi-...

引用

16th international conference on algorithms and architectures for parallel processing (ica3pp)

作者： Cordeiro, Raphael Pereira Oliveira, Rafael Sachetto dos Santos, Rodrigo Weber Lobosco, Marcelo Univ Fed Juiz de Fora Programa Modelagem Computac Juiz de Fora Brazil Univ Fed Sao Joao del Rei Dept Ciencia Computacao Sao Joao del Rei Brazil

ISBN: (纸本)9783319495835;9783319495828

In this paper we evaluate a new coalesced data and kernel scheme used to reduce the execution costs of cardiac simulations that run on multi-GPU environments. the new scheme was tested for an important part of the simulator, the solution of the systems of Ordinary Differential Equations (ODEs). the results have shown that the proposed scheme is very effective. the execution time to solve the systems of ODEs on the multi-GPU environment was reduced by half, when compared to a scheme that does not implemented the proposed data and kernel coalescing. As a result, the total execution time of cardiac simulations was 25% faster.

关键词： High performance computing parallel computing Multi-GPU Cardiac electrophysiology Computational modeling

来源：评论

学校读者我要写书评

暂无评论

the Impact of Panel Factorization on the Gauss-Huard Algorithm for the Solution of Linear Systems on Modern architectures 16th

The Impact of Panel Factorization on the Gauss-Huard Algorit...

引用

16th international conference on algorithms and architectures for parallel processing (ica3pp)

作者： Catalan, Sandra Ezzatti, Pablo Quintana-Orti, Enrique S. Remon, Alfredo Univ Jaime I Dep Ingn & Ciencia Compuatc Castellon de La Plana 12701 Spain Univ Republica Inst Computat Montevideo 11300 Uruguay Max Planck Inst Dynam Complex Tech Syst D-30106 Magdeburg Germany

ISBN: (纸本)9783319495835;9783319495828

the Gauss-Huard algorithm (the GHA) is a specialized version of Gauss-Jordan elimination for the solution of linear systems that, enhanced with column pivoting, exhibits numerical stability and computational cost close to those of the conventional solver based on the LU factorization with row pivoting. Furthermore, the GHA can be formulated as a procedure rich in matrix multiplications, so that high performance can be expected on current architectures with multi-layered memories. Unfortunately, in principle the GHA does not admit the introduction of look-ahead, a technique that has been demonstrated to be rather useful to improve the performance of the LU factorization on multi-threaded platforms with high levels of hardware concurrency. In this paper we analyze the effect of this drawback on the implementation of the GHA on systems accelerated with graphics processing units (GPUs), exposing the roles of the CPU-to-GPU and single precision-to-double precision performance ratios, as well as the contribution from the operations in the algorithm's critical path.

关键词： Linear systems of equations Gauss-Huard algorithm LU factorization Multicore processors Graphics processing units (GPUs) Mixed precision High performance

来源：评论

学校读者我要写书评

暂无评论

Scaling DBSCAN-like algorithms for Event Detection Systems in Twitter 16th

Scaling DBSCAN-like Algorithms for Event Detection Systems i...

引用

16th international conference on algorithms and architectures for parallel processing (ica3pp)

作者： Capdevila, Joan Pericacho, Gonzalo Torres, Jordi Cerquides, Jesus Polytech Univ Catalonia UPC Dept Comp Architec Barcelona Spain Barcelona Supercomp Ctr BSC CNS Dept Comp Sci Barcelona Spain Artificial Intelligence Res Inst IIIA CSIC Barcelona Spain

ISBN: (纸本)9783319495835;9783319495828

the increasing use of mobile social networks has lately transformed news media. Real-world events are nowadays reported in social networks much faster than in traditional channels. As a result, the autonomous detection of events from networks like Twitter has gained lot of interest in both research and media groups. DBSCAN-like algorithms constitute a well-known clustering approach to retrospective event detection. However, scaling such algorithms to geographically large regions and temporarily long periods present two major shortcomings. First, detecting real-world events from the vast amount of tweets cannot be performed anymore in a single machine. Second, the tweeting activity varies a lot within these broad space-time regions limiting the use of global parameters. Against this background, we propose to scale DBSCAN-like event detection techniques by parallelizing and distributing them through a novel density-aware MapReduce scheme. the proposed scheme partitions tweet data as per its spatial and temporal features and tailors local DBSCAN parameters to local tweet densities. We implement the scheme in Apache Spark and evaluate its performance in a dataset composed of geo-located tweets in the Iberian peninsula during the course of several football matches. the results pointed out to the benefits of our proposal against other state-of-the-art techniques in terms of speed-up and detection accuracy.

关键词： Event detection parallel algorithm Data clustering DBSCAN MapReduce Apache Spark Twitter

来源：评论

学校读者我要写书评

暂无评论

A C++ generic parallel pattern interface for stream processing 16th

A C++ generic parallel pattern interface for stream processi...

引用

16th international conference on algorithms and architectures for parallel processing, ica3pp 2016

作者： Astorga, David del Rio Dolz, Manuel F. Sanchez, Luis Miguel Blas, Javier García Daniel García, J. Department of Computer Science University Carlos III of Madrid Leganés28911 Spain

ISBN: (纸本)9783319495828

Current parallel programming frameworks aid to a great extent developers to implement applications in order to exploit parallel hardware resources. Nevertheless, developers require additional expertise to properly use and tune them to operate on specific parallel platforms. On the other hand, porting applications between different parallel programming models and platforms is not straightforward and requires, in most of the cases, considerable efforts. Apart from that, the lack of highlevel parallel pattern abstractions in these frameworks increases even more the complexity for developing parallel applications. To pave the way in this direction, this paper proposes GrppI, a generic and reusable high-level parallel pattern interface for stream-based C++ applications. thanks to its high-level C++ API, this interface allows users to easily expose parallelism in sequential applications using already existing parallel frameworks, such as C++ threads, OpenMP and Intel TBB. We evaluate this approach using an image processing use case to demonstrate its benefits from the usability, flexibility, and performance points of view. © Springer international Publishing AG 2016.

关键词： parallel programming

来源：评论

学校读者我要写书评

暂无评论

A parallel Algorithm of Kirchhoff Pre-stack Depth Migration Based on GPU 1

引用

14th international conference on algorithms and architectures for parallel processing (ica3pp)

作者： Wang, Yida Li, Chao Tian, Yang Yan, Haihua Zhao, Changhai Zhang, Jianlei Beihang Univ Sch Comp Sci & Engn Beijing 100191 Peoples R China

ISBN: (数字)9783319111940

ISBN: (纸本)9783319111940;9783319111933

Kirchhoff pre-stack depth migration (KPSDM) algorithm, as one of the most widely used migration algorithms, plays an important part in getting the real image of the earth. However, this program takes considerable time due to its high computational cost;hence the working efficiency of the oil industry is affected. the general purpose Graphic processing Unit (GPU) and the Compute Unified Device Architecture (CUDA) developed by NVIDIA have provided a new solution to this problem. In this study, we have proposed a parallel algorithm of the Kirchhoff pre-stack depth migration and an optimization strategy based on the CUDA technology. Our experiments indicate that for large data computations, the accelerated algorithm achieves a speedup of 8 similar to 15 times compared with NVIDIA GPU.

关键词： Kirchhoff pre-stack depth migration GPU CUDA parallel algorithm optimization

来源：评论

学校读者我要写书评

暂无评论

A note on developing optimal and scalable parallel two-list algorithms

A note on developing optimal and scalable parallel two-list ...

引用

12th international conference on algorithms and architectures for parallel processing, ica3pp 2012

作者： Chedid, Fouad B. College of Arts and Applied Sciences Dhofar University Oman Department of Computer Science Notre Dame University - Louaize Lebanon

ISBN: (纸本)9783642330643

We show that developing an optimal parallelization of the two-list algorithm is much easier than we once thought. All it takes is to observe that the steps of the search phase of the two-list algorithm are closely related to the steps of a merge procedure for merging two sorted lists, and we already know how to parallelize merge efficiently. Armed with this observation, we present an optimal and scalable parallel two-list algorithm that is easy to understand and analyze, while it achieves the best known range of processor-time tradeoffs for this problem. In particular, our algorithm based on a CREW PRAM model takes time O(2n/2 - α) using 2α processors, for 0 ≤ α ≤ n/2 - 2logn + 2. © 2012 Springer-Verlag.

关键词： parallel processing systems

来源：评论

学校读者我要写书评

暂无评论

A verified library of algorithmic skeletons on evenly distributed arrays

A verified library of algorithmic skeletons on evenly distri...

引用

12th international conference on algorithms and architectures for parallel processing, ica3pp 2012

作者： Bousdira, Wadoud Loulergue, Frédéric Tesson, Julien LIFO University of Orléans France Kochi University of Technology Japan

ISBN: (纸本)9783642330773

To make parallel programming as widespread as parallel architectures, more structured parallel programming paradigms are necessary. One of the possible approaches are algorithmic skeletons. they can be seen as higher order functions implemented in parallel. Algorithmic skeletons offer a simple interface to the programmer without all the details of parallel implementations as they abstract the communications and the synchronisations of parallel activities. To write a parallel program, users have to combine and compose the skeletons. Orléans Skeleton Library (OSL) is an efficient meta-programmed C++ library of algorithmic skeletons that manipulate distributed arrays. A prototype implementation of OSL exists as a library written with the function parallel language Bulk Synchronous parallel ML (BSML). In this paper we are interested in verifying the correctness of a subset of this prototype implementation. To do so, we give a functional specification of a subset of OSL and we prove the correctness of the BSML implementation with respect to this functional specification, using the Coq proof assistant. To illustrate how the user could use these skeletons, we prove the correctness of two applications implemented with them. © 2012 Springer-Verlag.

关键词： parallel programming

来源：评论

学校读者我要写书评

暂无评论

An implementation of parallel 2-D FFT using intel AVX instructions on multi-core processors

An implementation of parallel 2-D FFT using intel AVX instru...

引用

12th international conference on algorithms and architectures for parallel processing, ica3pp 2012

作者： Takahashi, Daisuke Faculty of Engineering Information and Systems University of Tsukuba 1-1-1 Tennodai Tsukuba Ibaraki 305-8573 Japan

ISBN: (纸本)9783642330643

In this paper, we propose an implementation of a parallel two-dimensional fast Fourier transform (FFT) using Intel Advanced Vector Extensions (AVX) instructions on multi-core processors. the combination of vectorization and a block two-dimensional FFT algorithm is shown to effectively improve performance. We vectorized FFT kernels using the AVX instructions. Performance results of two-dimensional FFTs on multi-core processors are reported. We successfully achieved a performance of over 61 GFlops on an Intel Xeon E5-2670 (2.6 GHz, two CPUs, 16 cores) and over 24 GFlops on an Intel Core i7-3930K (3.2 GHz, one CPU, six cores) for a 212 x 212-point FFT. © 2012 Springer-Verlag.

关键词： Fast Fourier transforms

来源：评论

学校读者我要写书评

暂无评论

A massively parallel hardware for modular exponentiations using the m-ary method

A massively parallel hardware for modular exponentiations us...

引用

10th international conference algorithms and architectures for parallel processing, ica3pp 2010

作者： Farias, Marcos Santana De Souza Raposo, S. Nedjah, Nadia De Macedo Mourelle, L. Department of Electronics Engineering and Telecommunications Engineering Faculty State University of Rio de Janeiro Brazil Department of System Engineering and Computation Engineering Faculty State University of Rio de Janeiro Brazil

ISBN: (纸本)3642131352

Most of cryptographic systems are based on modular exponentiation. It is performed using successive modular multiplications. One way of improving the throughput of a cryptographic system implementation is reducing the number of the required modular multiplications. Existing methods attempt to reduce this number by partitioning the exponent in constant or variable size windows. In this paper, in the purpose of further accelerating the computation of modular exponentiation, a concurrent novel approach is proposed along with hardware implementation of the concurrent m-ary method. We compare the proposed method to the sequential implementation. © Springer-Verlag Berlin Heidelberg 2010.

关键词： Cryptography

来源：评论

学校读者我要写书评

暂无评论

algorithms and architectures for parallel processing 1

引用

丛书名： Lecture Notes in Computer Science

1000年

作者： Yang Xiang Wanlei Zhou Alfredo Cuzzocrea Michael Hobbs

ISBN: (数字)9783642246692

ISBN: (纸本)9783642246685

this two volume set LNCS 7016 and LNCS 7017 constitutes the refereed proceedings of the 11th international conference on algorithms and architectures for parallel processing, ica3pp 2011, held in Melbourne, Australia, in October 2011. the second volume includes 37 papers from one symposium and three workshops held together with ica3pp 2011 main conference. these are 16 papers from the 2011 international Symposium on Advances of Distributed Computing and Networking (ADCN 2011), 10 papers of the 4th IEEE international Workshop on Internet and Distributed Computing Systems (IDCS 2011), 7 papers belonging to the III international Workshop on Multicore and Multithreaded architectures and algorithms (M2A2 2011), as well as 4 papers of the 1st IEEE international Workshop on parallel architectures for Bioinformatics Systems (HardBio 2011).

关键词： Algorithm Analysis and Problem Complexity Artificial Intelligence Software Engineering Information Systems Applications (incl. Internet) Computer Communication Networks Management of Computing and Information Systems

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共4页 << < 1 2 3 4 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：