Search Results - Inner Mongolia University Library



Search query: "Any field = 8th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2008"

40 records in total; showing records 11-20


Implementation of beamforming for large-scale circular array sonar based on parallel FIR filter structure in FPGA

18th International Conference on Algorithms and Architectures for Parallel Processing - ICA3PP 2018 Collocated Workshops: ICA3PP 2018 Workshop on Intelligent Algorithms for Large-scale Complex Optimization Problems, IALCOP 2018, ICA3PP 2018 Workshop on Security and Privacy in Data Processing, SPDP 2018

Authors: Wang, Jun; Jiao, Junsheng. Science and Technology on Sonar Laboratory, Hangzhou Applied Acoustics Research Institute, Hangzhou, China

ISBN: (print) 9783030052331

In this paper, the directivity of the circular array is analyzed, and a real-time frequency-domain beamforming algorithm for circular arrays, structured like a parallel FIR filter, is proposed by exploiting the identical directivity in different directions and the symmetry of the steer vector. The coefficients of parallel FIR (Finite Impulse Response) filters are constant, while the steer vector of each frequency in the beamforming algorithm is variable, so the coefficients (steer vector) of the beamforming parallel filter need to be changed dynamically according to the frequency points. For frequency-domain beamforming of high-frequency large-scale circular array sonar, this algorithm needs only two shift registers, a number of complex multipliers equal to half the number of elements used in beamforming, and a few logic resources to calculate the steer vector in an FPGA (Field Programmable Gate Array), and it can run at a higher working frequency. The lake-trial result shows that this parallel algorithm satisfies the real-time requirements of high-frequency large-scale circular array sonar, and that the sonar has good azimuth resolution and detection performance. © Springer Nature Switzerland AG 2018.

Keywords: Beamforming

Porting and optimizing VASP on the SW26010

Authors: Li, Leisheng; Sun, Qiao; Liu, Xin; Wu, Changmao; Zhao, Haitao; Zhang, Changyou. Laboratory of Parallel Software and Computational Science, Institute of Software, Chinese Academy of Sciences, Beijing, China; National Research Centre of Parallel Computer Engineering and Technology, Wuxi, Jiangsu, China

ISBN: (print) 9783030052331

VASP (Vienna Ab initio Simulation Package) is a prevalent first-principles software framework. It is so widely used that its runtime usually dominates the usage of current supercomputers. Porting and optimizing VASP for the Sunway TaihuLight supercomputer, a new heterogeneous many-core platform based on the SW26010 CPU, is therefore of great importance. In this paper, we focus on the challenges in porting and optimizing VASP on the SW26010 CPU. Optimizations of three types of time-consuming kernels, namely matrix operations, FFT, and certain domain-specific computing primitives, are carried out based on thorough performance profiling. The experimental results are shown for the RELAX case, where speedups of 2.90x and 4.48x are sustained for the two iterative diagonalization methods in VASP, RMM-DIIS (RMM) and block Davidson (DAV), respectively. © Springer Nature Switzerland AG 2018.

Keywords: Density functional theory

PLZMA: A Parallel Data Compression Method for Cloud Computing

18th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP)

Authors: Wang, Xin; Gan, Lin; Xu, Jingheng; Yang, Jinzhe; Xia, Maocai; Fu, Haohuan; Huang, Xiaomeng; Yang, Guangwen. Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China; Tsinghua Univ, Key Lab Earth Syst Modeling, Minist Educ, Beijing, Peoples R China; Tsinghua Univ, Dept Earth Syst Sci, Beijing, Peoples R China; Natl Supercomp Ctr, Wuxi, Jiangsu, Peoples R China; Imperial Coll London, Dept Comp, London, England; Qingdao Natl Lab Marine Sci & Technol, Lab Reg Oceanog & Numer Modeling, Qingdao, Peoples R China

ISBN: (print) 9783030050573; 9783030050566

Recent decades have seen the rapid development of cloud computing, resulting in a huge breakthrough in people's ability to handle the data produced every second and everywhere. Meanwhile, data compression is becoming increasingly important because of its great potential to benefit both network transport and storage. Motivated by the urgent demand for a highly efficient compression method with balanced performance in both compression time and compression ratio, this paper presents PLZMA, a parallel design of LZMA. Process-level and thread-level parallelism are implemented according to the LZMA algorithm, yielding a great improvement in compression time while ensuring a fair compression ratio. Experimental results on a real-world application showed that PLZMA achieves more balanced performance than other well-known methods. The parallel design achieves a speedup of 8x over the serial baseline, using 12 threads.

Keywords: Data compression; Parallel computing; LZMA

Accelerating Exhaustive Pairwise Metagenomic Comparisons

17th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP)

Authors: Perez-Wohlfeil, Esteban; Torreno, Oscar; Trelles, Oswaldo. Univ Malaga, Dept Comp Architecture, Blvd Louis Pasteur 35, Malaga, Spain

ISBN: (print) 9783319654829; 9783319654812

In this manuscript, we present an optimized and parallel version of our previous work IMSAME, an exhaustive gapped aligner for the pairwise and accurate comparison of metagenomes. Parallelization strategies are applied to take advantage of modern multiprocessor architectures. In addition, sequential optimizations in CPU time and memory consumption are provided. These algorithmic and computational enhancements enable IMSAME to calculate near-optimal alignments, which are used to directly assess similarity between metagenomes without requiring reference databases. We show that the overall efficiency of the parallel implementation is above 80% while retaining scalability as the number of parallel cores increases. Moreover, we show that the sequential optimizations yield up to 8x speedup for scenarios with larger data.

Keywords: High Performance Computing; Pairwise comparison; Parallel computing; Next Generation Sequencing; Metagenome comparison

15th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2015

ISBN: (print) 9783319271361

The proceedings contain 59 papers. The special focus in this conference is on Applications of Parallel and Distributed Computing. The topics include: on exploring a virtual agent negotiation inspired approach for route guidance in urban traffic networks; optimization of binomial option pricing on Intel MIC heterogeneous system; stencil computations on HPC-oriented ARMv8 64-bit multi-core processor; a particle swarm optimization algorithm for controller placement problem in software defined network; a streaming execution method for multi-services in mobile cloud computing; economy-oriented deadline scheduling policy for render system using IaaS cloud; towards detailed tissue-scale 3D simulations of electrical activity and calcium handling in the human cardiac ventricle; task parallel implementation of matrix multiplication on multi-socket multi-core architectures; refactoring for separation of concurrent concerns; exploiting scalable parallelism for remote sensing analysis models by data transformation graph; resource-efficient vibration data collection in cyber-physical systems; a new approach for vehicle recognition and tracking in multi-camera traffic system; a scalable distributed fingerprint identification system; energy saving and load balancing for SDN based on multi-objective particle swarm optimization; pre-stack Kirchhoff time migration on Hadoop and Spark; a cyber physical system with GPU for CNC applications; a solution of the controller placement problem in software defined networks; parallel column subset selection of kernel matrix for scaling up support vector machines; real-time deconvolution with GPU and Spark for big imaging data analysis; and parallel Kirchhoff pre-stack depth migration on large high performance clusters.


Scheduling Stochastic Tasks with Precedence Constrain on Cluster Systems with Heterogenous Communication Architecture

15th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP)

Authors: Liao, Qun; Jiang, Shuangshuang; Hei, Qiaoxiang; Li, Tao; Yang, Yulu. Nankai Univ, Coll Comp & Control Engn, Tianjin 300353, Peoples R China

ISBN: (print) 9783319271613; 9783319271606

Scheduling precedence-constrained stochastic tasks on heterogeneous cluster systems is an important issue that significantly affects cluster performance. Unlike deterministic task models, the stochastic task model assumes that the workload of a task and the quantity of data transmitted between tasks are stochastic variables, which is more realistic than other task models. Scheduling models and algorithms for precedence-constrained stochastic tasks have recently attracted the attention of many researchers. The SDLS (Stochastic Dynamic Level Scheduling) algorithm has been shown to perform well in scheduling stochastic tasks on heterogeneous clusters. However, the assumption SDLS makes about communication time between tasks is much simpler than its assumptions about task computing time, so it cannot depict the communication cost among heterogeneous links well. In this paper, the quantity of data communicated between tasks is assumed to be a normally distributed stochastic variable, instead of directly assuming that the communication time on all heterogeneous links follows the same stochastic variable. Moreover, a modified scheduling model and algorithm, SDLS-HC (Stochastic Dynamic Level Scheduling on Heterogenous Communication links), are proposed. The work in this paper focuses on modeling communication cost in much more detail in task scheduling based on SDLS. Experiments on many randomly generated task sets demonstrate that SDLS-HC achieves better performance than SDLS on cluster systems with heterogeneous links.

Keywords: Stochastic tasks scheduling; Directed acyclic graph; Heterogenous clusters; Parallel and distributed processing; Distributed system

Task parallel implementation of matrix multiplication on multi-socket multi-core architectures

15th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2015

Authors: Wang, Yizhuo; Ji, Weixing; Chen, Xu; Hu, Sensen. School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China

ISBN: (print) 9783319271361

Matrix multiplication is a very important computation kernel in many science and engineering applications. This paper presents a parallel implementation framework for dense matrix multiplication on multi-socket multi-core architectures. Our framework first partitions the computation between the multi-core processors. Then a hybrid matrix multiplication algorithm, which combines the Winograd algorithm and the classical algorithm, is used on each processor. In addition, a hierarchical work-stealing scheme is applied to achieve dynamic load balancing and enforce data locality in our framework. Performance experiments on two platforms show that our implementation achieves significant performance gains compared with state-of-the-art implementations. © Springer International Publishing Switzerland 2015.

Keywords: Matrix algebra

iPLAR: Towards interactive programming with parallel linear algebra in R

15th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2015

Authors: Wang, Zhaokang; Fan, Shiqing; Gu, Rong; Yuan, Chunfeng; Huang, Yihua. National Key Laboratory for Novel Software Technology, Collaborative Innovation Center of Novel Software Technology and Industrialization, Nanjing University, Nanjing 210023, China

ISBN: (print) 9783319271392

R is a widely used statistical programming language in the data science community. However, in the big data era, R faces challenges from large-scale data analysis tasks. It lacks the ability to perform distributed linear algebra computation in its local interactive shell. In this paper, we propose iPLAR, a system that runs in the interactive R environment, wraps a high-performance parallel linear algebra library, and provides a group of easy-to-use interfaces. iPLAR adopts the client-server model to decouple the interactive shell from the ScaLAPACK/MPI distributed computing backend. In addition, it provides R users with a group of interfaces that hide the parallel details and are similar to the native R linear algebra interfaces. We evaluate the efficiency of iPLAR with representative basic matrix operations and two widely used machine learning algorithms. Experimental results show that iPLAR achieves near-linear data scalability and extends the interactive processing capability of R to large problem scales. © 2015 Springer International Publishing Switzerland.

Keywords: Big data

Beyond Data Parallelism: Identifying parallel tasks in sequential programs

15th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2015

Authors: Li, Zhen; Zhao, Bo; Jannesari, Ali; Wolf, Felix. Technische Universität Darmstadt, Darmstadt 64289, Germany; Xi’an Jiaotong University, Xi’an 710049, China

ISBN: (print) 9783319271392

Today, millions of legacy programs are awaiting their parallelization. For this reason, the automatic discovery of parallelism in sequential programs is now receiving considerable attention. However, past efforts mainly concentrated on data parallelism hidden inside loops. As programming models begin to support more irregular types of parallelism, centered around the notion of tasks in various forms, methods are needed to identify code sections that could potentially represent parallel tasks. In this paper, we present a novel approach to automatically finding parallel tasks in sequential programs. We first create a dynamic dependence graph, then isolate tasks, and finally produce a task graph according to the dependences we find. With the help of a source-to-source code translator, parallel code is automatically generated. We conducted a range of experiments to cover both tasks executing the same code and tasks executing different code. Results showed that our method achieved reasonable speedups on the test cases. © Springer International Publishing Switzerland 2015.

Keywords: Parallel programming

Performance characterization and optimization for Intel Xeon Phi coprocessor

15th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2015

Authors: Zhang, Cheng; Liu, Li; Li, Ruizhe; Yang, Guangwen. Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China; Center for Earth System Science, Tsinghua University, Beijing 100084, China

ISBN: (digital) 9783319271194

ISBN: (print) 9783319271187

The Intel Xeon Phi is a many-core accelerator that targets high-performance applications. To characterize the performance of the Intel Xeon Phi, a system of dual 8-core Intel Xeon E5-2670 processors is employed as a control platform, and a subset of the PARSEC benchmark suite is selected as the benchmark applications. The first evaluation in this paper shows that the applications are on average 2.06x slower on the Intel Xeon Phi than on the dual Intel Xeon E5-2670. The further detailed performance characterization quantifies the performance impact of various architecture parameters on the Intel Xeon Phi. To set an example of how to improve the architecture of the Intel Xeon Phi for better performance, a hardware optimization with an additional set of vector processing units is discussed, and a simple emulator is developed accordingly. The evaluation results show that this optimization can provide an average speedup of 1.10. © Springer International Publishing Switzerland 2015.

Keywords: Parallel architectures


Address: 235 Daxue West Street, Saihan District, Hohhot, Inner Mongolia Autonomous Region. Postal code: 010021
