检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

时间限定

出版年份：

文献类型

图书期刊文献学位论文多媒体

馆藏选择

电子馆藏纸本馆藏

核心期刊

全部期刊 SCI 收录期刊 SSCI 收录期刊 EI 收录期刊 CSCD 收录期刊 CSSCI 收录期刊

语言

中文英文

文献类型

期刊文献图书学位论文标准纸本馆藏

帮助

文字说明：

T=题名（书名、题名），A=作者（责任者），K=主题词，P=出版物名称，PU=出版社名称，O=机构（作者单位、学位授予单位、专利申请人），L=中图分类号，C=学科分类号，U=全部字段，Y=年（出版发行年、学位年度、标准发布年）

检索规则说明：

AND代表“并且”；OR代表“或者”；NOT代表“不包含”；(注意必须大写,运算符两边需空一格)

检索范例：

范例一：(K=图书馆学 OR K=情报学) AND A=范并思 AND Y=1982-2016
范例二：P=计算机应用与软件 AND (U=C++ OR U=Basic) NOT K=Visual AND Y=2011-2016

分类表

所选分类

>> <<

限定检索结果

文献类型

2,691 篇 会议
58 册 图书
54 篇 期刊文献

馆藏范围

2,803 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

1,845 篇 工学
- 1,629 篇 计算机科学与技术...
- 847 篇 软件工程
- 340 篇 电气工程
- 222 篇 电子科学与技术（可...
- 209 篇 信息与通信工程
- 84 篇 控制科学与工程
- 63 篇 光学工程
- 57 篇 机械工程
- 41 篇 仪器科学与技术
- 39 篇 生物医学工程（可授...
- 38 篇 生物工程
- 31 篇 材料科学与工程（可...
- 25 篇 动力工程及工程热...
- 21 篇 化学工程与技术
- 20 篇 建筑学
- 15 篇 土木工程
- 13 篇 力学（可授工学、理...
- 12 篇 交通运输工程
500 篇 理学
- 343 篇 数学
- 113 篇 物理学
- 51 篇 系统科学
- 48 篇 生物学
- 30 篇 统计学（可授理学、...
- 26 篇 化学
173 篇 管理学
- 119 篇 管理科学与工程(可...
- 62 篇 图书情报与档案管...
- 49 篇 工商管理
40 篇 医学
- 30 篇 临床医学
- 14 篇 基础医学(可授医学...
15 篇 法学
- 15 篇 社会学
9 篇 经济学
9 篇 农学
8 篇 文学
2 篇 军事学
1 篇 教育学

主题

360 篇 parallel process...
219 篇 computer archite...
205 篇 graphics process...
146 篇 parallel archite...
135 篇 graphics process...
129 篇 hardware
116 篇 parallel algorit...
112 篇 image processing
98 篇 computational mo...
94 篇 concurrent compu...
87 篇 instruction sets
86 篇 field programmab...
83 篇 algorithm design...
79 篇 multicore proces...
77 篇 signal processin...
76 篇 parallel process...
66 篇 parallel program...
60 篇 gpu
59 篇 throughput
59 篇 kernel

机构

11 篇 natl univ def te...
6 篇 college of compu...
6 篇 school of comput...
6 篇 hosei univ dept ...
6 篇 natl univ def te...
5 篇 univ aizu dept c...
5 篇 carleton univ sc...
5 篇 school of comput...
5 篇 computer science...
5 篇 inria rennes
5 篇 city university ...
4 篇 chinese acad sci...
4 篇 univ michigan ad...
4 篇 institute of com...
4 篇 univ chinese aca...
4 篇 school of comput...
4 篇 univ jaume 1 dep...
4 篇 hainan internati...
4 篇 tech univ cluj n...
4 篇 department of co...

作者

11 篇 jack dongarra
10 篇 roman wyrzykowsk...
9 篇 konrad karczewsk...
9 篇 quintana-orti en...
7 篇 dongarra jack
7 篇 kothapalli kisho...
6 篇 hannig frank
6 篇 liu jie
6 篇 su jinshu
6 篇 nakano koji
6 篇 peng shietung
6 篇 li yamin
6 篇 chu wanming
6 篇 wyrzykowski roma...
6 篇 thulasiraman par...
5 篇 ito yasuaki
5 篇 jerzy waśniewski
5 篇 wang guojun
5 篇 geyong min
5 篇 wanlei zhou

语言

2,757 篇 英文
21 篇 其他
15 篇 中文
11 篇 俄文
2 篇 乌克兰文
1 篇 西班牙文

检索条件"任意字段=10th International Conference on Algorithms and Architectures for Parallel Processing"

共 2803 条记录，以下是1121-1130 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

Performance and Energy Analysis of the Iterative Solution of Sparse Linear Systems on Multicore and Manycore architectures

Performance and Energy Analysis of the Iterative Solution of...

引用

10th international conference on parallel processing and Applied Mathematics (PPAM)

作者： Aliaga, Jose I. Anzt, Hartwig Castillo, Maribel Fernandez, Juan C. Leon, German Perez, Joaquin Quintana-Orti, Enrique S. Univ Tennessee ICL Knoxville TN 37996 USA Univ Jaume 1 Dept Ingn & Ciencia Comp Castellon de La Plana 12071 Spain

ISBN: (纸本)9783642552243

In this paper we investigate the performance-energy balance of a variety of concurrent architectures, from general-purpose and digital signal multicore systems to graphics processors (GPUs), representative of current technology. this analysis employs the conjugate gradient method, an important algorithm for the iterative solution of linear systems that is basically composed of the sparse matrix-vector product and other (minor) vector kernels. To allow a fair comparison, we leverage simple implementations of the numerical methods and underlying kernels, and rely only on those optimizations applied by the target compiler.

关键词： Energy efficiency High-performance computing Sparse linear algebra Multicore processors Low-power processors GPUs

来源：评论

学校读者我要写书评

暂无评论

Parampl: A Simple Approach for parallel Execution of AMPL Programs 1

引用

10th international conference on parallel processing and Applied Mathematics (PPAM)

作者： Olszak, Artur Karbowski, Andrzej Warsaw Univ Technol Inst Comp Sci Warsaw Poland

ISBN: (数字)9783642551956

ISBN: (纸本)9783642551956

Due to the physical processor frequency scaling constraint, current computer systems are equipped with more and more processing units. therefore, parallel computing has become an important paradigm in the recent years. AMPL is a comprehensive algebraic modeling language for formulating optimization problems. However, AMPL itself does not support defining tasks to be executed in parallel. Although in last years the parallelism is often provided by solvers, which take advantage of multiple processing units, in many cases it is more efficient to formulate the problem in a decomposed way and apply various problem specific enhancements. Moreover, when the number of cores is permanently growing, it is possible to use both types of parallelism. this paper presents the design of Parampl - a simple tool for parallel execution of AMPL programs. Parampl introduces explicit asynchronous execution of AMPL subproblems from within the program code. Such an extension implies a new view on AMPL programs, where a programmer is able to define complex, parallelized optimization tasks and formulate algorithms solving optimization subproblems in parallel.

关键词： AMPL parallel Optimization Modeling languages

来源：评论

学校读者我要写书评

暂无评论

thread Mapping and parallel Optimization for MIC Heterogeneous parallel Systems

Thread Mapping and Parallel Optimization for MIC Heterogeneo...

引用

14th international conference on algorithms and architectures for parallel processing (ICA3PP)

作者： Ju, Tao Zhu, Zhengdong Wang, Yinfeng Li, Liang Dong, Xiaoshe Xi An Jiao Tong Univ Sch Elect & Informat Engn Xian 710049 Peoples R China Shenzhen Inst Informat Technol Shenzhen 518172 Peoples R China

ISBN: (纸本)9783319111940;9783319111933

there is no dedicated thread mapping method for Many Integrated Core (MIC) heterogeneous system in the traditional multithread programming model. the unreasonable thread mapping will lead the promising computing power of MIC coprocessor not to be fully exploited. In order to fully exploit the computing potential of MIC coprocessor, this paper discussed effective multi threads mapping strategies through comparing the computing performance and analyzing the performance differences between various mapping methods. Meanwhile, for the further exploiting the high computing power of MIC heterogeneous system, the specific program porting and performance optimization strategies were explored by using the k-means application program. Experimental results show that the proposed mapping and parallel optimization strategies are effective, which can be guide the programmer to port and optimize applications effectively to MIC heterogeneous parallel system.

关键词： Computing power

来源：评论

学校读者我要写书评

暂无评论

A massively parallel processing for the Multiple Linear Regression 10

A massively parallel processing for the Multiple Linear Regr...

引用

international conference on Signal-Image Technology and Internet-Based Systems SITIS

作者： Adjout, Moufida Rehab Boufares, Faouzi Univ Paris 13 Lab LIPN UMR 7030 CNRS Av JB Clement F-93430 Villetaneuse France

ISBN: (纸本)9781479979783

the amount of data generated by traditional business activities, has resulted data warehouses with a size up to petabytes. the ability to analyze this torrent of data will become the basis of competition and growth for individual firms by ever-narrower segmentation of customers, improvement of decision-making and unearth valuable insights that would otherwise remain hidden. For this purpose, the large size of data to be processed requires the use of high-performance analytical systems running on distributed environments. Because the data is so big it affects the types of algorithms we are willing to consider. then standard analytics algorithms need to be adapted to take advantage of cloud computing models which provide scalability and flexibility. this work illustrates an implementation of a parallel version of the multiple linear regression. It can extract coefficients from large amounts of data, based on MapReduce Framework with large scale. parallel processing of multiple linear regression will be based on the QR decomposition and the ordinary least squares method adapted to Map Reduce. Our platform in deployed on Cloud Amazon EMR. Experimental results demonstrate that the our parallel version of the multiple linear regression can efficiently handle very large datasets on commodity hardware with a good performance on different evaluation criterions, including number, size and structure of machines in the cluster.

关键词： Data mining Predictive analysis Multiple linear regression Big Data Hadoop MapReduce Cloud Computing

来源：评论

学校读者我要写书评

暂无评论

SignalPU: A programming model for DSP applications on parallel and heterogeneous clusters 16

SignalPU: A programming model for DSP applications on parall...

引用

16th IEEE Int Conf on High Performance Computing and Communications/11th IEEE Int Conf on Embedded Software and Systems\6th Int Symposium on Cyberspace Safety and Security

作者： Mansouri, Farouk Huet, Sylvain Houzet, Dominique Univ Stendhal CNRS INPG UJF GIPSA Lab UMR 5216 F-38402 Grenoble France

ISBN: (纸本)9781479961238

the biomedical imagery, the numeric communications, the acoustic signal processing and many others digital signal processing (DSP) applications are present more and more in the numeric world. they process growing data volume which is represented with more and more accuracy, and use complex algorithms with time constraints to satisfying. Consequently, a high requirement of computing power characterize them. To satisfy this need, it's inevitable today to use parallel and heterogeneous architectures in order to speedup the processing, where the best examples are today's supercomputers like "Tianhe-2" and "Titan" of Top500 ranking. these architectures with their multi-core nodes supported by many-core accelerators offer a good response to this problem. However, they are still hard to program to make performance because of many reasons: parallelism expression, task synchronization, memory management, hardware specifications handling, load balancing ... In the present work, we are characterizing DSP applications and propose a programming model based on their distinctiveness in order to implement them easily and efficiently on heterogeneous clusters.

关键词： Digital signal processing data flow graph graphic processing unit model of programming parallel and heterogenous programming Digital signal processing data flow graphs Demonstration programmes parallel Lines Graphics processing Unit Desmoplakin Gene Programming computational power Time constraints

来源：评论

学校读者我要写书评

暂无评论

Using GPUs for parallel Stencil Computations in Relativistic Hydrodynamic Simulation

Using GPUs for Parallel Stencil Computations in Relativistic...

引用

10th international conference on parallel processing and Applied Mathematics (PPAM)

作者： Cygert, Sebastian Kikola, Daniel Porter-Sobieraj, Joanna Sikorski, Jan Slodkowski, Marcin Warsaw Univ Technol Fac Math & Informat Sci Koszykowa 75 PL-00662 Warsaw Poland Purdue Univ Dept Phys W Lafayette IN 47907 USA Warsaw Univ Technol Fac Phys PL-00662 Warsaw Poland

ISBN: (纸本)9783642552243

this paper explores the possibilities of using a GPU for complex 3D finite difference computation. We propose a new approach to this topic using surface memory and compare it with 3D stencil computations carried out via shared memory, which is currently considered to be the best approach. the case study was performed for the extensive computation of collisions between heavy nuclei in terms of relativistic hydrodynamics.

关键词： Finite difference Riemann solver MUSTA-FORCE algorithm parallel algorithms CUDA

来源：评论

学校读者我要写书评

暂无评论

Porting the Princeton Ocean Model to GPUs 1

引用

14th international conference on algorithms and architectures for parallel processing (ICA3PP)

作者： Xu, Shizhen Huang, Xiaomeng Zhang, Yan Hu, Yong Fu, Haohuan Yang, Guangwen Tsinghua Univ Minist Educ Key Lab Earth Syst Modeling Beijing 100084 Peoples R China

ISBN: (数字)9783319111971

ISBN: (纸本)9783319111971;9783319111964

While GPU is becoming a compelling acceleration solution for a series of scientific applications, most existing work on climate models only achieved limited speedup. It is due to partial porting of the huge code and the memory bound inherence of these models. In this work, we design and implement a customized GPU-based acceleration of the Princeton Ocean Model (gpuPOM). Based on Nvidia's state-of-the-art GPU architectures (K20X and K40m), we rewrite the original model from the Fortran into the CUDA-C completely. Several accelerating methods, including optimizing memory access in a single GPU, overlapping communication and boundary operations among multiple GPUs, are presented. the experimental results show that the gpuPOM on one K40m GPU achieves 6.9-fold to 17.8-fold speedup and 5.8-fold to 15.5-fold speedup on one K20X GPU comparing with different Intel CPUs. Further experiments on multiple GPUs indicate that the performance of the gpuPOM on a super-workstation containing 4 GPUs is equivalent to a powerful cluster consisting of 34 pure CPU nodes with over 400 CPU cores.

关键词： Graphics processing unit

来源：评论

学校读者我要写书评

暂无评论

FooPar: A Functional Object Oriented parallel Framework in Scala 1

引用

10th international conference on parallel processing and Applied Mathematics (PPAM)

作者： Hargreaves, Felix Palludan Merkle, Daniel Univ Southern Denmark Dept Math & Comp Sci Odense Denmark

ISBN: (数字)9783642551956

ISBN: (纸本)9783642551956

We present FooPar, an extension for highly efficient parallel Computing in the multi-paradigm programming language Scala. Scala offers concise and clean syntax and integrates functional programming features. Our framework FooPar combines these features with parallel computing techniques. FooPar is designed to be modular and supports easy access to different communication backends for distributed memory architectures as well as high performance math libraries. In this article we use it to parallelize matrix-matrix multiplication and show its scalability by a isoefficiency analysis. In addition, results based on a empirical analysis on two supercomputers are given. We achieve close-to-optimal performance wrt. theoretical peak performance. Based on this result we conclude that FooPar allows programmers to fully access Scalas design features without suffering from performance drops when compared to implementations purely based on C and MPI.

关键词： Functional programming Isoefficiency Matrix multiplication

来源：评论

学校读者我要写书评

暂无评论

Runtime prediction on new architectures 14

Runtime prediction on new architectures

引用

10th Central and Eastern European Software Engineering conference in Russia, CEE-SECR 2014

作者： Sidnev, Aleksey A. Lobachevsky State University of Nizhni Novgorod Russia

ISBN: (纸本)9781450328890

this paper formulates the program runtime prediction problem subject to algorithm parameters and characteristics of a computational system to be used to run the algorithm. It is suggested to build a model representing runtime as a function of algorithm parameters and computational system characteristics. this is followed by determination of features to be used for functional dependence recovery. A two-step method of problem solution using linear and non-linear machine learning algorithms is proposed. the paper examines peculiarities of software algorithms and suggests a method for processing experimental data provided by computational systems. It also features a comparative analysis of runtime prediction results for solution of several linear algebra problems on 84 personal computers and servers using a number of machine learning algorithms. Use of a random forest combined with the linear least square method shows an error of less than 15% for most computational systems of similar architecture. Copyright 2014 ACM.

关键词： Regression analysis

来源：评论

学校读者我要写书评

暂无评论

An Algorithm to Embed a Family of Node-Disjoint 3D Meshes into Locally Twisted Cubes 1

引用

14th international conference on algorithms and architectures for parallel processing (ICA3PP)

作者： You, Lantao Han, Yuejuan Soochow Univ Suzhou Ind Pk Inst Serv Outsourcing Suzhou 215000 Peoples R China Soochow Univ Ctr Informat Dev & Management Suzhou 215000 Peoples R China

ISBN: (数字)9783319111940

ISBN: (纸本)9783319111940;9783319111933

In this paper, embeddings of a family of 3D meshes in locally twisted cubes are studied. Let LTQ(n)(V, E) denotes the n-dimensional locally twisted cube. We find two major results in this paper:(1) For any integer n >= 4, two node-disjoint 3D meshes of size 2 x 2 x 2(n-3) can be embedded into LTQ(n) with dilation 1 and expansion 2. (2) For any integer n = 6, four node-disjoint 4x2x2(n-5) meshes can be embedded into LTQ(n) with dilation 1 and expansion 4. Further, an embedding algorithm can be constructed based on our embedding method. the obtained results are optimal in the sense that the dilations of the embeddings are 1.

关键词： Interconnection networks locally twisted cube 3D mesh embedding parallel computing

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共281页 << < 109 110 111 112 113 114 115 116 117 118 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：