ISBN (print): 9783030050511; 9783030050504
The paper presents a parallelization of an exact resolution algorithm for the Probabilistic Traveling Salesman Problem (PTSP). This algorithm allows us, first, to verify the stability of well-solvable special cases and also to solve useful instances of the PTSP to optimality. It further allows us to apply our version of Karp's partitioning algorithm, since real problems are very large. The implementation of Karp's algorithm consists in subdividing the square plane into sub-planes, thereby transforming the resolution of one large problem into the resolution of many small sub-problems that can be solved exactly. This application can be gridified, and the different sub-problems can be processed in parallel by different nodes since they are totally independent. In each sub-plane the Branch and Bound algorithm is used. In this paper we propose two parallelizations of the Branch and Bound algorithm for the resolution of the PTSP: on the one hand, the parallelization of the branches used in the exploration of the search tree; on the other hand, the parallelization of the algorithm associated with the notion of partitioning introduced by Karp. We perform an experimental study in a multi-core environment to evaluate the performance of the proposed approach.
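To make the decomposition concrete, here is a minimal Python sketch of the Karp-style partitioning described above; solve_subproblem is an assumed placeholder standing in for an exact branch-and-bound PTSP solver, and the plane is cut into k x k sub-planes whose sub-problems are fully independent.

# Sketch of Karp-style spatial partitioning (assumed helpers, not the
# authors' implementation): the unit square is cut into k x k sub-planes
# and each sub-problem can be solved exactly in parallel.
from multiprocessing import Pool

def partition(points, k):
    """Assign each (x, y) point in [0, 1)^2 to one of k*k sub-planes."""
    cells = {}
    for x, y in points:
        cell = (min(int(x * k), k - 1), min(int(y * k), k - 1))
        cells.setdefault(cell, []).append((x, y))
    return list(cells.values())

def solve_subproblem(points):
    # Placeholder for an exact branch-and-bound PTSP solver on a small
    # instance; here we just return the points in input order as a "tour".
    return points

if __name__ == "__main__":
    import random
    pts = [(random.random(), random.random()) for _ in range(200)]
    with Pool() as pool:
        # Sub-problems are totally independent, so they map cleanly to workers.
        sub_tours = pool.map(solve_subproblem, partition(pts, 4))
    print(len(sub_tours), "sub-tours computed in parallel")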
ISBN (print): 9783030050511; 9783030050504
With the growth of data sets and neural networks, training time is increasing rapidly. Distributed parallel training has been proposed to accelerate deep neural network training, and most efforts target GPU clusters. This paper focuses on the performance of distributed parallel training on the CPU clusters of supercomputer systems. Using resources of the "Tianhe-2" supercomputer system, we conduct an extensive evaluation of popular deep learning tools, including Caffe, TensorFlow, and BigDL, and test several deep neural network models, including AutoEncoder, LeNet, AlexNet and ResNet. The experimental results show that Caffe performs best in communication efficiency and scalability. BigDL is the fastest in computing speed, benefiting from its CPU optimizations, but it suffers from long communication delays due to its dependency on the MapReduce framework. The insights and conclusions from our evaluation provide a significant reference for improving the resource utilization of supercomputers in distributed deep learning.
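For orientation, the synchronous data-parallel pattern these tools implement can be sketched in a few lines; the mpi4py example below illustrates the pattern only (it is not code from Caffe, TensorFlow or BigDL), and local_gradient is a toy stand-in for a real backward pass.

# Illustrative synchronous data-parallel step: every rank computes a local
# gradient, then an Allreduce sums it so all replicas apply the same update.
# The cost of this communication phase is exactly what distinguishes the
# frameworks compared above.
import numpy as np
from mpi4py import MPI

comm = MPI.COMM_WORLD

def local_gradient(weights, batch):
    # Toy least-squares gradient standing in for a real backward pass.
    x, y = batch
    return 2 * x.T @ (x @ weights - y) / len(y)

def train_step(weights, local_batch, lr=0.01):
    grad = local_gradient(weights, local_batch)   # per-rank computation
    total = np.empty_like(grad)
    comm.Allreduce(grad, total, op=MPI.SUM)       # communication phase
    return weights - lr * (total / comm.Get_size())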
ISBN (print): 9781538651148
The Finite Difference Method (FDM) is the most common parallel computing method for high-performance numerical simulation. The method can solve for physical field parameters efficiently when the grid refinement is dense enough. However, parallel acceleration of FDM requires dividing the raw data into several parts, one per computing node. As a result, the boundary data of each computing node must be exchanged with the boundary data of its six adjacent computing nodes. Once the communication time step is short enough and the computing nodes are of high performance, communication latency may become the primary cause of parallel inefficiency. This paper provides a communication latency hiding scheme in which part of each node's boundary data can be communicated and computed simultaneously. The experimental results show that the proposed method improves parallel efficiency and reduces running time.
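A minimal sketch of the overlap idea, assuming a 1-D domain decomposition with mpi4py (the paper's setting is 3-D with six neighbors, but the pattern is identical): post non-blocking halo exchanges, update the interior while they are in flight, then finish the edges.

# Communication/computation overlap for an FDM stencil (1-D halo exchange
# for brevity; the 3-D case posts six such pairs). Interior points need no
# remote data, so they are updated while the halos are in flight.
import numpy as np
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank, size = comm.Get_rank(), comm.Get_size()
left, right = (rank - 1) % size, (rank + 1) % size

u = np.random.rand(1000)          # local slab; u[0] and u[-1] are ghost cells
recv_l, recv_r = np.empty(1), np.empty(1)

reqs = [comm.Isend(u[1:2], dest=left),   comm.Isend(u[-2:-1], dest=right),
        comm.Irecv(recv_l, source=left), comm.Irecv(recv_r, source=right)]

new = np.empty_like(u)
new[2:-2] = 0.5 * (u[1:-3] + u[3:-1])    # interior update overlaps communication

MPI.Request.Waitall(reqs)                # halos arrived; finish the edges
u[0], u[-1] = recv_l[0], recv_r[0]
new[1] = 0.5 * (u[0] + u[2])
new[-2] = 0.5 * (u[-3] + u[-1])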
ISBN (digital): 9783030050573
ISBN (print): 9783030050573; 9783030050566
In the field of signal processing, the Fast Fourier Transform (FFT) is a widely used algorithm to transform signal data from the time domain to the frequency domain. Unfortunately, with the exponential growth of data, traditional methods cannot meet the demand for large-scale computation on such big data because of three main challenges of large-scale FFT: big data size, real-time data processing, and high utilization of compute resources. To satisfy these requirements, an optimized FFT algorithm in the Cloud is sorely needed. In this paper, we introduce a new method to conduct FFT in the Cloud with the following contributions: first, we design a parallel FFT algorithm for large-scale signal data in the Cloud; second, we propose a MapReduce-based mechanism to distribute data to compute nodes using a big data processing framework; third, we implement an optimal method of distributing compute resources that accelerates the algorithm by avoiding redundant data exchange between compute nodes. The algorithm is designed in the MapReduce computation framework and contains three steps: data preprocessing, local data transform, and parallel data transform to integrate the processing results. The parallel FFT is implemented in a 16-node Cloud to process real signal data. The experimental results reveal an obvious improvement in speed: our parallel FFT is approximately five times faster than FFT in MATLAB when the data size reaches 10 GB.
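The factorization that makes FFT amenable to MapReduce is the classic four-step (Cooley-Tukey) decomposition; the single-process numpy sketch below illustrates the mathematics only, not the paper's Cloud implementation.

# Four-step FFT factorization underlying distributed FFTs: an N = n1*n2
# transform becomes n2 FFTs of size n1 (mappable as independent tasks),
# a twiddle multiply, and n1 FFTs of size n2 (the integration step).
import numpy as np

def four_step_fft(x, n1, n2):
    a = x.reshape(n1, n2)
    a = np.fft.fft(a, axis=0)                        # FFTs over columns
    k1 = np.arange(n1).reshape(-1, 1)
    j2 = np.arange(n2).reshape(1, -1)
    a = a * np.exp(-2j * np.pi * k1 * j2 / (n1 * n2))  # twiddle factors
    a = np.fft.fft(a, axis=1)                        # FFTs over rows
    return a.T.reshape(-1)                           # transpose to output order

x = np.random.rand(4096)
assert np.allclose(four_step_fft(x, 64, 64), np.fft.fft(x))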
ISBN (print): 9783030304904; 9783030304898
Two main approaches are currently prevalent in the digital emulation of musical instruments: manipulation of pre-recorded samples and techniques of real-time synthesis, generally based on physical models of varying degrees of accuracy. Concerning the first, while the processing power of present-day computers enables their use in real time, many restrictions arising from this sample-based design persist; the huge on-disk space requirements and the stiffness of musical articulations are the most prominent. On the other side of the spectrum, pure synthesis approaches, while offering greater flexibility, fail to capture and reproduce certain nuances central to the verisimilitude of the generated sound, offering a dry, synthetic output at a high computational cost. We propose a method in which ensembles of lightweight neural networks working in parallel learn, from crafted frequency-domain features of an instrument's sound spectra, an arbitrary instrument's voice and articulations realistically and efficiently. We find that our method, while retaining perceptual sound quality on par with sampled approaches, exhibits 1/10 of the latency of industry-standard real-time synthesis algorithms and 1/100 of the disk space requirements of industry-standard sample-based digital musical instruments. This method can therefore serve as a basis for more efficient implementations in dedicated devices, such as keyboards and electronic drum kits, and in general-purpose platforms like desktops and tablets or open-source hardware like Arduino and Raspberry Pi. From a conceptual point of view, this work highlights the advantages of a closer integration of machine learning with other subjects, especially in the endeavor of new product development. Exploiting the synergy between neural networks, digital signal processing techniques and physical modelling, we illustrate the proposed method via the implementation of two virtual instruments: a conventional grand piano and a hybrid string…
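As a rough illustration of the ensemble idea (all shapes and names below are assumptions, not the paper's architecture), each lightweight network can own one band of the spectrum and predict its magnitudes from simple note features, so the ensemble members evaluate in parallel.

# Illustrative ensemble of lightweight networks for spectral synthesis:
# each tiny MLP models one frequency band of a magnitude frame.
import numpy as np

class TinyMLP:
    def __init__(self, n_in, n_hidden, n_out, rng):
        self.w1 = rng.standard_normal((n_in, n_hidden)) * 0.1
        self.w2 = rng.standard_normal((n_hidden, n_out)) * 0.1

    def __call__(self, x):
        return np.maximum(x @ self.w1, 0.0) @ self.w2  # ReLU MLP forward pass

rng = np.random.default_rng(0)
n_bands, band_size = 8, 64
ensemble = [TinyMLP(2, 16, band_size, rng) for _ in range(n_bands)]

features = np.array([440.0 / 4000.0, 0.8])    # e.g. normalized pitch, velocity
spectrum = np.concatenate([net(features) for net in ensemble])  # one frame
print(spectrum.shape)                          # (512,) magnitude bins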
As is well known, polar codes have been chosen as the control channel codes for the enhanced mobile broadband (eMBB) scenario in 3GPP RAN1. Due to their excellent performance, polar codes have also attracted widespread attention in academia and industry. In general, polar codes can be decoded by two methods: successive cancellation (SC) and belief propagation (BP). However, compared with the family of SC algorithms, the error-correction performance of BP decoding is not satisfactory, even though it offers excellent parallel throughput. Hence, this paper proposes an efficient BP list (EBPL) decoding of polar codes which can raise the performance to the same level as successive cancellation list (SCL) decoding without sacrificing decoding throughput. With reasonable complexity, the proposed EBPL decoding achieves BER and FER comparable to SCL in simulation results.
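One common way to obtain a BP list decoder, sketched below in heavily simplified form, is to run several independent BP decoders (e.g. over permuted factor graphs) and keep the first candidate passing a validity check; this is a generic illustration, not necessarily the EBPL construction, and bp_decode and crc_check are toy placeholders.

# Generic BP-list wrapper: the decoders run independently, so the list
# dimension parallelizes without hurting throughput.
from concurrent.futures import ThreadPoolExecutor

def bp_decode(llrs, permutation):
    # Placeholder: a real implementation iterates min-sum updates over the
    # polar factor graph permuted by `permutation` and returns hard decisions.
    return [1 if l < 0 else 0 for l in llrs]

def crc_check(bits):
    return sum(bits) % 2 == 0        # toy stand-in for a real CRC

def bp_list_decode(llrs, permutations):
    with ThreadPoolExecutor() as pool:
        candidates = pool.map(lambda p: bp_decode(llrs, p), permutations)
    for bits in candidates:
        if crc_check(bits):
            return bits              # first candidate passing the check
    return None                      # decoding failure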
ISBN (print): 9781450365109
Different parallel frameworks for implementing data analysis applications have been proposed by the HPC and Big Data communities. In this paper, we investigate three task-parallel frameworks (Spark, Dask and RADICAL-Pilot) with respect to their ability to support data analytics on HPC resources, and compare them to MPI. We investigate the data analysis requirements of Molecular Dynamics (MD) simulations, which are significant consumers of supercomputing cycles and produce immense amounts of data: a typical large-scale MD simulation of a physical system of O(100k) atoms over microseconds can produce from O(10) GB to O(1000) GB of data. We propose and evaluate different approaches for the parallelization of a representative set of MD trajectory analysis algorithms, in particular the computation of path similarity and leaflet identification. We evaluate Spark, Dask and RADICAL-Pilot with respect to the abstractions and runtime engine capabilities needed to support these algorithms. We provide a conceptual basis for comparing and understanding the different frameworks that enables users to select the optimal system for each application, as well as a quantitative performance analysis of the different algorithms across the three frameworks.
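The all-pairs structure of path-similarity analysis maps directly onto such task-parallel frameworks; the Dask sketch below uses a toy Frobenius distance as a stand-in for a real path-similarity metric.

# Task-parallel all-pairs trajectory comparison with Dask: each pairwise
# distance is an independent delayed task, the structure that Spark, Dask
# and RADICAL-Pilot all exploit.
import numpy as np
import dask
from dask import delayed

@delayed
def pair_distance(traj_a, traj_b):
    return np.linalg.norm(traj_a - traj_b)      # placeholder metric

trajectories = [np.random.rand(100, 3) for _ in range(8)]   # toy MD paths
tasks = [pair_distance(trajectories[i], trajectories[j])
         for i in range(len(trajectories))
         for j in range(i + 1, len(trajectories))]

distances = dask.compute(*tasks)                # runs all tasks in parallel
print(len(distances), "pairwise distances")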
ISBN (digital): 9781510617421
ISBN (print): 9781510617421
The discrete wavelet transform can be found at the heart of many image-processing algorithms. Until now, the transform on general-purpose processors (CPUs) was mostly computed using a separable lifting scheme. As the lifting scheme consists of a small number of operations, it is preferred for processing on single-core CPUs. However, for parallel processing on multi-core processors, this scheme is inappropriate due to its large number of steps. On such architectures, the number of steps corresponds to the number of synchronization points at which data are exchanged, and these points often form a performance bottleneck. Our approach appropriately rearranges the calculations inside the transform and thereby reduces the number of steps. In other words, we propose a new scheme that is friendly to parallel environments. When evaluating on multi-core CPUs, we consistently outperform the original lifting scheme. The evaluation was performed on a 61-core Intel Xeon Phi and 8-core Intel Xeon processors.
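For reference, one level of the separable CDF 5/3 lifting scheme (the baseline whose synchronization steps the proposed scheme reduces, not the new scheme itself) can be written as follows; boundaries are handled by periodic extension for brevity.

# One level of CDF 5/3 lifting along a row. Each step needs its
# predecessor's full output, and those dependencies are the
# synchronization points that hurt multi-core scaling.
import numpy as np

def cdf53_forward(x):
    s, d = x[0::2].astype(float), x[1::2].astype(float)
    d -= 0.5 * (s + np.roll(s, -1))      # predict: detail coefficients
    s += 0.25 * (np.roll(d, 1) + d)      # update: approximation coefficients
    return s, d

approx, detail = cdf53_forward(np.arange(16))
print(approx, detail)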
ISBN (print): 9781538673089
Graph coloring is an important problem in computer science and engineering with numerous applications. As data sizes increase, graphs with millions of nodes are becoming commonplace. Parallel graph coloring algorithms on high-throughput graphics processing units (GPUs) have recently been proposed to color such large graphs efficiently. We present two new graph coloring algorithms for GPUs which improve upon existing algorithms in both coloring speed and quality. The first algorithm, counting-based Jones-Plassmann (CJP), uses counters to implement the classic Jones-Plassmann parallel coloring heuristic in a work-efficient manner. The second algorithm, conflict coloring (CC), achieves higher parallelism than CJP and is based on optimistically coloring the graph using estimates of the chromatic number. We compared CC and CJP with two state-of-the-art GPU coloring algorithms, csrcolor [1] and Deveci et al.'s [2] vertex/edge-based algorithms (which we call VEB), as well as the sequential CPU algorithm ColPack [3]. In terms of coloring quality, CJP and CC are both far better than csrcolor, while CJP uses 10% fewer colors than VEB on average and CC uses 10% more. Compared to ColPack, CJP and CC use 1.3x and 1.5x more colors on non-bipartite graphs, respectively. In terms of speed, CJP is on average 1.5-2x faster than the other algorithms, while CC is 2.7-4.3x faster.
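The round structure of the Jones-Plassmann heuristic behind CJP is easy to state; the sequential Python sketch below shows the logic only, not the counter-based GPU kernel.

# Round-based Jones-Plassmann coloring: in each round, every vertex whose
# random priority beats all of its uncolored neighbors takes the smallest
# color absent from its neighborhood. On a GPU, all such vertices run in
# parallel within a round.
import random

def jones_plassmann(adj):
    n = len(adj)
    priority = [random.random() for _ in range(n)]
    color = [None] * n
    while any(c is None for c in color):
        for v in range(n):
            if color[v] is not None:
                continue
            if all(color[u] is not None or priority[v] > priority[u]
                   for u in adj[v]):
                used = {color[u] for u in adj[v] if color[u] is not None}
                color[v] = next(c for c in range(n) if c not in used)
    return color

adj = {0: [1, 2], 1: [0, 2], 2: [0, 1, 3], 3: [2]}   # small test graph
print(jones_plassmann(adj))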
ISBN (digital): 9783030050573
ISBN (print): 9783030050573; 9783030050566
This paper presents new multi-objective scheduling strategies implemented in Docker SwarmKit, a container toolkit for orchestrating distributed systems at any scale. Currently, Docker SwarmKit has one scheduling strategy, called Spread, which uses only a single objective to select, from a set of cloud nodes, the node on which to execute a container. However, the containers that users submit to Docker SwarmKit for scheduling are configured according to multiple criteria, such as the number of CPUs and the memory size. To better address this multi-objective configuration problem, we introduce the concept and implementation of new multi-objective scheduling strategies adapted to Cloud Computing environments and implemented in Docker SwarmKit. The principle of our multi-objective strategies is to select a node that offers a good compromise among the criteria for executing a container. The proposed scheduling strategies are based on a combination of the PROMETHEE and Kung multi-objective decision algorithms for placing containers. Our implementation in Docker SwarmKit and experiments with the new strategies demonstrate the potential of the approach under different scenarios.
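A hedged sketch of the kind of selection such strategies perform (a simplified outranking in the spirit of PROMETHEE, not the authors' SwarmKit code): score node pairs by weighted preferences over free CPU and memory, then place the container on the node with the best net flow.

# Simplified PROMETHEE-style ranking of candidate nodes: for every ordered
# node pair, sum weighted preferences over the criteria, then rank by net
# flow (outgoing minus incoming preference).
def net_flows(nodes, weights):
    names = list(nodes)
    flow = {name: 0.0 for name in names}
    for a in names:
        for b in names:
            if a == b:
                continue
            pref = sum(w * (nodes[a][c] > nodes[b][c])
                       for c, w in weights.items())
            flow[a] += pref          # a outranks b
            flow[b] -= pref          # b is outranked by a
    return flow

nodes = {"node1": {"free_cpus": 4, "free_mem_gb": 8},
         "node2": {"free_cpus": 2, "free_mem_gb": 16}}
weights = {"free_cpus": 0.5, "free_mem_gb": 0.5}

flows = net_flows(nodes, weights)
print(max(flows, key=flows.get), "wins the placement")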