检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

3,174 篇 会议
72 篇 期刊文献
65 册 图书

馆藏范围

3,310 篇 电子文献
1 种 纸本馆藏

日期分布

学科分类号

2,346 篇 工学
- 2,065 篇 计算机科学与技术...
- 1,041 篇 软件工程
- 415 篇 电气工程
- 329 篇 信息与通信工程
- 311 篇 电子科学与技术（可...
- 114 篇 控制科学与工程
- 69 篇 机械工程
- 67 篇 光学工程
- 67 篇 生物工程
- 62 篇 生物医学工程（可授...
- 36 篇 动力工程及工程热...
- 33 篇 仪器科学与技术
- 32 篇 材料科学与工程（可...
- 32 篇 建筑学
- 30 篇 化学工程与技术
- 24 篇 土木工程
- 21 篇 力学（可授工学、理...
726 篇 理学
- 485 篇 数学
- 174 篇 物理学
- 80 篇 生物学
- 65 篇 系统科学
- 61 篇 统计学（可授理学、...
- 37 篇 化学
249 篇 管理学
- 161 篇 管理科学与工程(可...
- 102 篇 图书情报与档案管...
- 71 篇 工商管理
64 篇 医学
- 53 篇 临床医学
- 21 篇 基础医学(可授医学...
22 篇 法学
- 20 篇 社会学
22 篇 农学
- 19 篇 作物学
16 篇 经济学
12 篇 文学
11 篇 教育学
4 篇 军事学

主题

329 篇 parallel process...
204 篇 computer archite...
203 篇 graphics process...
158 篇 parallel archite...
135 篇 parallel process...
123 篇 parallel algorit...
121 篇 graphics process...
115 篇 hardware
113 篇 image processing
86 篇 concurrent compu...
86 篇 computational mo...
77 篇 signal processin...
72 篇 parallel program...
72 篇 field programmab...
69 篇 multicore proces...
68 篇 instruction sets
67 篇 parallel computi...
65 篇 algorithm design...
58 篇 throughput
57 篇 gpu

机构

9 篇 college of compu...
9 篇 natl univ def te...
8 篇 carleton univ sc...
8 篇 national laborat...
6 篇 hosei univ dept ...
6 篇 inria rennes
6 篇 st francis xavie...
5 篇 chinese acad sci...
5 篇 univ aizu dept c...
5 篇 polish japanese ...
5 篇 computer science...
5 篇 college of compu...
5 篇 city university ...
4 篇 shanghai jiao to...
4 篇 charles univ pra...
4 篇 rwth aachen univ...
4 篇 hainan internati...
4 篇 department of co...
4 篇 university of ch...
4 篇 universidad carl...

作者

11 篇 jack dongarra
10 篇 roman wyrzykowsk...
8 篇 dongarra jack
7 篇 liu jie
7 篇 konrad karczewsk...
7 篇 quintana-orti en...
6 篇 hannig frank
6 篇 li dongsheng
6 篇 teich juergen
6 篇 li chao
6 篇 nakano koji
6 篇 peng shietung
6 篇 li yamin
6 篇 chu wanming
6 篇 krulis martin
5 篇 zhang lei
5 篇 ito yasuaki
5 篇 li kenli
5 篇 wanlei zhou
5 篇 tudruj marek

语言

3,216 篇 英文
83 篇 其他
20 篇 中文

检索条件"任意字段=5th International Conference on Algorithms and Architectures for Parallel Processing"

共 3311 条记录，以下是1331-1340 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

Performance analysis for job scheduling in hierarchical HPC systems: A coloured petri nets method 15th

Performance analysis for job scheduling in hierarchical HPC ...

引用

15th international conference on algorithms and architectures for parallel processing, ICA3PP 2015

作者： Li, Zhijia Jiao, Li Hu, Xiang State Key Laboratory of Computer Science Institute of Software Chinese Academy of Sciences Beijing100190 China University of Chinese Academy of Sciences Beijing100049 China

ISBN: (纸本)9783319271392

Distributed computing technology has been widely used to solve complex problems appearing in parallel processing systems. Job scheduling is very important in many distributed computing systems, like grid systems and high performance computers. their performance is directly related to the efficiency of the distributed computing systems. Modeling them and analyzing their performance can provide quantitative performance metrics and predictions, which are helpful to guide capacity planning and scheduling optimization. In this paper, we study job scheduling systems widespread in high performance computing systems and propose a coloured Petri net method for analyzing their performance, which can be easily implemented in CPN software by potential users. We also propose an approximative modeling technique so as to reduce the model size. As a model-based performance analysis method, our method is low cost and highly flexible. Experimental results show that our method is feasible and can be applied to more complex and large-scale systems. © Springer international Publishing Switzerland 2015.

关键词： parallel processing systems

来源：评论

学校读者我要写书评

暂无评论

Improving performance of floating point division on GPU and MIC 1

引用

15th international conference on algorithms and architectures for parallel processing, ICA3PP 2015

作者： Huang, Kun Chen, Yifeng Department of Computer Science School of EECS Peking University Beijing100871 China

ISBN: (数字)9783319271224

ISBN: (纸本)9783319271217

Floating point computing ability is an important concern in high performance scientific application and engineering computing. Although as a fundamental operation, floating point division (or reciprocal) has long been much less efficiency compared with addition and multiplication. architectures like GPU and MIC even have no instruction for such division in the instruction level. this paper proposes a fast approximation algorithm to estimate the division of floating point numbers in IEEE 754 format based on existing instructions which in most cases are accurate enough for practical computing. It consists of a predicting step and an iterating step like most iterative numerical algorithm. the predicting step makes use of the property of IEEE 754 format to calculate estimation by only one integer subtraction instruction. the iterating step improves the accuracy by fast iterations in about ten instructions. this new algorithm is extremely easy to implement and shows a great performance in practical experiments. © Springer international Publishing Switzerland 2015.

关键词： Graphics processing unit

来源：评论

学校读者我要写书评

暂无评论

An Improved Algorithm for Querying Encrypted Data in the Cloud 5

An Improved Algorithm for Querying Encrypted Data in the Clo...

引用

5th international conference on Communication Systems and Network Technologies (CSNT)

作者： Shastri, Samraddhi Kresman, Ray Lee, Jong Kwan Bowling Green State Univ Dept Comp Sci Bowling Green OH 43403 USA

ISBN: (纸本)9781479917976

Organizations have begun outsourcing management of their data to third party cloud service providers after the introduction of Database as a Service (DAS) model. A cloud database is a database that typically runs on a cloud computing platform, such as Amazon EC2, GoGrid, Salesforce and Rackspace. But outsourcing the data raises concerns over privacy. A typical solution is to store databases in encrypted form on the remote server. Queried records are downloaded from the server and decrypted for further processing. Bucketization is one technique for executing queries over encrypted data on a DAS server. this paper is an extension to work done by other researchers [1-4]. Query Optimal Bucketization (QOB) algorithm [1-2] divides the server data into buckets subject to an optimality constraint. In an earlier paper [3], the authors proposed Binary Query Bucketization (BQB) to improve the search time for bucketized datasets and reduce the number of records that are processed by QOB. In this paper, we propose a parallel Binary Query Bucketization (PBQB) algorithm to query records located in the DAS. It integrates parallel search [4] and BQB. parallel search divides the search workload into chunks with each thread/processor working on a chunk. Simulation is used to assess the numerical performance of PBQB. It is shown that the proposed algorithm outperforms BQB.

关键词： parallel Bucketization Encrypted Database Database as a Service

来源：评论

学校读者我要写书评

暂无评论

Optimal Performance Prediction of ADAS algorithms on Embedded parallel architectures

Optimal Performance Prediction of ADAS Algorithms on Embedde...

引用

IEEE international conference on High Performance Computing and Communications (HPCC)

作者： Romain Saussard Boubker Bouzid Marius Vasiliu Roger Reynaud Renault S.A.S Guyancourt France Instutut d'Electronique Fondamentale Université Paris Sud Orsay France

ADAS (Advanced Driver Assistance Systems) algorithms increasingly use heavy image processing operations. To embed this type of algorithms, semiconductor companies offer many heterogeneous architectures. these SoCs (System on Chip) are composed of different processing units, with different capabilities, and often with massively parallel computing unit. Due to the complexity of these SoCs, predicting if a given algorithm can be executed in real time on a given architecture is not trivial. In fact it is not a simple task for automotive industry actors to choose the most suited heterogeneous SoC for a given application. Moreover, embedding complex algorithms on these systems remains a difficult task due to heterogeneity, it is not easy to decide how to allocate parts of a given algorithm on the different computing units of a given SoC. In order to help automotive industry in embedding algorithms on heterogeneous architectures, we propose a novel approach to predict performances of image processing algorithms applicable on different types of computing units. Our methodology is able to predict a more or less wide interval of execution time with a degree of confidence using only high level description of algorithms, and a few characteristics of computing units.

关键词： Kernel Computer architecture Image processing Graphics processing units Prediction algorithms parallel processing Computational modeling

来源：评论

学校读者我要写书评

暂无评论

parallel column subset selection of kernel matrix for scaling up support vector machines 15th

Parallel column subset selection of kernel matrix for scalin...

引用

15th international conference on algorithms and architectures for parallel processing, ICA3PP 2015

作者： Wu, Jiangang Feng, Chang Gao, Peihuan Liao, Shizhong School of Computer Science and Technology Tianjin University Tianjin300072 China

ISBN: (纸本)9783319271361

Nyström method and low-rank linearized Support Vector Machines (SVMs) are two widely used methods for scaling up kernel SVMs, both of which need to sample part of columns of the kernel matrix to reduce the size. However, existing non-uniform sampling methods suffer from at least quadratic time complexity in the number of training data, limiting the scalability of kernel SVMs. In this paper, we pro- pose a parallel sampling method called parallel column subset selection (PCSS) based on the divide-and-conquer strategy, which divides the kernel matrix into several small submatrices and then selects columns in parallel. We prove that PCSS has a (1+ϵ) relative-error upper bound with respect to the kernel matrix. Further, we present two approaches to scaling up kernel SVMs by combining PCSS with Nyström method and lowrank linearized SVMs. the results of comparison experiments demonstrate the effectiveness, efficiency and scalability of our approaches. © Springer international Publishing Switzerland 2015.

关键词： Support vector machines

来源：评论

学校读者我要写书评

暂无评论

A cyber physical system with gpu for CNC applications 15th

A cyber physical system with gpu for CNC applications

引用

15th international conference on algorithms and architectures for parallel processing, ICA3PP 2015

作者： Chang, Jen-Chieh Chien, Ting-Hsuan Chang, Rong-Guey Department of Computer Science and Information Engineering National Chung Cheng University Chiayi62102 Taiwan

ISBN: (纸本)9783319271361

In this paper, we parallelize the collision detection of five- axis machining as an example to show how to execute CNC applications on Graphics processing Unit (GPU). We first design and implement an efficient collision detection tool, including the kinematics analyses for five-axis motions, separating axis method for collision detection, and computer simulation for verification. the machine structure is modeled as STL format in CAD software. the input to the detection system is the g-code part program, which describes the tool motions to produce the part surface. then the g-code will be partitioned and be executed by our collision detection tool in parallel on Graphics processing Unit (GPU). the system simulates the five-axis CNC motion for tool trajectory and detects any collisions according to the input g-codes. the result shows that our method can improve the performance of computational efficiency significantly when comparing to the conventional detection method. © Springer international Publishing Switzerland 2015.

关键词： Graphics processing unit

来源：评论

学校读者我要写书评

暂无评论

Shortest-Path Queries in Planar Graphs on GPU-Accelerated architectures 10th

Shortest-Path Queries in Planar Graphs on GPU-Accelerated Ar...

引用

10th international conference on Large-Scale Scientific Computations (LSSC)

作者： Chapuis, Guillaume Djidjev, Hristo Los Alamos Natl Lab POB 1663 Los Alamos NM 87545 USA

ISBN: (纸本)9783319265209;9783319265193

We develop an efficient parallel algorithm for answering shortest-path queries in planar graphs and implement it on a multi-node CPU-GPU clusters. the algorithm uses a divide-and-conquer approach for decomposing the input graph into small and roughly equal subgraphs and constructs a distributed data structure containing shortest distances within each of those subgraphs and between their boundary vertices. For a planar graph with n vertices, that data structure needs O(n) storage per processor and allows queries to be answered in O(n(1/4)) time.

关键词： Shortest path problems Graph algorithms Distributed computing GPU computing Graph partitioning

来源：评论

学校读者我要写书评

暂无评论

parallel implementation of dense optical flow computation on many-core processor 1

引用

15th international conference on algorithms and architectures for parallel processing, ICA3PP 2015

作者： Chen, Wenjie Yu, Jin Zhang, Weihua Jiang, Linhua Zhang, Guanhua Chai, Zhilei MoE Engineering Research Center for Software/Hardware Co-design Technology and Application East China Normal University Shanghai200061 China School of IoT Engineering Jiangnan University Wuxi214122 China Parallel Processing Institute Fudan University Shanghai200433 China Shanghai Key Lab of Modern Optical Systems University of Shanghai for Science and Technology Shanghai200093 China

ISBN: (数字)9783319271194

ISBN: (纸本)9783319271187

Computation of optical flow is a fundamental step in computer vision applications. However, due to its high complexity, it is difficult to compute a high-accuracy optical flow field in real time. this paper proposes a parallel computing approach for fast computation of high-accuracy optical flow field. It is specially designed for Tilera, a typical many-core processor with 36 tiles. By efficiently exploiting the advantages of the mesh architecture of Tilera, and by appropriately handling the parallelism inherent in the optical flow computation, the proposed implemention is able to significantly reduce the computation time while keep a low power consumption. Experiment shows that, for a 640×480 image, the computation time is only 0. 80 seconds per frame. It is 2. 56 times faster than on a typical CPU i3-3240 (3. 4GHz), and the power consumption as less as 1/6. Experimental results also show that the proposed parallel approach is highly scalable for variable requirements on computation speeds and power consumptions, since it can flexibly selects a proper number of computing cores. © Springer international Publishing Switzerland 2015.

关键词： parallel processing systems

来源：评论

学校读者我要写书评

暂无评论

Low-Area and Low-Power Reconfigurable Architecture for Convolution-Based 1-D DWT using 9/7 and 5/3 Filters 28

Low-Area and Low-Power Reconfigurable Architecture for Convo...

引用

28th international conference on VLSI Design (VLSID) / 14th international conference on Embedded Systems

作者： Meher, Pramod Kumar Mohanty, Basant Kumar Swamy, M. N. S. Nanyang Technol Univ Sch Comp Engn Nanyang Ave Singapore 639798 Singapore Jaypee Univ Engn & Technol Raghogarh Madhy Pradesh India Concordia Univ Dept Elect & Comp Engn Montreal PQ H3G 2W1 Canada

ISBN: (纸本)9781479966585

this paper presents an optimized adder-based formulation for low-area and low-power implementation of 1-D DWT using 5/3 and 9/7 filters. Not only the number of adders is minimized, the number bit-shifts also minimized in the formulation to reduce the bit-width of intermediate results. Separate Adder-based designs are derived using the proposed formulation for 9/7 filter, 5/3 filter and a reconfigurable structure for both 9/7 and 5/3 filters. the proposed structure for 9/7 filter requires 19 adders and 11 hardwired-shifters (shifters are implemented by rewiring only) and computes two DWT components in every clock cycle. It requires only 8 registers for two-stage pipeline implementation. the proposed reconfigurable structure involves a small overhead of complexity in terms of one adder, 2 MUXes, 2 registers, and 4 extra hardwired-shifters than the proposed 9/7 structure to have the reconfigurable design. the proposed reconfigurable structure supports higher usable frequency (without pipelining), and provides double the throughput per clock cycle compared to that of best available similar structure with marginally higher area complexity. ASIC synthesis results show that the proposed pipelined structure for 9/7 filters involves nearly 70% less ADP and 82% less EPO than the best of DA-based structures. Further, it involves less than half the ADP and 47% less EPO than the corresponding recent multiplier-based structure. the proposed reconfigurable structure involves less than one-third the EPO and ADP of similar existing structure. the proposed design indicates the superiority of adder-based design over DA-based design as well as conventional multiplier-based design.

关键词： adders application specific integrated circuits convolution discrete wavelet transforms distributed arithmetic high-pass filters low-pass filters low-power electronics reconfigurable architectures shift registers 5-3 filters 9-7 filters ADP ASIC synthesis DA-based structure design EPO MUX adder-based formulation design optimization bit-shift minimization clock cycle complexity overhead convolution-based 1-D DWT discrete wavelet transform hardwired-shifters high-pass filter higher usable frequency intermediate bit-width reduction low-area reconfigurable structure architecture low-pass filter low-power reconfigurable structure architecture multiplier-based structure design registers two-stage pipeline structure implementation Adders Clocks Discrete wavelet transforms Periodic structures Pipeline processing Registers throughput Discrete Wavelet Transform VLSI discrete wavelet transform adders distributed algorithms Registers Low pass filters clock cycle high-pass filters Pipeline processing application specific integrated circuits Reconfigurable architectures Convolution MULTIPLEXOR Adenosine Diphosphate Automatic data processing European Patent Office

来源：评论

学校读者我要写书评

暂无评论

parallel bloom filter on xeon phi many-core processors 1

引用

15th international conference on algorithms and architectures for parallel processing, ICA3PP 2015

作者： Ni, Sheng Guo, Rentong Liao, Xiaofei Jin, Hai Services Computing Technology and System Lab Cluster and Grid Computing Lab School of Computer Science and Technology Huazhong University of Science and Technology Wuhan430074 China

ISBN: (数字)9783319271224

ISBN: (纸本)9783319271217

Bloom filters are widely used in databases and network areas. these filters facilitate efficient membership checking with a low false positive ratio. It is a way to improve the throughput of bloom filter by parallel processing. Common many-core processors such as Xeon Phi can provide high parallelism. thus, we build an iterative model to analyze memory access performance. this performance suggests that the bottleneck in the traditional design is mainly caused by synchronization cost and memory latency on many-core platforms. therefore, we propose a parallel bloom filter (PBF), which is a lockless method involving input data preprocessing. this method reduces synchronization overhead and improves cache locality. We also implement and evaluate PBF on a Xeon Phi processor. Results show that the memory access performance is three times better than that of the counting bloom filter. PBF provides improved scalability, and the speedup ratio can reach a maximum of 80.7x. © Springer international Publishing Switzerland 2015.

关键词： Scalability

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共332页 << < 130 131 132 133 134 135 136 137 138 139 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：