ISBN (print): 9781450380751
Domain-specific languages that execute image processing pipelines on GPUs, such as Halide and Forma, operate by 1) dividing the image into overlapped tiles, and 2) fusing loops to improve memory locality. However, current approaches have limitations: 1) they require intra-thread-block synchronization, which has a nontrivial cost, 2) they must choose between small tiles that require more overlapped computation or large tiles that increase shared memory accesses (and lower occupancy), and 3) their autoscheduling algorithms use simplified GPU models that can result in inefficient global memory accesses. We present a new approach for executing image processing pipelines on GPUs that addresses these limitations as follows. 1) We fuse loops to form overlapped tiles that fit in a single warp, which allows us to use lightweight warp synchronization. 2) We introduce hybrid tiling, which stores overlapped regions in a combination of thread-local registers and shared memory. Hybrid tiling thus either increases occupancy by decreasing shared memory usage or decreases overlapped computation by enabling larger tiles. 3) We present an automatic loop fusion algorithm that considers several factors affecting the performance of GPU kernels. We implement these techniques in PolyMage-GPU, a new GPU backend for PolyMage. Our approach produces code that is faster than Halide's manual schedules: 1.65x faster on an NVIDIA GTX 1080Ti and 1.33x faster on an NVIDIA Tesla V100.
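The small-tile/large-tile tension described above can be made concrete with a toy cost model. The sketch below is illustrative only (it is not PolyMage-GPU's model): for a pipeline of fused stencil stages, the input tile must be enlarged by a halo on each side, so the fraction of redundant (overlapped) work grows as output tiles shrink. The function names and the 1-D setting are assumptions for illustration.

```python
# Toy model of overlapped tiling for a 1-D pipeline of fused stencil stages.
# Fusing `stages` stages of radius `radius` means each output tile of width
# `tile` must read a halo of `stages * radius` extra pixels on each side.

def overlapped_tile_width(tile, stages, radius):
    """Input pixels needed to produce `tile` output pixels after fusion."""
    return tile + 2 * stages * radius

def redundancy(tile, stages, radius):
    """Fraction of the loaded input tile that is halo (overlapped) work."""
    full = overlapped_tile_width(tile, stages, radius)
    return (full - tile) / full

# A warp-sized tile pays proportionally more overlapped computation
# than a large tile, which is exactly the trade-off hybrid tiling targets.
small = redundancy(tile=32, stages=3, radius=1)
large = redundancy(tile=256, stages=3, radius=1)
assert small > large
```

Hybrid tiling sidesteps this trade-off by splitting the overlapped region between registers and shared memory, rather than forcing a single choice of tile size.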
High-Performance Computing (HPC) is a fundamental tool for improving the performance of many algorithms in terms of time, especially for large-scale problems. In recent years, various HPC architectures have been dev...
ISBN (print): 9789897584244
Edge computing extends cloud computing capabilities to the edge of the network, allowing, for instance, Internet-of-Things (IoT) applications to process computation more locally and thus more efficiently. We aim to minimize latency and delay in edge architectures. We focus on an advanced architectural setting that takes communication and processing delays into account in addition to the actual request execution time in a performance engineering scenario. Our architecture is based on a multi-cluster edge layer with local, independent edge node clusters. We argue that particle swarm optimization, as a bio-inspired optimization approach, is an ideal candidate for distributed load processing in semi-autonomous edge clusters for IoT management. By designing a controller and using a particle swarm optimization algorithm, we demonstrate that processing and propagation delay, as well as end-to-end latency (i.e., total response time), can be optimized.
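To illustrate the optimization approach the abstract names, here is a minimal particle swarm optimizer applied to a toy end-to-end latency function. This is a generic textbook PSO, not the paper's controller; the latency model (a quadratic in the load split across two hypothetical edge clusters) and all parameter values are assumptions for illustration.

```python
import random

def pso(cost, dim, n_particles=20, iters=100, w=0.7, c1=1.5, c2=1.5, seed=0):
    """Minimal particle swarm optimization: each particle is pulled toward
    its personal best and the global best position found so far."""
    rng = random.Random(seed)
    pos = [[rng.uniform(-5, 5) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_cost = [cost(p) for p in pos]
    g = min(range(n_particles), key=lambda i: pbest_cost[i])
    gbest, gbest_cost = pbest[g][:], pbest_cost[g]
    for _ in range(iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                pos[i][d] += vel[i][d]
            c = cost(pos[i])
            if c < pbest_cost[i]:
                pbest[i], pbest_cost[i] = pos[i][:], c
                if c < gbest_cost:
                    gbest, gbest_cost = pos[i][:], c
    return gbest, gbest_cost

# Toy delay model: splitting load fraction x[0] between two edge clusters
# with different processing + propagation costs (illustrative only).
def latency(x):
    return 2.0 * x[0] ** 2 + 3.0 * (1.0 - x[0]) ** 2

best, best_cost = pso(latency, dim=1)
```

On this convex toy model the analytic optimum is a 0.6/0.4 split with latency 1.2, which the swarm converges to within a few dozen iterations.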
ISBN (print): 9783030602482; 9783030602475
Tangle is a novel directed acyclic graph (DAG)-based distributed ledger preferred over traditional linear ledgers in blockchain applications because of better transaction throughput. Earlier techniques have mostly focused on comparing the performance of graph chains over linear chains and incorporating the Markov Chain Monte Carlo process in probabilistic traversals to detect unverified transactions in DAG chains. In this paper, we present a parallel detection method for unverified transactions. Experimental evaluation of the proposed parallel technique demonstrates a significant, scalable average speed-up of close to 70%, and a peak speed-up of approximately 73% for a large number of transactions.
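The core task the abstract describes, detecting unverified transactions in a DAG ledger, can be sketched as a data-parallel scan: a transaction is unverified (a "tip") if no other transaction approves it. The sketch below is a simplified stand-in for the paper's technique; the tangle representation (a dict mapping each transaction to the transactions it approves) and the thread-based partitioning are assumptions for illustration.

```python
from concurrent.futures import ThreadPoolExecutor

def find_unverified(approvals, workers=4):
    """approvals: dict tx -> list of transactions that tx approves.
    Partition the transaction set across workers; each worker collects the
    set of approved transactions in its chunk, and the union of those sets
    identifies the unverified tips by complement."""
    txs = list(approvals)
    chunks = [txs[i::workers] for i in range(workers)]

    def scan(chunk):
        seen = set()
        for tx in chunk:
            seen.update(approvals[tx])
        return seen

    approved = set()
    with ThreadPoolExecutor(max_workers=workers) as ex:
        for s in ex.map(scan, chunks):
            approved |= s
    return {tx for tx in approvals if tx not in approved}

# Tiny tangle: genesis g; c and d approve earlier txs but are themselves
# unapproved, so they are the tips.
tangle = {"g": [], "a": ["g"], "b": ["g"], "c": ["a", "b"], "d": ["a"]}
tips = find_unverified(tangle)
```

The scan over each chunk is independent, which is what makes the detection step embarrassingly parallel; in a real setting process-based workers or GPU kernels would replace the thread pool.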
ISBN (print): 9783030602451; 9783030602444
Vehicle Routing Problems (VRPs) are well-known combinatorial optimization problems used to design an optimal route for a fleet of vehicles to service a set of customers under a number of constraints. Due to their NP-hard complexity, a number of purely computational techniques have been proposed in recent years to solve them. Among these techniques, nature-inspired algorithms have proven their effectiveness in terms of accuracy and convergence speed. Some of these methods are also designed to decompose the basic problem into a number of sub-problems which are subsequently solved in parallel computing environments. The purpose of this paper is therefore to review the recent literature on the main approaches proposed over the past few years to solve combinatorial optimization problems in general and, in particular, the VRP and its different variants. Bibliometric and review studies are conducted, with special attention paid to metaheuristic strategies involving procedures with parallel architectures. The obtained results show an expansion in the use of parallel algorithms for solving various VRPs. Nevertheless, the regression in the number of citations in this framework suggests that the interest of the research community has declined somewhat in recent years. This decline may be explained by the lack of rigorous mathematical results and of practical interfaces in well-known computational software.
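The decomposition strategy mentioned above, splitting a VRP into sub-problems solved in parallel, can be illustrated with a classic cluster-first, route-second scheme. This is a generic sweep-style heuristic, not any specific method from the surveyed literature; the function names and the greedy nearest-neighbour routing are assumptions for illustration.

```python
import math
from concurrent.futures import ThreadPoolExecutor

def nearest_neighbour_route(depot, customers):
    """Greedy sub-problem solver: repeatedly visit the closest
    unvisited customer, starting from the depot."""
    route, remaining, cur = [], list(customers), depot
    while remaining:
        nxt = min(remaining, key=lambda c: math.dist(cur, c))
        route.append(nxt)
        remaining.remove(nxt)
        cur = nxt
    return route

def sweep_and_route(depot, customers, vehicles=2):
    """Cluster-first, route-second: sort customers by polar angle around
    the depot, cut the sweep into contiguous sectors (one per vehicle),
    then route each sector as an independent, parallel sub-problem."""
    by_angle = sorted(customers,
                      key=lambda c: math.atan2(c[1] - depot[1], c[0] - depot[0]))
    size = -(-len(by_angle) // vehicles)  # ceiling division
    clusters = [by_angle[i:i + size] for i in range(0, len(by_angle), size)]
    with ThreadPoolExecutor() as ex:
        return list(ex.map(lambda cl: nearest_neighbour_route(depot, cl),
                           clusters))

routes = sweep_and_route((0, 0), [(1, 1), (2, 0), (0, 2), (-1, 1)])
```

Each sector's routing is independent of the others, which is what makes the decomposition amenable to parallel architectures; real solvers replace the greedy router with a metaheuristic per sub-problem.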
ISBN (print): 9783030715939; 9783030715922
Scientific workflows are increasingly important for complex scientific applications. Recently, Function as a Service (FaaS) has emerged as a platform for processing non-interactive tasks. FaaS offerings (such as AWS Lambda and Google Cloud Functions) can play an important role in processing scientific workflows, and a number of works have demonstrated their ability to do so. However, some issues arise when workflows are executed on cloud functions due to their limits (e.g., stateless behaviour). A major issue is the additional data transfer between object storage and the FaaS invocation environment during execution, which leads to increased communication costs. DEWE v3 is a Workflow Management System (WMS) that already has foundations for processing workflows with cloud functions. In this paper, we modify the job dispatch algorithm of DEWE v3 in a function environment to reduce data dependency transfers. Our modified algorithm schedules jobs with precedence constraints to be executed in a single function invocation, so that later jobs can utilise output files generated by their predecessor jobs in the same invocation. This reduces the makespan of workflow execution. We have evaluated the improved scheduling algorithm and the original with small- and large-scale Montage workflows. The experimental results show that our algorithm reduces the overall makespan by about 10% compared to the original DEWE v3.
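The dispatch idea described above, co-locating precedence-constrained jobs in one invocation so intermediate files never round-trip through object storage, can be sketched as chain grouping over the workflow DAG. This is an illustrative reconstruction, not DEWE v3's actual algorithm; the DAG encoding (job -> list of predecessors) and the linearity condition are assumptions.

```python
def group_chains(deps):
    """deps: dict job -> list of predecessor jobs.
    Merge maximal linear chains (each job has exactly one successor, and
    that successor has exactly one predecessor) into a single group, so
    one function invocation can run the whole chain and pass intermediate
    files through local storage instead of object storage."""
    successors = {j: [] for j in deps}
    for j, preds in deps.items():
        for p in preds:
            successors[p].append(j)

    def is_head(j):
        # A job starts a chain unless it is the sole, linear successor
        # of its single predecessor (then it belongs to that chain).
        return not (len(deps[j]) == 1 and len(successors[deps[j][0]]) == 1)

    groups = []
    for j in deps:
        if not is_head(j):
            continue
        chain = [j]
        while len(successors[chain[-1]]) == 1:
            nxt = successors[chain[-1]][0]
            if len(deps[nxt]) != 1:
                break  # fan-in: nxt needs another invocation's output
            chain.append(nxt)
        groups.append(chain)
    return groups

# Diamond with a tail: a -> (b, c) -> d -> e.
# Only d -> e is a linear chain, so d and e share one invocation.
deps = {"a": [], "b": ["a"], "c": ["a"], "d": ["b", "c"], "e": ["d"]}
groups = group_chains(deps)
```

Fan-out and fan-in points still cross invocation boundaries (their data must go through object storage), but every purely linear segment collapses into one invocation, which is where the transfer savings come from.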
ISBN (print): 9783030389611; 9783030389604
Parallel data platforms are recognized as a key solution for processing analytical queries running on extremely large data warehouses (DWs). Deploying a DW on such platforms requires efficient data partitioning and allocation techniques. Most of these techniques assume a priori knowledge of the workload. To deal with workload evolution, reactive strategies are mainly used. The BI 2.0 requirements have put large batch and ad-hoc user queries at the center. Consequently, reactive solutions for deploying a DW on parallel platforms are no longer sufficient. Autonomic computing has emerged as a paradigm that allows digital objects to manage themselves in accordance with high-level guidance by means of proactive approaches. Inspired by this paradigm, we propose in this paper a proactive approach, based on a query clustering model, for deploying a DW over a parallel platform. The query clustering triggers the partitioning and allocation processes by considering only evolved query groups. Intensive experiments were conducted to show the efficiency of our proposal.
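A query clustering model of the kind the abstract relies on can be sketched with a simple greedy scheme: represent each query by the set of attributes it accesses, and group queries whose attribute sets are similar, so that repartitioning is triggered only when a group evolves. This is an illustrative stand-in for the paper's model; the Jaccard similarity, the threshold, and the workload are assumptions.

```python
def jaccard(a, b):
    """Set similarity in [0, 1]; 1.0 for two empty sets by convention."""
    return len(a & b) / len(a | b) if a | b else 1.0

def cluster_queries(queries, threshold=0.5):
    """queries: dict query_id -> set of attributes the query accesses.
    Greedy clustering: a query joins the first cluster whose representative
    attribute set is similar enough, otherwise it seeds a new cluster.
    A newly seeded cluster is an 'evolved group' that would trigger the
    partitioning/allocation process for just that group."""
    clusters = []  # list of (representative_attr_set, member_query_ids)
    for qid, attrs in queries.items():
        for rep, members in clusters:
            if jaccard(rep, attrs) >= threshold:
                members.append(qid)
                break
        else:
            clusters.append((set(attrs), [qid]))
    return clusters

# Toy star-schema workload: two date/store queries cluster together,
# while the customer/city query forms its own (new) group.
workload = {
    "q1": {"date", "store"},
    "q2": {"date", "store", "product"},
    "q3": {"customer", "city"},
}
groups = cluster_queries(workload)
```

Because only the clusters that change need their fragments repartitioned and reallocated, the proactive deployment avoids re-running the full partitioning pipeline on every workload shift.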
Most existing optimization methods for neural architecture search (NAS), including evolutionary algorithms, reinforcement learning and gradient-based approaches, have not employed memory strategies explicitly, which m...
Population-based search algorithms, such as the Differential Evolution approach, evolve a pool of candidate solutions during the optimization process and are suitable for massively parallel architectures promoted by t...
Graphics processing units (GPUs) are widely used in the area of scientific computing. While GPUs provide much higher peak performance, efficient implementation of real applications on GPU architectures is still a non-trivial task, and it is crucial to devise efficient solution algorithms that can better utilize these architectures. This paper presents our efforts in parallelizing and optimizing LESAP, a CFD application for scramjet combustion simulation, on NVIDIA GPUs. The GPU parallelization is based on the CUDA programming model, with a data-parallel implicit time-marching method that is efficient on the GPU architecture. Furthermore, shared memory and redundant calculation are employed to reduce memory access overhead during GPU computation, and data transfer between CPU and GPU is optimized by packing the data to be transferred. The experimental results show that the GPU version, when run on four V100 GPUs, achieves a speedup of 11.26x over the CPU version running on two 24-core Intel Skylake Gold 6240R CPUs. Excellent parallel scalability across multiple GPUs is also observed.
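The transfer-packing optimization mentioned above rests on a simple observation: every host-device copy pays a fixed launch latency on top of its per-byte cost, so one large packed copy beats many small ones. The sketch below is a toy cost model, not LESAP's implementation; the latency and bandwidth figures are invented for illustration.

```python
def transfer_cost(sizes, latency_us=10.0, us_per_kb=0.1):
    """Model each host<->device transfer as a fixed launch latency plus a
    per-byte cost; `sizes` lists the byte count of each separate transfer."""
    return sum(latency_us + s / 1024 * us_per_kb for s in sizes)

# Four CFD field arrays copied one at a time pay the fixed latency four
# times; packing them into one contiguous buffer pays it once.
fields = [4096, 8192, 2048, 1024]
unpacked = transfer_cost(fields)           # one transfer per field
packed = transfer_cost([sum(fields)])      # single packed transfer
assert packed < unpacked
```

The per-byte cost is identical in both cases; the entire saving comes from amortizing the fixed per-transfer latency, which is why packing pays off most when the individual arrays are small.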