检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

时间限定

出版年份：

文献类型

图书期刊文献学位论文多媒体

馆藏选择

电子馆藏纸本馆藏

核心期刊

全部期刊 SCI 收录期刊 SSCI 收录期刊 EI 收录期刊 CSCD 收录期刊 CSSCI 收录期刊

语言

中文英文

文献类型

期刊文献图书学位论文标准纸本馆藏

帮助

文字说明：

T=题名（书名、题名），A=作者（责任者），K=主题词，P=出版物名称，PU=出版社名称，O=机构（作者单位、学位授予单位、专利申请人），L=中图分类号，C=学科分类号，U=全部字段，Y=年（出版发行年、学位年度、标准发布年）

检索规则说明：

AND代表“并且”；OR代表“或者”；NOT代表“不包含”；(注意必须大写,运算符两边需空一格)

检索范例：

范例一：(K=图书馆学 OR K=情报学) AND A=范并思 AND Y=1982-2016
范例二：P=计算机应用与软件 AND (U=C++ OR U=Basic) NOT K=Visual AND Y=2011-2016

分类表

所选分类

>> <<

限定检索结果

文献类型

154 篇 会议
20 册 图书
7 篇 期刊文献

馆藏范围

180 篇 电子文献
1 种 纸本馆藏

日期分布

学科分类号

79 篇 工学
- 77 篇 计算机科学与技术...
- 51 篇 软件工程
- 19 篇 电子科学与技术（可...
- 10 篇 电气工程
- 7 篇 信息与通信工程
- 5 篇 控制科学与工程
- 2 篇 冶金工程
- 2 篇 建筑学
- 2 篇 交通运输工程
- 2 篇 安全科学与工程
- 1 篇 机械工程
- 1 篇 光学工程
- 1 篇 材料科学与工程（可...
- 1 篇 土木工程
- 1 篇 石油与天然气工程
- 1 篇 生物医学工程（可授...
- 1 篇 生物工程
- 1 篇 网络空间安全
30 篇 理学
- 24 篇 数学
- 3 篇 生物学
- 2 篇 物理学
- 2 篇 统计学（可授理学、...
- 1 篇 地球物理学
- 1 篇 系统科学
7 篇 管理学
- 6 篇 管理科学与工程(可...
- 3 篇 工商管理
- 1 篇 图书情报与档案管...
2 篇 法学
- 2 篇 社会学
2 篇 农学
1 篇 经济学
- 1 篇 应用经济学
1 篇 教育学
- 1 篇 教育学

主题

25 篇 parallel archite...
21 篇 computer archite...
16 篇 parallel process...
13 篇 parallel algorit...
11 篇 algorithm analys...
9 篇 software enginee...
8 篇 computer science
8 篇 computer communi...
8 篇 computer program...
7 篇 computer systems...
7 篇 concurrent compu...
7 篇 parallel program...
7 篇 parallel machine...
6 篇 computational mo...
6 篇 artificial intel...
5 篇 algorithm design...
5 篇 hardware
4 篇 information syst...
4 篇 software enginee...
4 篇 very large scale...

机构

3 篇 seecs university...
2 篇 leiden universit...
2 篇 computer network...
2 篇 cnrs st. martin ...
2 篇 irisa rennes fr ...
2 篇 school of inform...
2 篇 department of in...
2 篇 beihang universi...
2 篇 oak ridge natl l...
2 篇 school of comput...
2 篇 deakin universit...
2 篇 university of br...
2 篇 school of inform...
2 篇 school of inform...
2 篇 department of co...
1 篇 ibm united state...
1 篇 dalian maritime ...
1 篇 dip. di matemati...
1 篇 cnrs besancon fr...
1 篇 institut d'elect...

作者

4 篇 ivan stojmenovic
3 篇 yang xiang
3 篇 wanlei zhou
3 篇 robert y.
2 篇 kaiser hartmut
2 篇 cosnard m.
2 篇 bernady o. apduh...
2 篇 zhiyang li
2 篇 b. zavidovique
2 篇 geyong min
2 篇 muller j.m.
2 篇 wenyu qu
2 篇 xian-he sun
2 篇 koji nakano
2 篇 tingting yang
2 篇 guojun wang
2 篇 yulei wu
2 篇 lei liu
2 篇 hua guo
2 篇 albert zomaya

语言

176 篇 英文
3 篇 中文
1 篇 法文
1 篇 其他

检索条件"任意字段=Proceedings of the International Workshop on Algorithms and Parallel VSLI Architectures"

共 181 条记录，以下是1-10 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

proceedings of international workshop on OpenCL and SYCL, IWOCL 2024

Proceedings of International Workshop on OpenCL and SYCL, IW...

引用

12th international workshop on OpenCL and SYCL, IWOCL 2024

ISBN: (纸本)9798400717901

The proceedings contain 22 papers. The topics discuss include: towards efficient OpenCL pipe specification for hardware accelerators;SimSYCL: a SYCL implementation targeting development, debugging, simulation and conformance;experiences with implementing Kokkos’ SYCL backend;optimization and evaluation of breadth first search with oneAPI/SYCL on Intel FPGAs: from describing algorithms to describing architectures;improving performance portability of the procedurally generated high energy physics event generator MadGraph using SYCL;unlocking performance portability on LUMI-G supercomputer: a virtual screening case study;evaluation of SYCL’s different data parallel kernels;smoothing the migration from CUDA to SYCL: SYCLomatic utility features;and optimization of fast Fourier transform (FFT) for Qualcomm Adreno graphics processing unit.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Modeling TCP Performance using Graph Neural Networks 1

Modeling TCP Performance using Graph Neural Networks

引用

1st international workshop on Graph Neural Networking (GNNet)

作者： Jaeger, Benedikt Helm, Max Schwegmann, Lars Carle, Georg Tech Univ Munich Chair Network Architectures & Serv Munich Germany

ISBN: (纸本)9781450399333

TCP throughput and RTT prediction are essential to model TCP behavior and optimize network configurations. Flows adapt their sending rate to network parameters like link capacity or buffer size and interact with parallel flows. Especially the elastic behavior of TCP congestion control can vary, even when only slight changes in the network occur. Thus, existing analytical models for TCP behavior reach their limits due to the number and complexity of different algorithms. Machine learning approaches, in contrast, are often fixed to specific network topologies. This paper presents a TCP bandwidth and RTT prediction approach that can handle different algorithms and topologies. For this, we utilize Gated Graph Neural Networks and simulated network traffic. We evaluate different encodings of the input data into graphs and how network size, number of flows, and TCP algorithms influence prediction accuracy. Additionally, we quantify the impact of different input features on our models. We show that Graph Neural Networks can be used to model TCP behavior. The resulting models can predict RTT with a median relative error of 2.29 % and throughput with an error of 13.31 %.

关键词： TCP modeling congestion control throughput round-trip time graph neural networks

来源：评论

学校读者我要写书评

暂无评论

Domain Decomposition Preconditioners for Unstructured Network Problems in parallel Vector architectures 21

Domain Decomposition Preconditioners for Unstructured Networ...

引用

50th international Conference on parallel Processing (ICPP)

作者： Maldonado, Daniel Adrian Schanen, Michel Pacaud, Francois Anitescu, Mihai Argonne Natl Lab Lemont IL 60439 USA

ISBN: (纸本)9781450384414

In this paper we present our experience implementing domain decomposition preconditioners on vector architectures. In particular, we will focus on the solution of unstructured network equations arising from electrical power systems by preconditioning iterative algorithms with the Additive Schwarz Method (ASM). The implementation will be carried out using the Julia programming language, which allows for easy prototyping and interfacing with GPU architectures thanks to its multiple dispatch features. In our experiments, we will show the trade-off between device throughput and convergence of the iterative algorithm as the size of the domain varies, and determine optimal fronts of computational performance.

关键词： Iterative methods

来源：评论

学校读者我要写书评

暂无评论

Scalable parallel algorithm for fast computation of Transitive Closure of Graphs on Shared Memory architectures 6

Scalable parallel algorithm for fast computation of Transiti...

引用

IEEE/ACM 6th international workshop on Extreme Scale Programming Models and Middleware (ESPM2)

作者： Patel, Sarthak Dave, Bhrugu Kumbhani, Smit Desai, Mihir Kumar, Sidharth Chaudhury, Bhaskar DA IICT Grp Computat Sci & HPC Gandhinagar India Univ Alabama Birmingham Dept Comp Sci Birmingham AL 35294 USA

ISBN: (纸本)9781665411400

We present a scalable algorithm that computes the transitive closure of a graph on shared memory architectures using the OpenMP API in C++. Two different parallelization strategies have been presented and the performance of the two algorithms has been compared for several data-sets of varying sizes. We demonstrate the scalability of the best parallel implementation up to 176 threads on a shared memory architecture, by producing a graph with more than 3.82 trillion edges. To the best of our knowledge, this is the first implementation that has computed the transitive closure of such a large graph on a shared memory system. Optimization strategies for better cache utilization for large data-sets have been discussed. The important issue of load balancing has been analyzed and its mitigation using the optimal OpenMP scheduling clause has been discussed in detail.

关键词： Graph algorithms OpenMP Transitive Closure Scalability Shared memory

来源：评论

学校读者我要写书评

暂无评论

proceedings of PMBS 2022: Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems, Held in conjunction with SC 2022: The international Conference for High Performance Computing, Networking, Storage and Analysis

Proceedings of PMBS 2022: Performance Modeling, Benchmarking...

引用

13th IEEE/ACM international workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems, PMBS 2022

ISBN: (纸本)9781665451857

The proceedings contain 14 papers. The topics discussed include: ML-based performance portability for time-dependent density functional theory in HPC environments;a comprehensive evaluation of novel ai accelerators for deep learning workloads;frontier vs the exascale report: why so long? and are we really there yet?;evaluating ISO C++ parallel algorithms on heterogeneous HPC systems;going green: optimizing GPUs for energy efficiency through model-steered auto-tuning;performance analysis with unified hardware counter metrics;a methodology for evaluating tightly-integrated and disaggregated accelerated architectures;WfBench: automated generation of scientific workflow benchmarks;high-performance GMRES multi-precision benchmark: design, performance, and challenges;and an initial evaluation of arm’s scalable matrix extension.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Implementing Arbitrary/Common Concurrent Writes of CRCW PRAM 21

Implementing Arbitrary/Common Concurrent Writes of CRCW PRAM

引用

50th international Conference on parallel Processing (ICPP)

作者： Ghanim, Fady Elwasif, Wael R. Bernholdt, David E. Oak Ridge Natl Lab Oak Ridge TN 37830 USA

ISBN: (纸本)9781450384414

The parallel Random Access Machines (PRAM) abstraction is the simplest and most elegant algorithmic model for the design and analysis of parallel algorithms. It consists of different models categorized based on the underlying memory access mode used, the most powerful of which is the Concurrent Read Concurrent Write (CRCW) model. A PRAM algorithm describes a series of rounds, each of which consists of a collection of operations that can be executed concurrently within the same time step. However, the lack of support for concurrent memory accesses and the prevalence of asynchronous programming models led to the belief that implementing CRCW PRAM algorithms is unattainable and prompted many to avoid this model except for theoretical studies of optimal performance. In this work, we study the arbitrary and common concurrent writes in the CRCW PRAM model and explore implementation challenges on general-purpose systems. Moreover, we examine current practices for implementing common/arbitrary concurrent writes and propose a new efficient lightweight and thread-safe method to implement concurrent writes through leveraging atomic instructions. To demonstrate the efficacy of our method, we developed OpenMP kernels for classical CRCW PRAM algorithms and provide experimental results and comparisons based on run time performance measured over the x86 multicore architecture. Our results show a performance speedup compared to current practices up to 4.5x across all our benchmarks.

关键词： CRCW PRAM parallel algorithms Arbitrary Concurrent Writes Write-conflict resolution parallel architectures

来源：评论

学校读者我要写书评

暂无评论

parallel SIMD-A Policy Based Solution for Free Speed-Up using C++ Data-parallel Types 6

Parallel SIMD-A Policy Based Solution for Free Speed-Up usin...

引用

6th international IEEE/ACM workshop on Extreme Scale Programming Models and Middleware, ESPM2 2021

作者： Yadav, Srinivas Gupta, Nikunj Reverdell, Auriane Kaiser, Hartmut Keshav Memorial Institute of Technology Hyderabad India University of Illinois at Urbana-Champaign Illinois United States Swiss National Supercomputing Centre Zurich Switzerland Louisiana State University Center for Computation Technology Baton Rouge United States

ISBN: (纸本)9781665411400

Recent additions to the C++ standard and ongoing standardization efforts aim to add data-parallel types to the C++ standard library. This enables the use of vectorization techniques in existing C++ codes without having to rely on the C++ compiler's abilities to auto-vectorize the code's execution. The integration of the existing parallel algorithms with these new data-parallel types opens up a new way of speeding up existing codes with minimal effort. Today, only very little implementation experience exists for potential data-parallel execution of the standard parallel algorithms. In this paper, we report on experiences and performance analysis results for our implementation of two new data-parallel execution policies usable with HPX's parallel algorithms module: simd and par_simd. We utilize the new experimental implementation of data-parallel types provided by recent versions of the GCC and Clang C++ standard libraries. The benchmark results collected from artificial tests and real-world codes presented in this paper are very promising. Compared to sequenced execution, we report on speed-ups of more than three orders of magnitude when executed using the newly implemented data-parallel execution policy par_simd with HPX's parallel algorithms. We also report that our implementation is performance portable across different compute architectures (x64-Intel and AMD, and Arm), using different vectorization extensions (AVX2, AVX512, and NEON128). © 2021 IEEE.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

proceedings of IA3 2020: 10th workshop on Irregular Applications: architectures and algorithms, Held in conjunction with SC 2020: The international Conference for High Performance Computing, Networking, Storage and Analysis

Proceedings of IA3 2020: 10th Workshop on Irregular Applicat...

引用

10th workshop on Irregular Applications: architectures and algorithms, IA3 2020

ISBN: (纸本)9780738110905

The proceedings contain 8 papers. The topics discussed include: accelerating domain propagation: an efficient GPU-parallel algorithm over sparse matrices;parallelizing irregular computations for molecular docking;reducing queuing impact in irregular data streaming applications;supporting irregularity in throughput-oriented computing by SIMT-SIMD integration;DistDGL: distributed graph neural network training for billion-scale graphs;labeled triangle indexing for efficiency gains in distributed interactive subgraph search;distributed memory graph coloring algorithms for multiple GPU;and performance evaluation of the vectorizable binary search algorithms on an FPGA platform.

关键词：

来源：评论

学校读者我要写书评

暂无评论

proceedings of the 11th workshop on parallel Programming and Run-Time Management Techniques for Many-Core architectures / 9th workshop on Design Tools and architectures for Multicore Embedded Computing Platforms, PARMA-DITAM 2020

Proceedings of the 11th Workshop on Parallel Programming and...

引用

11th workshop on parallel Programming and Run-Time Management Techniques for Many-Core architectures / 9th workshop on Design Tools and architectures for Multicore Embedded Computing Platforms, PARMA-DITAM 2020

The proceedings contain 5 papers. The topics discussed include: sparse matrix-dense matrix multiplication on heterogeneous CPU+FPGA embedded system;run-time power modelling in embedded GPUs with dynamic voltage and frequency scaling;fault-tolerant online scheduling algorithms for CubeSats;an OpenMP parallel genetic algorithm for design space exploration of heterogeneous multi-processor embedded systems;and automated precision tuning in activity classification systems: a case study.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Proceedings of the 11th Workshop on Parallel Programming and...

引用

8th international Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, HEART 2017

ISBN: (纸本)9781450375450

The proceedings contain 5 papers. The topics discussed include: sparse matrix-dense matrix multiplication on heterogeneous CPU+FPGA embedded system;run-time power modeling in embedded GPUs with dynamic voltage and frequency scaling;fault-tolerant online scheduling algorithms for CubeSats;an OpenMP parallel genetic algorithm for design space exploration of heterogeneous multi-processor embedded systems;and automated precision tuning in activity classification systems: a case study.

关键词：

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共19页 << < 1 2 3 4 5 6 7 8 9 10 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：