ISBN: (Print) 9798400704161
The proceedings contain 54 papers. The topics discussed include: expediting hazard pointers with bounded RCU critical sections; Alock: asymmetric lock primitive for RDMA systems; when is parallelism fearless and zero-cost with Rust?; efficient parallel reinforcement learning framework using the reactor model; parallel best arm identification in heterogeneous environments; brief announcement: lock-free learned search data structure; brief announcement: LIT: lookup interlocked table for range queries; brief announcement: a fast scalable detectable unrolled lock-based linked list; scheduling out-trees online to optimize maximum flow; optimizing dynamic data center provisioning through speed scaling: a primal-dual perspective; scheduling jobs with work-inefficient parallel solutions; and multi bucket queues: efficient concurrent priority scheduling.
ISBN: (Print) 9781450395458
The proceedings contain 47 papers. The topics discussed include: Quancurrent: a concurrent quantiles sketch; an efficient scheduler for task-parallel interactive applications; efficient synchronization-light work stealing; balanced allocations in batches: the tower of two choices; massively parallel tree embeddings for high dimensional spaces; deterministic massively parallel symmetry breaking for sparse graphs; an associativity threshold phenomenon in set-associative caches; increment-and-freeze: every cache, everywhere, all of the time; multidimensional approximate agreement with asynchronous fallback; a tight characterization of fast failover routing: resiliency to two link failures is possible; releasing memory with optimistic access: a hybrid approach to memory reclamation and allocation in lock-free programs; transactional composition of nonblocking data structures; applying hazard pointers to more concurrent data structures; and nearly optimal parallel algorithms for longest increasing subsequence.
ISBN: (Print) 9781450391467
The proceedings contain 44 papers. The topics discussed include: deterministic distributed sparse and ultra-sparse spanners and connectivity certificates; fully polynomial-time distributed computation in low-treewidth graphs; adaptive massively parallel algorithms for cut problems; preparing for disaster: leveraging precomputation to efficiently repair graph structures upon failures; the energy complexity of Las Vegas leader election; a fully-distributed peer-to-peer protocol for Byzantine-resilient distributed hash tables; brief announcement: the (limited) power of multiple identities: asynchronous Byzantine reliable broadcast with improved resilience through collusion; brief announcement: composable dynamic secure emulation; and robust and optimal contention resolution without collision detection.
ISBN: (Print) 9781450391467
In recent years, the ever-increasing impact of memory access bottlenecks has brought forth renewed interest in near-memory processing (NMP) architectures. In this work, we propose and empirically evaluate hybrid data structures, which are concurrent data structures custom-designed for these new NMP architectures. We focus on cache-optimized data structures, such as skiplists and B+ trees, that are often used as index structures in online transaction processing (OLTP) systems to enable fast key-based lookups. These data structures are hierarchical: lookups begin at a small number of top-level nodes and diverge to many different node paths as they move down the hierarchy, so nodes in higher levels benefit more from caching. Our proposed hybrid data structures split traditional hierarchical data structures into a host-managed portion consisting of the higher-level nodes and an NMP-managed portion consisting of the remaining lower-level nodes, thus retaining and further enhancing the cache-conscious optimizations of their conventional implementations. Although the idea might seem relatively simple, splitting the data structure introduces new synchronization problems, and careful implementation is required to ensure high concurrency and correctness. We provide implementations of a hybrid skiplist and a hybrid B+ tree, and we empirically evaluate them on a cycle-accurate full-system architecture simulator. Our results show that hybrid data structures have the potential to improve performance by more than 2x compared to state-of-the-art concurrent data structures.
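The host/NMP split described above can be illustrated with a minimal, single-threaded sketch (hypothetical class and split point, not the paper's implementation): the sparse upper index stays host-managed and cache-resident, while the dense bottom level plays the role of the NMP-managed portion.

```python
import bisect

class HybridSkiplist:
    """Toy sketch of a hybrid index: sparse upper levels are host-managed
    and cache-friendly; the dense bottom level is NMP-managed (simulated)."""

    def __init__(self, keys, fanout=4):
        self.fanout = fanout
        # Bottom level holds every key; in hardware it would live near memory.
        self.nmp_level = sorted(keys)
        # Host-managed sparse index: every `fanout`-th key of the bottom level.
        self.host_index = self.nmp_level[::fanout]

    def lookup(self, key):
        # Host step: binary-search the small cached index for a start hint.
        i = bisect.bisect_right(self.host_index, key) - 1
        start = max(i, 0) * self.fanout
        # NMP step: a short scan of the dense level; in a real NMP system this
        # runs near memory, and only the hint crosses the host/NMP boundary.
        for k in self.nmp_level[start:]:
            if k == key:
                return True
            if k > key:
                return False
        return False
```

Splitting lookups this way keeps the hot upper levels in the host cache; the real difficulty the abstract points to, synchronizing concurrent updates that span both portions, is deliberately omitted from this single-threaded sketch.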
ISBN: (Print) 9781665458382
The proceedings contain 26 papers. The topics discussed include: automated performance prediction of microservice applications using simulation; exact and efficient protective jamming in SINR-based wireless networks; deep learning models for automated identification of scheduling policies; energy-efficiency comparison of common sorting algorithms; scaling up the performance of distributed key-value stores with in-switch coordination; simulation modeling of urban e-scooter mobility; performance characterization of MPI Allreduce in cloud data center networks; enabling extremely fine-grained parallelism via scalable concurrent queues on modern many-core architectures; and mechanisms for transition from monolithic to distributed architecture in software development process.
ISBN: (Print) 9781665458382
Enabling efficient fine-grained task parallelism is a significant challenge for hardware platforms with increasingly many cores. Existing techniques do not scale to hundreds of threads due to the high cost of synchronization in concurrent data structures. To overcome these limitations we present XQueue, a novel lock-less concurrent queuing system with relaxed ordering semantics that is geared towards realizing scalability up to hundreds of concurrent threads. We demonstrate the scalability of XQueue using microbenchmarks and show that XQueue can deliver concurrent operations with latencies as low as 110 cycles at scales of up to 192 cores (up to 6900x improvement compared to traditional synchronization mechanisms) across our diverse hardware, including x86, ARM, and Power9. The reduced latency allows XQueue to provide orders of magnitude (3300x) better throughput than existing techniques. To evaluate the real-world benefits of XQueue, we integrated XQueue with LLVM OpenMP and evaluated five unmodified benchmarks from the Barcelona OpenMP Task Suite (BOTS) as well as a graph traversal benchmark from the GAP benchmark suite. We compared the XQueue-enabled LLVM OpenMP implementation with the native LLVM and GNU OpenMP versions. Using fine-grained task workloads, XQueue can deliver 4x to 6x speedup compared to native GNU OpenMP and LLVM OpenMP in many cases, with speedups as high as 116x in some cases.
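The relaxed-ordering idea can be illustrated with a toy multi-lane queue (a hypothetical sketch, not XQueue's actual design): each producer owns a private lane, so enqueues never contend on a shared tail, and consumers round-robin across lanes, preserving FIFO order only within each lane.

```python
from collections import deque
from itertools import cycle

class RelaxedMultiQueue:
    """Toy relaxed-ordering queue: one lane per producer removes
    enqueue-side contention; dequeues round-robin over lanes, so
    ordering is FIFO per lane but only approximate globally."""

    def __init__(self, num_lanes):
        self.lanes = [deque() for _ in range(num_lanes)]
        self._cursor = cycle(range(num_lanes))

    def enqueue(self, lane_id, item):
        # Each producer writes only to its own lane (no shared tail pointer).
        self.lanes[lane_id].append(item)

    def dequeue(self):
        # Visit each lane at most once, starting at the round-robin cursor.
        for _ in range(len(self.lanes)):
            lane = self.lanes[next(self._cursor)]
            if lane:
                return lane.popleft()
        return None  # all lanes empty
```

A real lock-less implementation would replace the Python deques with atomic ring buffers; the point here is only the ordering relaxation that makes per-lane scaling possible.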
Uncertain or probabilistic graphs have been ubiquitously used in many emerging applications. Previously, CPU-based techniques were proposed to use sampling but suffer from (1) low computation efficiency and large memor...
ISBN: (Print) 9783959772044
Partitioning a graph into blocks of "roughly equal" weight while cutting only few edges is a fundamental problem in computer science with a wide range of applications. In particular, the problem is a building block in applications that require parallel processing. While the number of available cores in parallel architectures has significantly increased in recent years, state-of-the-art graph partitioning algorithms do not work well if the input needs to be partitioned into a large number of blocks. Currently available algorithms often compute highly imbalanced solutions, solutions of low quality, or have excessive running time for this case. This is because most high-quality general-purpose graph partitioners are multilevel algorithms, which perform graph coarsening to build a hierarchy of graphs, initial partitioning to compute an initial solution, and local improvement to improve the solution throughout the hierarchy. However, for a large number of blocks, the smallest graph in the hierarchy that is used for initial partitioning still has to be large. In this work, we substantially mitigate these problems by introducing deep multilevel graph partitioning and a shared-memory implementation thereof. Our scheme continues the multilevel approach deep into initial partitioning, integrating it into a framework where recursive bipartitioning and direct k-way partitioning are combined such that they can operate with high performance and quality. Our integrated approach is stronger, more flexible, arguably more elegant, and reduces bottlenecks for parallelization compared to existing multilevel approaches. For example, for a large number of blocks our algorithm is on average at least an order of magnitude faster than competing algorithms while computing partitions with comparable solution quality. At the same time, our algorithm consistently produces balanced solutions. Moreover, for a small number of blocks, our algorithms are the fastest among competing systems wit...
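The recursive-bipartitioning half of the scheme can be sketched as follows (a simplified illustration, not the paper's algorithm: the placeholder bisection ignores edge cuts and balances only vertex weight):

```python
def bipartition(vertices, left_target):
    """Placeholder bisection: greedily fill the left block up to its
    weight target. A real partitioner would also minimize the edge cut."""
    left, right, acc = [], [], 0
    for v, w in vertices:
        if acc + w <= left_target:
            left.append((v, w))
            acc += w
        else:
            right.append((v, w))
    return left, right

def recursive_partition(vertices, k):
    """Partition weighted vertices into k blocks by recursive
    bipartitioning: split k into k//2 and k - k//2 and divide the
    total weight proportionally at each level of the recursion."""
    if k == 1:
        return [[v for v, _ in vertices]]
    k_left = k // 2
    total = sum(w for _, w in vertices)
    left, right = bipartition(vertices, total * k_left / k)
    return recursive_partition(left, k_left) + recursive_partition(right, k - k_left)
```

Deep multilevel partitioning, as described above, interleaves this recursion with the coarsening hierarchy rather than running initial partitioning on a single small graph.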
ISBN: (Print) 9781939133175
Deep Neural Networks (DNNs) are fast becoming ubiquitous for their ability to attain good accuracy in various machine learning tasks. A DNN's architecture (i.e., its hyperparameters) broadly determines the DNN's accuracy and performance, and is often confidential. Attacking a DNN in the cloud to obtain its architecture can potentially provide major commercial value. Further, attaining a DNN's architecture facilitates other existing DNN attacks. This paper presents Cache Telepathy: an efficient mechanism to help obtain a DNN's architecture using the cache side channel. The attack is based on the insight that DNN inference relies heavily on tiled GEMM (Generalized Matrix Multiply), and that DNN architecture parameters determine the number of GEMM calls and the dimensions of the matrices used in the GEMM functions. Such information can be leaked through the cache side channel. This paper uses Prime+Probe and Flush+Reload to attack the VGG and ResNet DNNs running OpenBLAS and Intel MKL libraries. Our attack is effective in helping obtain the DNN architectures by very substantially reducing the search space of target DNN architectures. For example, when attacking the OpenBLAS library, for the different layers in VGG-16, it reduces the search space from more than 5.4 × 10^12 architectures to just 16; for the different modules in ResNet-50, it reduces the search space from more than 6 × 10^46 architectures to only 512.
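The dimension-to-hyperparameter mapping at the heart of the attack can be shown for the simplest case, a fully connected layer (hypothetical helpers, not the paper's code): the layer is computed as one GEMM C[m×n] = A[m×k] · B[k×n], with A the activations and B the weights, so recovering (m, k, n) from the side channel directly reveals the layer's widths.

```python
def fc_layer_from_gemm(m, k, n):
    """Invert the GEMM mapping for a fully connected layer:
    A is (batch x in_features), B is (in_features x out_features),
    so the observed dimensions reveal the layer hyperparameters."""
    return {"batch_size": m, "in_features": k, "out_features": n}

def count_gemms(layer_widths):
    """A chain of fully connected layers issues one GEMM per layer,
    so the number of observed GEMM calls reveals the network depth."""
    return len(layer_widths) - 1  # widths include the input layer
```

Convolutional layers leak similarly once lowered to GEMM (e.g., via im2col), which is why the observed tiled-GEMM call counts and dimensions constrain the architecture search space so sharply.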
ISBN: (Print) 9781450357999
The proceedings contain 47 papers. The topics discussed include: parallel minimum cuts in near-linear work and low depth; trees for vertex cuts, hypergraph cuts and minimum hypergraph bisection; dynamic representations of sparse distributed networks: a locality-sensitive approach; constant-depth and subcubic-size threshold circuits for matrix multiplication; integrated model, batch, and domain parallelism in training neural networks; brief announcement: on approximating PageRank locally with sublinear query complexity; brief announcement: coloring-based task mapping for dragonfly systems; brief announcement: parallel transitive closure within 3D crosspoint memory; and lock-free contention adapting search trees.