检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

344 篇 会议
19 篇 期刊文献
1 册 图书

馆藏范围

364 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

305 篇 工学
- 261 篇 软件工程
- 250 篇 计算机科学与技术...
- 13 篇 电子科学与技术（可...
- 9 篇 信息与通信工程
- 5 篇 控制科学与工程
- 4 篇 机械工程
- 4 篇 生物工程
- 3 篇 生物医学工程（可授...
- 1 篇 力学（可授工学、理...
- 1 篇 动力工程及工程热...
- 1 篇 电气工程
- 1 篇 核科学与技术
- 1 篇 农业工程
- 1 篇 环境科学与工程（可...
- 1 篇 网络空间安全
57 篇 理学
- 53 篇 数学
- 4 篇 生物学
- 4 篇 系统科学
- 4 篇 统计学（可授理学、...
- 2 篇 化学
18 篇 管理学
- 12 篇 管理科学与工程(可...
- 11 篇 工商管理
- 5 篇 图书情报与档案管...
5 篇 经济学
- 5 篇 应用经济学
3 篇 法学
- 3 篇 社会学
3 篇 教育学
- 3 篇 教育学
1 篇 农学
- 1 篇 作物学

主题

54 篇 performance
50 篇 parallel process...
34 篇 parallel program...
33 篇 algorithms
27 篇 languages
25 篇 design
20 篇 parallel algorit...
20 篇 gpu
9 篇 experimentation
9 篇 measurement
8 篇 parallel
7 篇 scalability
7 篇 graphics process...
7 篇 theory
7 篇 parallel computi...
6 篇 parallelism
6 篇 mpi
6 篇 concurrency
5 篇 graph algorithms
5 篇 logic programmin...

机构

7 篇 carnegie mellon ...
4 篇 indiana univ blo...
3 篇 univ of tokyo
3 篇 tsinghua univ de...
3 篇 univ chinese aca...
3 篇 massachusetts in...
3 篇 univ illinois ur...
3 篇 swiss fed inst t...
3 篇 mit csail united...
3 篇 shanghai jiao to...
3 篇 tsinghua univ pe...
3 篇 univ calif berke...
2 篇 ist austria klos...
2 篇 georgetown univ ...
2 篇 univ wisconsin d...
2 篇 yale university ...
2 篇 shanghai key lab...
2 篇 univ of wisconsi...
2 篇 tsinghua univers...
2 篇 shanghai jiao to...

作者

8 篇 blelloch guy e.
6 篇 hoefler torsten
6 篇 garland michael
6 篇 zhai jidong
6 篇 chen haibo
6 篇 shun julian
5 篇 sun yihan
4 篇 dhulipala laxman
4 篇 chen wenguang
4 篇 tsigas philippas
4 篇 tan guangming
4 篇 wang haojie
4 篇 mellor-crummey j...
4 篇 gu yan
4 篇 kennedy ken
3 篇 taura kenjiro
3 篇 li jiajia
3 篇 yonezawa akinori
3 篇 pingali keshav
3 篇 kim jungwon

语言

361 篇 英文
3 篇 其他

检索条件"任意字段=Proceedings of the 24th Symposium on Principles and Practice of Parallel Programming"

共 364 条记录，以下是21-30 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

PPDP '24: proceedings of the 26th International symposium on principles and practice of Declarative programming 26

PPDP '24: Proceedings of the 26th International Symposium on...

引用

26th International symposium on principles and practice of Declarative programming, PPDP 2024, 26th International symposium on Formal Methods and held in conjunction with LOPSTR 2024

作者： Bruni, Alessandro Momigliano, Alberto Pradella, Matteo Rossi, Matteo

ISBN: (纸本)9798400709692

the proceedings contain 19 papers. the topics discussed include: a simple view of multiparty session types;on the preciseness of subtyping in session types: 10 years later;higher-order unification for free!: reusing the meta-language unification for the object language;declarative macro-programming of collective systems with aggregate computing: an experience report;hierarchical higher-order port-graphs: a rewriting-based modelling language;on the almost-sure termination of binary sessions;formal verification of executable matrix inversion via adjoint matrix and gaussian elimination;grammar-based pattern matching and type checking for difference data structures;evidence tampering and chain of custody in layered attestations;and towards effective ASP-based stream reasoning: facilitate the reasoning over patterns of events.

关键词：

来源：评论

学校读者我要写书评

暂无评论

GraphCube: Interconnection Hierarchy-aware Graph Processing 24

GraphCube: Interconnection Hierarchy-aware Graph Processing

引用

29th ACM SIGPLAN Annual symposium on principles and practice of parallel programming (PPoPP)

作者： Gan, Xinbiao Wu, Guang Qiu, Shenghao Xiong, Feng Si, Jiaqi Fang, Jianbin Dong, Dezun Gong, Chunye Li, Tiejun Wang, Zheng NUDT Beijing Peoples R China Univ Leeds Leeds W Yorkshire England Natl Supercomputer Ctr Tianjin Peoples R China

ISBN: (纸本)9798400704352

Processing large-scale graphs with billions to trillions of edges requires efficiently utilizing parallel systems. However, current graph processing engines do not scale well beyond a few tens of computing nodes because they are oblivious to the communication cost variations across the interconnection hierarchy. We introduce GraphCube, a better approach to optimizing graph processing on large-scale parallel systems with complex interconnections. GraphCube features a new graph partitioning approach to achieve better load balancing and minimize communication overhead across multiple levels of the interconnection hierarchy. We evaluate GraphCube by applying it to fundamental graph operations performed on synthetic and real-world graph datasets. Our evaluation used up to 79,024 computing nodes and 1.2+ million processor cores. Our large-scale experiments show that GraphCube outperforms state-of-the-art parallel graph processing methods in throughput and scalability. Furthermore, GraphCube outperformed the top-ranked systems on the Graph 500 list.

关键词： Graph processing Graph partitioning parallel computing Vectorization Graph500

来源：评论

学校读者我要写书评

暂无评论

CPMA: An Efficient Batch-parallel Compressed Set Without Pointers 24

CPMA: An Efficient Batch-Parallel Compressed Set Without Poi...

引用

29th ACM SIGPLAN Annual symposium on principles and practice of parallel programming (PPoPP)

作者： Wheatman, Brian Burns, Randal Buluc, Aydin Xu, Helen Johns Hopkins Univ Baltimore MD 21218 USA Lawrence Berkeley Natl Lab Lawrence KS USA Georgia Inst Technol Atlanta GA USA

ISBN: (纸本)9798400704352

this paper introduces the batch-parallel Compressed Packed Memory Array (CPMA), a compressed, dynamic, ordered set data structure based on the Packed Memory Array (PMA). Traditionally, batch-parallel sets are built on pointerbased data structures such as trees because pointer-based structures enable fast parallel unions via pointer manipulation. Whencompared with cache-optimized trees, PMAswere slower to update but faster to scan. the batch-parallel CPMA overcomes this tradeoff between updates and scans by optimizing for cache-friendliness. On average, the CPMA achieves 3x faster batch-insert throughput and 4x faster range-query throughput compared with compressed PaC-trees, a state-of-the-art batch-parallel set library based on cache-optimized trees. We further evaluate the CPMA compared with compressed PaC-trees and Aspen, a state-of-the-art system, on a realworld application of dynamic-graph processing. the CPMA is on average 1.2x faster on a suite of graph algorithms and 2x faster on batch inserts when compared with compressed PaC-trees. Furthermore, the CPMA is on average 1.3x faster on graph algorithms and 2x faster on batch inserts compared with Aspen.

关键词： packed memory array batch-parallel compression data structures dynamic graphs

来源：评论

学校读者我要写书评

暂无评论

proceedings of the ACM SIGPLAN symposium on principles and practice of parallel programming, PPOPP

Proceedings of the ACM SIGPLAN Symposium on Principles and P...

引用

24th ACM SIGPLAN symposium on principles and practice of parallel programming, PPoPP 2019

ISBN: (纸本)9781450362252

the proceedings contain 58 papers. the topics discussed include: beyond human-level accuracy: computational challenges in deep learning;throughput-oriented GPU memory allocation;SEP-graph: finding shortest execution paths for graph processing under a hybrid framework on GPU;incremental flattening for nested data parallelism;modular transactions: bounding mixed races in space and time;processing transactions in a predefined order;data-flow/dependence profiling for structured transformations;lightweight hardware transactional memory profiling;provably and practically efficient granularity control;semantics-aware scheduling policies for synchronization determinism;and a round-efficient distributed betweenness centrality algorithm.

关键词：

来源：评论

学校读者我要写书评

暂无评论

INFINEL: An efficient GPU-based processing method for unpredictable large output graph queries 24

INFINEL: An efficient GPU-based processing method for unpred...

引用

29th ACM SIGPLAN Annual symposium on principles and practice of parallel programming (PPoPP)

作者： Park, Sungwoo Oh, Seyeon Kim, Min-Soo Korea Adv Inst Sci & Technol Seoul South Korea GraphAI Seoul South Korea

ISBN: (纸本)9798400704352

With the introduction of GPUs, which are specialized for iterative parallel computations, the execution of computationintensive graph queries using a GPU has seen significant performance improvements. However, due to the memory constraints of GPUs, there has been limited research on handling large-scale output graph queries with unpredictable output sizes on a GPU. Traditionally, two-phase methods have been used, where the query is re-executed after splitting it into sub-tasks while only considering the size of the output in a static manner. However, two-phase methods become highly inefficient when used with graph data with extreme skew, failing to maximize the GPU performance. this paper proposes INFINEL, which handles unpredictable large output graph queries in a one-phase method through chunk allocation per thread and kernel stop/restart methods. We also propose applicable optimization techniques due to the corresponding unique characteristics of operating with low time/space overhead and not heavily relying on the GPU output buffer size. through extensive experiments, we demonstrate that our one-phase method of INFINEL improves the performance by up to 31.5 times over the conventional twophase methods for triangle listing ULO query.

关键词： Graph query processing Large output parallel computing GPU

来源：评论

学校读者我要写书评

暂无评论

ParlayANN: Scalable and Deterministic parallel Graph-Based Approximate Nearest Neighbor Search Algorithms 24

ParlayANN: Scalable and Deterministic Parallel Graph-Based A...

引用

29th ACM SIGPLAN Annual symposium on principles and practice of parallel programming (PPoPP)

作者： Manohar, Magdalen Dobson Shen, Zheqi Blelloch, Guy E. Dhulipala, Laxman Gu, Yan Simhadri, Harsha Vardhan Sun, Yihan Carnegie Mellon Univ Pittsburgh PA 15213 USA UC Riverside Riverside CA USA Univ Maryland Baltimore MD USA Microsoft Res Redmond WA USA

ISBN: (纸本)9798400704352

Approximate nearest-neighbor search (ANNS) algorithms are a key part of the modern deep learning stack due to enabling efficient similarity search over high-dimensional vector space representations (i.e., embeddings) of data. Among various ANNS algorithms, graph-based algorithms are known to achieve the best throughput-recall tradeoffs. Despite the large scale of modern ANNS datasets, existing parallel graphbased implementations suffer from significant challenges to scale to large datasets due to heavy use of locks and other sequential bottlenecks, which 1) prevents them from efficiently scaling to a large number of processors, and 2) results in nondeterminism that is undesirable in certain applications. In this paper, we introduce ParlayANN, a library of deterministic and parallel graph-based approximate nearest neighbor search algorithms, along with a set of useful tools for developing such algorithms. In this library, we develop novel parallel implementations for four state-of-the-art graph-based ANNS algorithms that scale to billion-scale datasets. Our algorithms are deterministic and achieve high scalability across a diverse set of challenging datasets. In addition to the new algorithmic ideas, we also conduct a detailed experimental study of our new algorithms as well as two existing non-graph approaches. Our experimental results both validate the effectiveness of our new techniques, and lead to a comprehensive comparison among ANNS algorithms on large scale datasets with a list of interesting findings.

关键词： nearest neighbor search vector search parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

AGAthA: Fast and Efficient GPU Acceleration of Guided Sequence Alignment for Long Read Mapping 24

AGAThA: Fast and Efficient GPU Acceleration of Guided Sequen...

引用

29th ACM SIGPLAN Annual symposium on principles and practice of parallel programming (PPoPP)

作者： Park, Seongyeon Hong, Junguk Song, Jaeyong Kim, Hajin Kim, Youngsok Lee, Jinho Seoul Natl Univ Seoul South Korea Yonsei Univ Seoul South Korea

ISBN: (纸本)9798400704352

With the advance in genome sequencing technology, the lengths of deoxyribonucleic acid (DNA) sequencing results are rapidly increasing at lower prices than ever. However, the longer lengths come at the cost of a heavy computational burden on aligning them. For example, aligning sequences to a human reference genome can take tens or even hundreds of hours. the current de facto standard approach for alignment is based on the guided dynamic programming method. Although this takes a long time and could potentially benefit from high-throughput graphic processing units (GPUs), the existing GPU-accelerated approaches often compromise the algorithm's structure, due to the GPU-unfriendly nature of the computational pattern. Unfortunately, such compromise in the algorithm is not tolerable in the field, because sequence alignment is a part of complicated bioinformatics analysis pipelines. In such circumstances, we propose AGAthA, an exact and efficient GPU-based acceleration of guided sequence alignment. We diagnose and address the problems of the algorithm being unfriendly to GPUs, which comprises strided/redundant memory accesses and workload imbalances that are difficult to predict. According to the experiments on modern GPUs, AGAthA achieves 18.8x speedup against the CPU-based baseline, 9.6x against the best GPU-based baseline, and 3.6x against GPU-based algorithms with different heuristics.

关键词： GPU Acceleration Genome Sequence Alignment Long Reads Dynamic programming

来源：评论

学校读者我要写书评

暂无评论

POSTER: Fast parallel Exact Inference on Bayesian Networks 28

POSTER: Fast Parallel Exact Inference on Bayesian Networks

引用

28th ACM SIGPLAN Annual symposium on principles and practice of parallel programming, PPoPP 2023

作者： Jiang, Jiantong Wen, Zeyi Mansoor, Atif Mian, Ajmal The University of Western Australia Australia Hong Kong University of Science and Technology Guangzhou China

ISBN: (纸本)9798400700156

Bayesian networks (BNs) are attractive, because they are graphical and interpretable machine learning models. However, exact inference on BNs is time-consuming, especially for complex problems. To improve the efficiency, we propose a fast BN exact inference solution named Fast-BNI on multi-core CPUs. Fast-BNI enhances the efficiency of exact inference through hybrid parallelism that tightly integrates coarse- and fine-grained parallelism. We also propose techniques to further simplify the bottleneck operations of BN exact inference. Fast-BNI source code is freely available at https://***/jjiantong/FastBN. © 2023 Owner/Author.

关键词： Bayesian networks

来源：评论

学校读者我要写书评

暂无评论

POSTER: ParGeo: A Library for parallel Computational Geometry 27

POSTER: ParGeo: A Library for Parallel Computational Geometr...

引用

27th ACM SIGPLAN symposium on principles and practice of parallel programming (PPoPP)

作者： Wang, Yiqiu Yu, Shangdi Dhulipala, Laxman Gu, Yan Shun, Julian IMIT CSAIL Riverside CA USA

We present PARGEO, a multicore library for computational geometry algorithms. We describe two of the algorithms from PARGEO, convex hull and the smallest enclosing ball, and present a short evaluation of all implement... 详细信息

ISBN: (纸本)9781450392044

关键词： Computational geometry

来源：评论

学校读者我要写书评

暂无评论

POSTER: Automatic Synthesis of parallel Unix Commands and Pipelines with KUMQUAT 27

POSTER: Automatic Synthesis of Parallel Unix Commands and Pi...

引用

27th ACM SIGPLAN symposium on principles and practice of parallel programming (PPoPP)

作者： Shen, Jiasi Rinard, Martin Vasilakis, Nikos MIT Cambridge MA USA

ISBN: (纸本)9781450392044

We present KUMQUAT, a system for automatically generating data-parallel implementations of UNIX shell commands and pipelines. the generated parallel versions split input streams, execute multiple instantiations of the original pipeline commands to process the splits in parallel, then combine the resulting parallel outputs to produce the final output stream. KumQUAT automatically synthesizes the combine operators, with a domain-specific combiner language acting as a strong regularizer that promotes efficient inference of correct combiners. We present experimental results that show that these combiners enable the effective parallelization of our benchmark scripts.

关键词： Automatic parallelization program synthesis

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共37页 << < 1 2 3 4 5 6 7 8 9 10 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：