ISBN (print): 9781450395458
A symmetric matrix is called a Laplacian if it has nonpositive off-diagonal entries and zero row sums. Since the seminal work of Spielman and Teng (2004) on solving Laplacian linear systems in nearly linear time, several algorithms have been designed for the task. Yet, the work of Kyng and Sachdeva (2016) remains the simplest and most practical sequential solver. They presented a solver purely based on random sampling and without graph-theoretic constructions such as low-stretch trees and sparsifiers. In this work, we extend the result of Kyng and Sachdeva to a simple parallel Laplacian solver with O(m log^3 n log log n) or O((m + n log^5 n) log n log log n) work and O(log^2 n log log n) depth using the ideas of block Cholesky factorization from Kyng et al. (2016). Compared to the best known parallel Laplacian solvers that achieve polylogarithmic depth due to Lee et al. (2015), our solver achieves both better depth and, for dense graphs, better work.
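For concreteness, the defining property in the first sentence (symmetry, nonpositive off-diagonals, zero row sums) can be checked directly; the following is a minimal sketch, with an illustrative 3x3 example that is not from the paper:

```cpp
#include <cmath>
#include <cstdio>
#include <vector>

// Check whether a dense symmetric matrix is a (graph) Laplacian:
// nonpositive off-diagonal entries and zero row sums.
bool isLaplacian(const std::vector<std::vector<double>>& A, double tol = 1e-9) {
    size_t n = A.size();
    for (size_t i = 0; i < n; ++i) {
        double rowSum = 0.0;
        for (size_t j = 0; j < n; ++j) {
            if (std::fabs(A[i][j] - A[j][i]) > tol) return false;  // symmetry
            if (i != j && A[i][j] > tol) return false;             // nonpositive off-diagonal
            rowSum += A[i][j];
        }
        if (std::fabs(rowSum) > tol) return false;                 // zero row sum
    }
    return true;
}

int main() {
    // Laplacian of a path graph on three vertices: 1 - 2 - 3.
    std::vector<std::vector<double>> L = {{ 1, -1,  0},
                                          {-1,  2, -1},
                                          { 0, -1,  1}};
    std::printf("%s\n", isLaplacian(L) ? "Laplacian" : "not a Laplacian");
}
```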
ISBN (print): 9798400704352
Approximate nearest-neighbor search (ANNS) algorithms are a key part of the modern deep learning stack due to enabling efficient similarity search over high-dimensional vector space representations (i.e., embeddings) of data. Among various ANNS algorithms, graph-based algorithms are known to achieve the best throughput-recall tradeoffs. Despite the large scale of modern ANNS datasets, existing parallel graph-based implementations face significant challenges in scaling to large datasets due to heavy use of locks and other sequential bottlenecks, which 1) prevents them from efficiently scaling to a large number of processors, and 2) results in nondeterminism that is undesirable in certain applications. In this paper, we introduce ParlayANN, a library of deterministic and parallel graph-based approximate nearest neighbor search algorithms, along with a set of useful tools for developing such algorithms. In this library, we develop novel parallel implementations for four state-of-the-art graph-based ANNS algorithms that scale to billion-scale datasets. Our algorithms are deterministic and achieve high scalability across a diverse set of challenging datasets. In addition to the new algorithmic ideas, we also conduct a detailed experimental study of our new algorithms as well as two existing non-graph approaches. Our experimental results both validate the effectiveness of our new techniques and lead to a comprehensive comparison among ANNS algorithms on large-scale datasets, with a list of interesting findings.
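This is not ParlayANN's API; as a rough illustration of the search primitive that graph-based ANNS algorithms share, here is a minimal greedy-routing sketch on a proximity graph (the points, graph, and function names are illustrative assumptions):

```cpp
#include <cstdio>
#include <vector>

using Point = std::vector<float>;

float sqDist(const Point& a, const Point& b) {
    float d = 0;
    for (size_t i = 0; i < a.size(); ++i) { float t = a[i] - b[i]; d += t * t; }
    return d;
}

// Greedy routing on a proximity graph: starting from `start`, repeatedly move
// to the out-neighbor closest to the query until no neighbor improves.
int greedySearch(const std::vector<Point>& pts,
                 const std::vector<std::vector<int>>& graph,
                 const Point& query, int start) {
    int cur = start;
    float best = sqDist(pts[cur], query);
    bool improved = true;
    while (improved) {
        improved = false;
        for (int u : graph[cur]) {
            float d = sqDist(pts[u], query);
            if (d < best) { best = d; cur = u; improved = true; }
        }
    }
    return cur;  // approximate nearest neighbor of `query`
}

int main() {
    std::vector<Point> pts = {{0, 0}, {1, 0}, {2, 0}, {2, 2}};
    std::vector<std::vector<int>> graph = {{1}, {0, 2}, {1, 3}, {2}};
    std::printf("nearest to (1.9, 0.1): point %d\n",
                greedySearch(pts, graph, {1.9f, 0.1f}, 0));
}
```

Production graph-based ANNS implementations replace this single greedy walk with a beam search over a bounded candidate list, which is where the parallelization and locking issues discussed above arise.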
ISBN (print): 9781450395458
Clustering and diversification are two central problems with various applications in machine learning, data mining, and information retrieval. The k-center clustering and k-diversity maximization are two of the most well-studied and widely-used problems in this area. Both problems admit sequential algorithms with optimal approximation factors of 2 in any metric space. However, finding distributed algorithms matching the same optimal approximation ratios has been open for more than a decade, with the best current algorithms having factors at least twice the optimal. In this paper, we settle this open problem by presenting constant-round distributed algorithms for k-center clustering and k-diversity maximization in the massively parallel computation (MPC) model, achieving an approximation factor of 2 + ε in any metric space for any constant ε > 0, which is essentially the best possible considering the lower bound of 2 on the approximability of both these problems. Our algorithms are based on a novel technique for approximating vertex degrees and finding a so-called k-bounded maximal independent set in threshold graphs, using only a constant number of MPC rounds. Other applications of our general technique are also implied, including an almost optimal (3 + ε)-approximation algorithm for the k-supplier problem in any metric space in the MPC model.
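The optimal sequential 2-approximation for k-center referred to above is the classic farthest-point greedy; here is a minimal sketch assuming a precomputed distance matrix (the example metric is illustrative, and this is not the paper's MPC algorithm):

```cpp
#include <algorithm>
#include <cstdio>
#include <vector>

// Greedy (farthest-point) 2-approximation for k-center in a metric space:
// repeatedly pick the point farthest from the centers chosen so far.
std::vector<int> kCenterGreedy(const std::vector<std::vector<double>>& dist, int k) {
    int n = (int)dist.size();
    std::vector<int> centers = {0};              // arbitrary first center
    std::vector<double> toNearest = dist[0];     // distance to the closest chosen center
    while ((int)centers.size() < k) {
        int far = (int)(std::max_element(toNearest.begin(), toNearest.end())
                        - toNearest.begin());
        centers.push_back(far);
        for (int i = 0; i < n; ++i)
            toNearest[i] = std::min(toNearest[i], dist[far][i]);
    }
    return centers;
}

int main() {
    // Pairwise distances for four points on a line at coordinates 0, 1, 5, 6.
    std::vector<std::vector<double>> d = {{0, 1, 5, 6},
                                          {1, 0, 4, 5},
                                          {5, 4, 0, 1},
                                          {6, 5, 1, 0}};
    for (int c : kCenterGreedy(d, 2)) std::printf("center %d\n", c);
}
```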
ISBN (print): 9781450395458
We study the problem of finding connected components in the Adaptive Massively Parallel Computation (AMPC) model. We show that when we require the total space to be linear in the size of the input graph, the problem can be solved in O(log* n) rounds in forests (with high probability) and 2^{O(log* n)} expected rounds in general graphs. This improves upon an existing O(log log_{m/n} n) round algorithm. For the case when the desired number of rounds is constant, we show that both problems can be solved using Θ(m + n log^{(k)} n) total space in expectation (in each round), where k is an arbitrarily large constant and log^{(k)} is the k-th iterate of the log_2 function. This improves upon existing algorithms requiring Ω(m + n log n) total space.
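The paper's AMPC algorithms are more involved; purely for intuition, the hooking-and-shortcutting pattern that underlies many parallel connectivity algorithms can be simulated sequentially as follows (the example graph is made up, and this is not the algorithm from the paper):

```cpp
#include <cstdio>
#include <utility>
#include <vector>

// Label propagation with pointer jumping, simulated sequentially: each round,
// every edge hooks the larger label onto the smaller one, then label chains
// are shortcut until the labels stabilize.
std::vector<int> connectedComponents(int n, const std::vector<std::pair<int, int>>& edges) {
    std::vector<int> label(n);
    for (int v = 0; v < n; ++v) label[v] = v;
    bool changed = true;
    while (changed) {
        changed = false;
        // Hooking: adopt the smaller label across each edge.
        for (auto [u, v] : edges) {
            int lu = label[u], lv = label[v];
            if (lu < lv) { label[lv] = lu; changed = true; }
            else if (lv < lu) { label[lu] = lv; changed = true; }
        }
        // Shortcutting (pointer jumping): compress label chains.
        for (int v = 0; v < n; ++v)
            while (label[v] != label[label[v]]) label[v] = label[label[v]];
    }
    return label;
}

int main() {
    std::vector<std::pair<int, int>> edges = {{0, 1}, {1, 2}, {3, 4}};
    std::vector<int> cc = connectedComponents(5, edges);
    for (int v = 0; v < 5; ++v) std::printf("vertex %d -> component %d\n", v, cc[v]);
}
```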
ISBN (print): 9781450395458
This paper studies parallel algorithms for the longest increasing subsequence (LIS) problem. Let n be the input size and k be the LIS length of the input. Sequentially, LIS is a simple problem that can be solved using dynamic programming (DP) in O(n log n) work. However, parallelizing LIS is a long-standing challenge. We are unaware of any parallel LIS algorithm that has optimal O(n log n) work and non-trivial parallelism (i.e., Õ(k) or o(n) span). This paper proposes a parallel LIS algorithm that costs O(n log k) work, Õ(k) span, and O(n) space, and is much simpler than the previous parallel LIS algorithms. We also generalize the algorithm to a weighted version of LIS, which maximizes the weighted sum for all objects in an increasing subsequence. To achieve a better work bound for the weighted LIS algorithm, we designed parallel algorithms for the van Emde Boas (vEB) tree, which has the same structure as the sequential vEB tree and supports work-efficient parallel batch insertion, deletion, and range queries. We also implemented our parallel LIS algorithms. Our implementation is lightweight, efficient, and scalable. On input size 10^9, our LIS algorithm outperforms a highly-optimized sequential algorithm (with O(n log k) cost) on inputs with k ≤ 3×10^5. Our algorithm is also much faster than the best existing parallel implementation by Shen et al. (2022) on all input instances.
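For reference, the sequential O(n log k) baseline mentioned above (DP with binary search over the tails of increasing subsequences) looks roughly like this; it is a sketch of the classic technique, not the paper's parallel algorithm:

```cpp
#include <algorithm>
#include <cstdio>
#include <vector>

// Classic sequential LIS via DP with binary search: tails[j] holds the
// smallest possible tail of a (strictly) increasing subsequence of length
// j + 1 seen so far. Since tails never exceeds length k, this is O(n log k).
int lisLength(const std::vector<int>& a) {
    std::vector<int> tails;
    for (int x : a) {
        auto it = std::lower_bound(tails.begin(), tails.end(), x);
        if (it == tails.end()) tails.push_back(x);   // extend the longest subsequence
        else *it = x;                                // improve an existing tail
    }
    return (int)tails.size();
}

int main() {
    std::vector<int> a = {3, 1, 4, 1, 5, 9, 2, 6};
    std::printf("LIS length: %d\n", lisLength(a));   // e.g. 1 4 5 9 -> length 4
}
```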
ISBN (print): 9781450395458
Reducing the execution time of the ORB-SLAM algorithm is a crucial aspect of autonomous vehicles, since it is computationally intensive for embedded boards. We propose a parallel GPU-based implementation, able to run on embedded boards, of the Tracking part of the ORB-SLAM2/3 algorithm. Our implementation is not simply a GPU port of the tracking phase. Instead, we propose a novel method to accelerate image pyramid construction on GPUs. A comparison against state-of-the-art CPU and GPU implementations, considering both computational time and trajectory errors, shows improved execution time on well-known datasets such as KITTI and EuRoC.
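As a point of reference only, the structure of image pyramid construction is repeated downsampling of the input image; the plain CPU sketch below uses a power-of-two 2x2 box filter and a float image type as illustrative assumptions (ORB extractors typically use a finer scale factor, and this is not the paper's GPU method):

```cpp
#include <cstdio>
#include <vector>

// A grayscale image stored row-major with float pixels.
struct Image {
    int w, h;
    std::vector<float> px;
    float at(int x, int y) const { return px[y * w + x]; }
};

// Build an image pyramid by repeated 2x2 box-filter downsampling.
std::vector<Image> buildPyramid(const Image& base, int levels) {
    std::vector<Image> pyr = {base};
    for (int l = 1; l < levels && pyr.back().w > 1 && pyr.back().h > 1; ++l) {
        const Image& prev = pyr.back();
        Image next{prev.w / 2, prev.h / 2, {}};
        next.px.resize((size_t)next.w * next.h);
        for (int y = 0; y < next.h; ++y)
            for (int x = 0; x < next.w; ++x)
                next.px[y * next.w + x] =
                    0.25f * (prev.at(2 * x, 2 * y)     + prev.at(2 * x + 1, 2 * y) +
                             prev.at(2 * x, 2 * y + 1) + prev.at(2 * x + 1, 2 * y + 1));
        pyr.push_back(std::move(next));
    }
    return pyr;
}

int main() {
    Image img{8, 8, std::vector<float>(64, 1.0f)};
    for (const auto& lvl : buildPyramid(img, 4)) std::printf("level: %dx%d\n", lvl.w, lvl.h);
}
```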
ISBN (print): 9781450395458
Semisort is a fundamental algorithmic primitive widely used in the design and analysis of efficient parallel algorithms. It takes as input an array of records and a function extracting a key per record, and reorders the records so that those with equal keys are contiguous. Since many applications only require collecting equal values, but not fully sorting the input, semisort is broadly applicable, e.g., in string algorithms, graph analytics, and geometry processing, among many other domains. However, despite dozens of recent papers that use semisort in their theoretical analysis and the existence of an asymptotically optimal parallel semisort algorithm, most implementations of these parallel algorithms choose to implement semisort using comparison or integer sorting in practice, due to potential performance issues in existing semisort implementations. In this paper, we revisit the semisort problem, with the goal of achieving a high-performance parallel semisort implementation with a flexible interface. Our approach can easily be extended to two related problems, histogram and collect-reduce. Our algorithms achieve strong speedups in practice and, importantly, outperform state-of-the-art parallel sorting and semisorting methods for almost all settings we tested, with varying input sizes, distributions, and key types. On average (geometric mean), our semisort implementation is at least 1.27x faster than the best of the tested baselines. We also test two important applications with real-world data and show that our algorithms improve performance (by up to 2.13x) over existing approaches. We believe that many other parallel algorithm implementations can be accelerated using our results.
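As a reminder of the problem definition (and not the paper's parallel algorithm), a sequential semisort can simply bucket records by key and concatenate the buckets; a minimal sketch, where the key-extraction function is caller-supplied as in the abstract:

```cpp
#include <cstdio>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Sequential semisort sketch: reorder records so that records with equal keys
// become contiguous, without imposing any order between different keys.
template <class Record, class KeyFn>
std::vector<Record> semisort(const std::vector<Record>& in, KeyFn key) {
    std::unordered_map<decltype(key(in[0])), std::vector<Record>> groups;
    for (const Record& r : in) groups[key(r)].push_back(r);   // bucket by key
    std::vector<Record> out;
    out.reserve(in.size());
    for (auto& [k, bucket] : groups)                          // concatenate buckets
        for (Record& r : bucket) out.push_back(std::move(r));
    return out;
}

int main() {
    std::vector<std::pair<std::string, int>> recs =
        {{"b", 1}, {"a", 2}, {"b", 3}, {"c", 4}, {"a", 5}};
    auto grouped = semisort(recs, [](const auto& r) { return r.first; });
    for (auto& [k, v] : grouped) std::printf("%s %d\n", k.c_str(), v);
}
```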
ISBN (print): 9781450395458
In this paper, we focus on the parallel communication cost of multiplying a matrix with its transpose, known as a symmetric rank-k update (SYRK). SYRK requires half the computation of general matrix multiplication because of the symmetry of the output matrix. Recent work (Beaumont et al., SPAA '22) has demonstrated that the sequential I/O complexity of SYRK is also a constant factor smaller than that of general matrix multiplication. Inspired by this progress, we establish memory-independent parallel communication lower bounds for SYRK with smaller constants than general matrix multiplication, and we show that these constants are tight by presenting communication-optimal algorithms. The crux of the lower bound proof relies on extending a key geometric inequality to symmetric computations and analytically solving a constrained nonlinear optimization problem. The optimal algorithms use a triangular blocking scheme for parallel distribution of the symmetric output matrix and corresponding computation.
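For reference, the computation SYRK performs, restricted to the lower triangle of the symmetric output, is sketched below; this is the sequential kernel the paper's communication-optimal parallel algorithms distribute, not those algorithms themselves:

```cpp
#include <cstdio>
#include <vector>

// Sequential SYRK sketch: C = A * A^T for an n x k matrix A (row-major),
// computing only the lower triangle of C and thus roughly half the flops
// of a general matrix multiply; the upper triangle follows by symmetry.
void syrkLower(const std::vector<double>& A, int n, int k, std::vector<double>& C) {
    for (int i = 0; i < n; ++i)
        for (int j = 0; j <= i; ++j) {           // only j <= i: lower triangle
            double s = 0.0;
            for (int t = 0; t < k; ++t) s += A[i * k + t] * A[j * k + t];
            C[i * n + j] = s;                    // C[j][i] is implied by symmetry
        }
}

int main() {
    int n = 3, k = 2;
    std::vector<double> A = {1, 2,
                             3, 4,
                             5, 6};
    std::vector<double> C(n * n, 0.0);
    syrkLower(A, n, k, C);
    for (int i = 0; i < n; ++i, std::printf("\n"))
        for (int j = 0; j <= i; ++j) std::printf("%6.1f ", C[i * n + j]);
}
```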
ISBN (print): 9798400703836
We show how to use parallelization to speed up sampling from an arbitrary distribution μ on a product space [q]^n, given oracle access to counting queries: Pr_{X∼μ}[X_S = σ_S] for any S ⊆ [n] and σ_S ∈ [q]^S. Our algorithm takes O(n^{2/3} · polylog(n, q)) parallel time, which is, to the best of our knowledge, the first sublinear-in-n runtime for arbitrary distributions. Our results have implications for sampling in autoregressive models. Our algorithm directly works with an equivalent oracle that answers conditional marginal queries Pr_{X∼μ}[X_i = σ_i | X_S = σ_S], whose role is played by a trained neural network in autoregressive models. This suggests a roughly n^{1/3}-factor speedup is possible for sampling in any-order autoregressive models. We complement our positive result by showing a lower bound of Ω̃(n^{1/3}) for the runtime of any parallel sampling algorithm making at most poly(n) queries to the counting oracle, even for q = 2.
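For context, the sequential baseline such a parallel sampler improves on draws coordinates one at a time from the conditional-marginal oracle; a minimal sketch, with a placeholder uniform oracle standing in for a trained autoregressive model (the types and names here are illustrative assumptions):

```cpp
#include <cstdio>
#include <functional>
#include <random>
#include <vector>

// Conditional-marginal oracle: Pr[X_i = value | coordinates in `prefix` fixed].
using Oracle = std::function<double(int i, int value, const std::vector<int>& prefix)>;

// Sequential autoregressive sampling: fix X_0, ..., X_{n-1} one coordinate at
// a time, conditioning each draw on the coordinates already fixed. This takes
// n sequential rounds of oracle calls, which is what parallelization targets.
std::vector<int> sampleSequentially(int n, int q, const Oracle& condMarginal,
                                    std::mt19937& rng) {
    std::vector<int> x;
    for (int i = 0; i < n; ++i) {
        std::vector<double> probs(q);
        for (int s = 0; s < q; ++s) probs[s] = condMarginal(i, s, x);
        std::discrete_distribution<int> d(probs.begin(), probs.end());
        x.push_back(d(rng));                  // coordinate i is now fixed
    }
    return x;
}

int main() {
    std::mt19937 rng(42);
    int n = 8, q = 2;
    Oracle uniform = [&](int, int, const std::vector<int>&) { return 1.0 / q; };
    for (int v : sampleSequentially(n, q, uniform, rng)) std::printf("%d ", v);
    std::printf("\n");
}
```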
ISBN (print): 9781450395458
Determining the degree of inherent parallelism in classical sequential algorithms and leveraging it for fast parallel execution is a key topic in parallel computing, and detailed analyses are known for a wide range of classical algorithms. In this paper, we perform the first such analysis for the fundamental Union-Find problem, in which we are given a graph as a sequence of edges, and must maintain its connectivity structure under edge additions. We prove that classic sequential algorithms for this problem are well-parallelizable under reasonable assumptions, addressing a conjecture by [Blelloch, 2017]. More precisely, we show via a new potential argument that, under uniform random edge ordering, parallel union-find operations are unlikely to interfere: T concurrent threads processing the graph in parallel will encounter memory contention O(T^2 · log|V| · log|E|) times in expectation, where |E| and |V| are the number of edges and nodes in the graph, respectively. We leverage this result to design a new parallel Union-Find algorithm that is not only internally deterministic, i.e., its results are guaranteed to match those of a sequential execution, but also work-efficient and scalable, as long as the number of threads T is O(|E|^{1/3-ε}), for an arbitrarily small constant ε > 0, which holds for most large real-world graphs. We present lower bounds which show that our analysis is close to optimal, and experimental results suggesting that the performance cost of internal determinism is limited.
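For reference, the classic sequential Union-Find algorithm whose parallelizability the paper analyzes (path compression plus union by rank, processing a sequence of edges) can be sketched as follows:

```cpp
#include <cstdio>
#include <numeric>
#include <utility>
#include <vector>

// Classic sequential Union-Find maintaining connectivity under edge additions.
struct UnionFind {
    std::vector<int> parent, rank_;
    explicit UnionFind(int n) : parent(n), rank_(n, 0) {
        std::iota(parent.begin(), parent.end(), 0);
    }
    int find(int v) {                       // path halving (a path-compression variant)
        while (parent[v] != v) { parent[v] = parent[parent[v]]; v = parent[v]; }
        return v;
    }
    bool unite(int u, int v) {              // union by rank
        u = find(u); v = find(v);
        if (u == v) return false;
        if (rank_[u] < rank_[v]) std::swap(u, v);
        parent[v] = u;
        if (rank_[u] == rank_[v]) ++rank_[u];
        return true;
    }
};

int main() {
    UnionFind uf(5);
    std::vector<std::pair<int, int>> edges = {{0, 1}, {2, 3}, {1, 2}};
    for (auto [u, v] : edges) uf.unite(u, v);
    std::printf("0 and 3 connected: %s\n", uf.find(0) == uf.find(3) ? "yes" : "no");
    std::printf("0 and 4 connected: %s\n", uf.find(0) == uf.find(4) ? "yes" : "no");
}
```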