Gang scheduling is a resource management scheme for parallel and distributed systems that combines time-sharing with space-sharing to ensure short response times for interactive tasks and high overall system throughput. In this paper, we present and analyze a queueing theoretic model for a general gang scheduling scheme that forms the basis of a multiprogramming environment currently being developed for IBM's SP2 parallel system and for clusters of workstations. Our model and analysis can be used to tune our scheduler in order to maximize its performance on each hardware platform.
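To make the time-sharing/space-sharing combination concrete, the sketch below (a hypothetical illustration, not the paper's queueing model or the SP2 scheduler) packs fixed-size gangs into the rows of an Ousterhout-style scheduling matrix and then cycles through the rows round-robin. NUM_PROCESSORS, the job names, and the gang sizes are invented for the example.

    NUM_PROCESSORS = 8

    def pack_gangs(gangs):
        """First-fit packing of (name, size) gangs into time slots of width NUM_PROCESSORS."""
        slots = []
        for name, size in gangs:
            for slot in slots:
                if slot["free"] >= size:        # space-sharing: co-schedule within a slot
                    slot["free"] -= size
                    slot["gangs"].append(name)
                    break
            else:
                slots.append({"free": NUM_PROCESSORS - size, "gangs": [name]})
        return slots

    def run_round_robin(slots, quanta=2):
        """Time-sharing: cycle through the slots; all gangs in a slot run simultaneously."""
        for q in range(quanta):
            for i, slot in enumerate(slots):
                print(f"quantum {q}, slot {i}: gangs {slot['gangs']} run together")

    # Hypothetical gangs: (job name, number of tasks that must run simultaneously).
    run_round_robin(pack_gangs([("A", 4), ("B", 4), ("C", 6), ("D", 2)]))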
The current paper provides preliminary statements of the panelists ahead of a panel discussion at the ACM SPAA 2021 conference on the topic: algorithm-friendly architecture versus architecture-friendly algorithms. ...
ISBN: 9780897918909 (print)
This paper describes the derivation of an empirically efficient parallel two-dimensional Delaunay triangulation program from a theoretically efficient CREW PRAM algorithm. Compared to previous work, the resulting implementation is not limited to datasets with a uniform distribution of points, achieves significantly better speedups over good serial code, and is widely portable due to its use of MPI as a communication mechanism. Results are presented for a loosely-coupled cluster of workstations, a distributed-memory multicomputer, and a shared-memory multiprocessor. The Machiavelli toolkit used to transform the nested data parallelism inherent in the divide-and-conquer algorithm into achievable task and data parallelism is also described and compared to previous techniques.
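As a rough illustration of how a divide-and-conquer algorithm of this kind maps nested data parallelism onto task parallelism, the sketch below (not the actual Delaunay code) recursively halves both the point set and the processor group, dropping to serial code once a group holds a single processor. The helpers serial_triangulate, split_points, and merge are hypothetical stubs; in the real Machiavelli/MPI implementation the two recursive calls run concurrently on disjoint sub-communicators.

    def serial_triangulate(points):
        # Placeholder: the real implementation calls an efficient serial Delaunay routine.
        return [tuple(points)]

    def split_points(points):
        pts = sorted(points)          # median cut along the x-axis
        mid = len(pts) // 2
        return pts[:mid], pts[mid:]

    def merge(left, right):
        # Placeholder: the real implementation stitches the two triangulations along the cut.
        return left + right

    def solve(points, processor_group):
        if len(processor_group) == 1 or len(points) <= 3:
            return serial_triangulate(points)          # good serial code at the leaves
        left_pts, right_pts = split_points(points)
        half = len(processor_group) // 2
        # In the MPI version these two calls run concurrently on disjoint sub-groups;
        # here they are shown sequentially for clarity.
        left = solve(left_pts, processor_group[:half])
        right = solve(right_pts, processor_group[half:])
        return merge(left, right)

    print(solve([(3, 1), (0, 0), (2, 2), (1, 4), (5, 3), (4, 0)], list(range(4))))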
In this paper we consider the problem of finding perfect matchings in parallel. We present an RNC algorithm with almost optimal work with respect to sequential algorithms, i.e., it uses O(n^ω) processors, where ω is the matrix multiplication exponent. Our algorithm is based on an RNC algorithm for computing the determinant of a degree-one polynomial matrix, which is of independent interest.
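For background, the randomized ingredient typically underlying RNC matching algorithms is Lovász's test: substitute random field elements for the indeterminates of the graph's Tutte matrix and check whether the determinant is nonzero. The sequential sketch below illustrates only this test (the prime P and the 4-cycle example are arbitrary choices); it is not the paper's processor-efficient algorithm.

    import random

    P = 1_000_003  # a prime large enough to keep the failure probability small

    def det_mod_p(a, p=P):
        """Determinant of an integer matrix modulo prime p via Gaussian elimination."""
        n = len(a)
        a = [row[:] for row in a]
        det = 1
        for col in range(n):
            pivot = next((r for r in range(col, n) if a[r][col] % p), None)
            if pivot is None:
                return 0
            if pivot != col:
                a[col], a[pivot] = a[pivot], a[col]
                det = -det
            det = det * a[col][col] % p
            inv = pow(a[col][col], p - 2, p)
            for r in range(col + 1, n):
                factor = a[r][col] * inv % p
                for c in range(col, n):
                    a[r][c] = (a[r][c] - factor * a[col][c]) % p
        return det % p

    def probably_has_perfect_matching(n, edges):
        """Build a random Tutte matrix of an n-vertex graph and test its determinant."""
        t = [[0] * n for _ in range(n)]
        for u, v in edges:
            x = random.randrange(1, P)
            t[u][v], t[v][u] = x, (-x) % P   # skew-symmetric Tutte matrix
        return det_mod_p(t) != 0

    # Almost surely prints True: the 4-cycle has a perfect matching.
    print(probably_has_perfect_matching(4, [(0, 1), (1, 2), (2, 3), (3, 0)]))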
ISBN: 9780897918909 (print)
An earlier parallel list ranking algorithm performs well for problem sizes N that are extremely large in comparison to the number of PUs P. However, no existing algorithm gives good performance for reasonable loads. We present a novel family of algorithms that achieves a better trade-off between the number of start-ups and the routing volume. We have implemented them on an Intel Paragon, and they turn out to considerably outperform all earlier algorithms: with P = 2 the sequential algorithm is already beaten for N = 25,000; for P = 100 and N = 10^7, the speed-up is 21, and for N = 10^8 it even reaches 30. A modification of one of our algorithms settles a theoretical question: we show that on one-dimensional processor arrays, list ranking can be solved with a number of steps equal to the diameter of the network.
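For readers unfamiliar with the problem, list ranking asks for each node's distance from the end of a linked list. The sketch below simulates the classic pointer-jumping approach (Wyllie) sequentially; it only defines the problem and the basic parallel primitive, and is not one of the start-up/routing-volume trade-off algorithms introduced in the paper.

    import math

    def list_rank(succ):
        """succ[i] is the successor of node i; the last node points to itself."""
        n = len(succ)
        rank = [0 if succ[i] == i else 1 for i in range(n)]
        nxt = succ[:]
        # ceil(log2 n) synchronous pointer-jumping rounds; on a PRAM each round
        # is a constant number of parallel steps.
        for _ in range(max(1, math.ceil(math.log2(n)))):
            rank = [rank[i] + rank[nxt[i]] for i in range(n)]
            nxt = [nxt[nxt[i]] for i in range(n)]
        return rank

    # List 0 -> 1 -> 2 -> 3: the ranks are the distances to the tail.
    print(list_rank([1, 2, 3, 3]))   # [3, 2, 1, 0]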
ISBN: 9781595939739 (print)
There are many algorithms for solving large sparse linear systems in parallel; however, most of them require synchronization and thus lack scalability. In this paper, we propose a new distributed numerical algorithm, called the Directed Transmission Method (DTM). DTM is a fully asynchronous, scalable, continuous-time iterative algorithm for solving arbitrarily large sparse linear systems whose coefficient matrix is symmetric positive definite (SPD). DTM can run freely on heterogeneous parallel computers with an arbitrary number of processors, which might be many-core microprocessors, clusters, grids, clouds, or the Internet. We prove that DTM is convergent by making use of the final value theorem of the Laplace transform. Numerical experiments show that DTM is efficient.
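The DTM algorithm itself is not reproduced here; as a hedged stand-in, the sketch below shows the much simpler classical idea it generalizes, namely an iterative scheme in which each unknown repeatedly re-solves its own equation from its neighbours' current values (a Jacobi sweep), on a small diagonally dominant SPD system. DTM, by contrast, is asynchronous and continuous-time.

    def jacobi(a, b, iters=200):
        """Plain synchronous Jacobi iteration; a stand-in, not DTM."""
        n = len(b)
        x = [0.0] * n
        for _ in range(iters):
            x = [(b[i] - sum(a[i][j] * x[j] for j in range(n) if j != i)) / a[i][i]
                 for i in range(n)]
        return x

    # Small tridiagonal SPD example; the exact solution is [1, 1, 1].
    a = [[3.0, -1.0, 0.0],
         [-1.0, 3.0, -1.0],
         [0.0, -1.0, 3.0]]
    b = [2.0, 1.0, 2.0]
    print(jacobi(a, b))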
We discuss parallel sorting algorithms and their implementations suitable for cluster architectures, with the goal of making better use of cluster resources. We focus on the time spent in computation and on the load-balancing properties when processors run at different speeds, i.e., speeds correlated by a multiplicative constant factor (our weak definition of a heterogeneous platform). One scheme is under study: parallel sorting by sampling, using either the regular sampling technique introduced by Shi and Schaeffer [J. Parallel Distrib. Comput. 14 (4) (1992) 361] or the over-partitioning scheme introduced by Li and Sevcik [Parallel sorting by over-partitioning, in: Proceedings of the Sixth Annual Symposium on Parallel Algorithms and Architectures, ACM Press, New York, June 1994]. What matters most in the paper is the load-balance factor rather than the execution time; it is clear that improved load balance leads to improved execution time. The results presented in the paper demonstrate that load balancing on computers with heterogeneous processing capacity is more challenging than in the homogeneous case. The survey, through the sorting case study, allows us to identify algorithmic issues and software challenges that must be mastered on heterogeneous cluster platforms in order to use them better: data decomposition techniques, scheduling, and load-balancing methods. (C) 2002 Elsevier Science B.V. All rights reserved.
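As a reference point for the regular-sampling scheme mentioned above, the sketch below simulates parallel sorting by regular sampling sequentially, treating each "processor" as a list slice. The sampling positions are simplified, so it illustrates the pivot-selection and exchange structure rather than the exact scheme of Shi and Schaeffer.

    import bisect
    import random

    def psrs(data, p):
        # 1. Split into p blocks and sort each block locally.
        n = len(data)
        blocks = [sorted(data[i * n // p:(i + 1) * n // p]) for i in range(p)]
        # 2. Each block contributes p regularly spaced samples.
        samples = sorted(s for b in blocks for s in b[::max(1, len(b) // p)][:p])
        # 3. Pick p - 1 pivots from the gathered samples.
        pivots = samples[p::p][:p - 1]
        # 4. Partition every block by the pivots; bucket i collects the i-th
        #    partition of every block (the "all-to-all exchange").
        buckets = [[] for _ in range(p)]
        for b in blocks:
            cuts = [0] + [bisect.bisect_right(b, piv) for piv in pivots] + [len(b)]
            for i in range(p):
                buckets[i].extend(b[cuts[i]:cuts[i + 1]])
        # 5. Each bucket is sorted locally; concatenation gives the global order.
        return [x for bucket in buckets for x in sorted(bucket)]

    data = [random.randrange(1000) for _ in range(100)]
    assert psrs(data, 4) == sorted(data)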
Recently there has been an increasing interest in models of parallel computation that account for the bandwidth limitations in communication networks. Some models (e.g., BSP and LOGP) account for bandwidth limitations using a per-processor parameter g>1, such that each processor can send/receive at most h messages in g·h time. Other models (e.g., PRAM(m)) account for bandwidth limitations as an aggregate parameter mΩ(√lg p) separation known previously.
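As a small worked illustration of the per-processor bandwidth parameter described above (the numbers are hypothetical): a processor that sends or receives h messages in a communication phase is charged g·h time.

    # Hypothetical numbers illustrating the per-processor bandwidth parameter g:
    # sending or receiving h messages costs g * h time.
    def comm_time(g, h):
        return g * h

    print(comm_time(g=4, h=10))   # 40 time units for a 10-message exchange at g = 4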
ISBN: 9780897919890 (print)
In this paper we develop models for and analyze several randomized work stealing algorithms in a dynamic setting. Our models represent the limiting behavior of systems as the number of processors grows to infinity using differential equations. The advantages of this approach include the ability to model a large variety of systems and to provide accurate numerical approximations of system behavior even when the number of processors is relatively small. We show how this approach can yield significant intuition about the behavior of work stealing algorithms in realistic settings.
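To illustrate the methodology (though not the paper's specific systems), the sketch below Euler-integrates the standard limiting differential equations for the baseline case of independent queues with Poisson arrivals at rate lam and unit-rate service, where s_i(t) is the fraction of queues holding at least i tasks; work-stealing models add extra transfer terms to equations of this form. All parameter values below are arbitrary.

    def integrate(lam=0.8, max_len=50, dt=0.01, t_end=500.0):
        """Euler-integrate ds_i/dt = lam*(s[i-1]-s[i]) - (s[i]-s[i+1]), truncated at max_len."""
        s = [1.0] + [0.0] * max_len       # s[0] = 1 by convention; queues start empty
        for _ in range(int(t_end / dt)):
            new = s[:]
            for i in range(1, max_len):
                new[i] += dt * (lam * (s[i - 1] - s[i]) - (s[i] - s[i + 1]))
            s = new
        return s

    s = integrate()
    mean_queue_length = sum(s[1:])        # E[length] = sum_i P(length >= i)
    print(round(mean_queue_length, 2))    # approaches lam / (1 - lam) = 4 as t_end grows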
ISBN: 9780897918091 (print)
The effectiveness of private caches for processors is studied. Since the time for all processors to access the shared memory simultaneously is usually much longer than the time for a processor to access its own private cache, scheduling with private caches falls into the distributed-memory model, where the lower bound applies. The effectiveness of private caches is shown by proving that a version of the Dynamic Equipartition Scheduling Policy (DEQ) achieves a mean response time within five times the optimal mean response time, measured in cache clock time, for a large class of parallel jobs well accepted in the parallel-scheduling community. This demonstrates an improvement in system performance from using private caches over a purely shared memory.
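As a hedged sketch of the equipartitioning idea behind DEQ (the allocation rule as commonly described in the parallel-scheduling literature, not this paper's cache-aware analysis), the code below divides P processors equally among the current jobs while never giving a job more processors than it can use, redistributing any surplus. The job names and parallelism figures are hypothetical.

    def equipartition(P, demands):
        """demands: {job: max useful processors}; returns {job: processors allocated}."""
        alloc = {j: 0 for j in demands}
        need = dict(demands)                    # processors each job could still absorb
        while need and P > 0:
            share = max(1, P // len(need))      # equal share of what remains
            for j in list(need):
                give = min(share, need[j], P)   # never exceed a job's parallelism
                alloc[j] += give
                need[j] -= give
                P -= give
                if need[j] == 0:
                    del need[j]
        return alloc

    # Hypothetical workload: 16 processors, four jobs with different parallelism.
    print(equipartition(16, {"A": 2, "B": 20, "C": 5, "D": 9}))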