Distributed shared memory (DSM) systems can handle data-intensive applications and have recently been receiving more attention. A majority of existing DSM implementations are based on write-invalidation (WI) protocols, which achieve sub-optimal performance when the cache size is small. Specifically, the vast majority of invalidation messages become useless when evictions are frequent. The problem is aggravated by the scarcity of memory resources in data centers. To this end, we propose Falcon, a self-invalidation protocol that eliminates invalidation messages. It relies on per-operation timestamps to achieve the global memory order required by sequential consistency (SC). Furthermore, we conduct a comprehensive discussion of the two protocols with an emphasis on the impact of cache size. We also implement both protocols atop a recent DSM system, Grappa. The evaluation shows that the optimal protocol can improve the performance of a KV database by 27% and a graph processing application by 71.4% against the vanilla cache-free scheme.
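To make the self-invalidation idea concrete, the sketch below (hypothetical names and lease mechanism; it is not the Falcon protocol itself, whose details are not given in this abstract) shows how a reader can drop its own cached copy once a per-block timestamp lease expires, so the writer never needs to send invalidation messages:

    import time

    class CachedBlock:
        def __init__(self, data, write_ts, lease):
            self.data = data
            self.write_ts = write_ts   # timestamp assigned by the last writer
            self.lease = lease         # validity window granted with the data

    class SelfInvalidatingCache:
        """Hypothetical sketch: the reader invalidates its own copy when the
        per-operation timestamp says it may be stale, so the writer never has
        to broadcast invalidation messages."""
        def __init__(self, fetch_remote):
            self.blocks = {}
            self.fetch_remote = fetch_remote   # callback to the home node

        def read(self, addr, now=None):
            now = time.monotonic() if now is None else now
            blk = self.blocks.get(addr)
            if blk is None or now - blk.write_ts > blk.lease:
                # self-invalidate: the copy is (possibly) stale, re-fetch it
                data, write_ts, lease = self.fetch_remote(addr)
                blk = self.blocks[addr] = CachedBlock(data, write_ts, lease)
            return blk.data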
ISBN (digital): 9781665497473
ISBN (print): 9781665497480
The concept of memory disaggregation has recently been gaining traction in research. With memory disaggregation, data center compute nodes can directly access memory on adjacent nodes and are therefore able to overcome local memory restrictions, introducing a new data management paradigm for distributed computing. This paper proposes and demonstrates a memory-disaggregated in-memory object store framework for big data applications by leveraging the newly introduced ThymesisFlow memory disaggregation system. The framework extends the functionality of the pre-existing Apache Arrow Plasma object store framework to distributed systems by enabling clients to easily and efficiently produce and consume data objects across multiple compute nodes. This allows big data applications to increasingly leverage parallel processing at reduced development costs. In addition, the paper includes latency and throughput measurements indicating that only a modest performance penalty is incurred for remote disaggregated memory access as opposed to local access (~6.5 vs ~5.75 GiB/s). The results can be used to guide the design of future systems that leverage memory disaggregation as well as the newly presented framework. This work is open-source and publicly accessible at https://***/10.5281/zenodo.6368998.
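The produce/consume pattern that the framework extends across nodes can be sketched against the classic single-node pyarrow.plasma API (available in pyarrow releases before 12.0); the disaggregated framework's own API is not shown in this abstract, so the snippet below is only an analogy:

    import numpy as np
    import pyarrow.plasma as plasma

    client = plasma.connect("/tmp/plasma")      # a plasma_store must already be running

    # Producer side: allocate an object in the store, fill it, then seal it.
    object_id = plasma.ObjectID(np.random.bytes(20))
    payload = b"hello disaggregated memory"
    buf = memoryview(client.create(object_id, len(payload)))
    buf[:] = payload
    client.seal(object_id)                      # sealed objects become immutable and visible

    # Consumer side (same node with vanilla Plasma; a remote node in the
    # disaggregated framework): fetch the object by ID without copying.
    [view] = client.get_buffers([object_id])
    print(bytes(view))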
ISBN (print): 9781665417303
The constant growth of social media, unconventional web technologies, mobile applications, and Internet of Things (IoT) devices creates challenges for cloud data systems to support huge datasets and very high request rates. NoSQL distributed databases such as Cassandra have been used for unstructured data storage and to increase horizontal scalability and high availability. In this paper, we evaluate Cassandra on a low-power, low-cost cluster of commodity Single Board Computers (SBCs). The cluster has 15 Raspberry Pi v3 nodes and uses the Docker Swarm orchestration tool for Cassandra service deployment and ingress load balancing over the SBCs. Experimental results demonstrate that hardware limitations impacted workload throughput, but read and write latencies were comparable to results from other works on high-end or virtualized platforms. Despite the observed limitations, the results show that a low-cost SBC cluster can support cloud serving goals such as scale-out, elasticity, and high availability.
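As a rough illustration of how such a deployment would be exercised from a client (hypothetical host name and keyspace; this is not the paper's benchmark setup), a few lines with the Python cassandra-driver suffice to measure single-request read and write latencies against the Swarm ingress:

    import time
    from cassandra.cluster import Cluster

    cluster = Cluster(["raspberrypi-node1"])    # any SBC behind the ingress load balancer
    session = cluster.connect()
    session.execute("CREATE KEYSPACE IF NOT EXISTS bench WITH replication = "
                    "{'class': 'SimpleStrategy', 'replication_factor': 3}")
    session.execute("CREATE TABLE IF NOT EXISTS bench.kv (k int PRIMARY KEY, v text)")

    t0 = time.perf_counter()
    session.execute("INSERT INTO bench.kv (k, v) VALUES (%s, %s)", (1, "x" * 100))
    write_ms = (time.perf_counter() - t0) * 1000

    t0 = time.perf_counter()
    session.execute("SELECT v FROM bench.kv WHERE k = %s", (1,)).one()
    read_ms = (time.perf_counter() - t0) * 1000
    print(f"write {write_ms:.2f} ms, read {read_ms:.2f} ms")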
ISBN (print): 9781450386104
Variational quantum algorithm (VQA), which comprises a classical optimizer and a parameterized quantum circuit, emerges as one of the most promising approaches for harvesting the power of quantum computers in the noisy intermediate-scale quantum (NISQ) era. However, the deployment of VQAs on contemporary NISQ devices often faces considerable system and time-dependent noise and prohibitively slow training speeds. On the other hand, the expensive supporting resources and infrastructure make high utilization of quantum computers extremely important. In this paper, we propose a virtualized way of building up a quantum backend for variational quantum algorithms: rather than relying on a single physical device, which tends to introduce ever-changing device-specific noise and increasingly unreliable performance as the time since calibration grows, we propose to constitute a quantum ensemble, which dynamically distributes quantum tasks asynchronously across a set of physical devices and adjusts the ensemble configuration with respect to machine status. In addition to reduced machine-dependent noise, the ensemble can provide significant speedups for VQA training. With this idea, we build a novel VQA training framework called EQC - a distributed, gradient-based, processor-performance-aware optimization system - that comprises: (i) a system architecture for asynchronous parallel VQA cooperative training; (ii) an analytical model for assessing the quality of a circuit output with respect to its architecture, transpilation, and runtime conditions; (iii) a weighting mechanism to adjust the quantum ensemble's computational contribution according to the systems' current performance. Evaluations comprising 500K circuit evaluations across 10 IBMQ NISQ devices using VQE and QAOA applications demonstrate that EQC can attain error rates very close to those of the most performant device of the ensemble, while boosting the training speed by 10.5x on average (up to 86x and at least 5.2x). EQC is available at
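A toy version of the performance-aware weighting idea in point (iii) might look as follows (the exponential weighting function and the error estimates are assumptions, not EQC's actual analytical model): gradient estimates coming from noisier backends are simply down-weighted before the parameter update:

    import numpy as np

    def combine_gradients(grads, error_rates, sharpness=10.0):
        """grads: list of parameter-gradient vectors, one per backend.
        error_rates: estimated per-backend error (e.g. from recent calibration data).
        Returns a weighted average that down-weights noisier backends."""
        grads = np.asarray(grads)
        weights = np.exp(-sharpness * np.asarray(error_rates))
        weights /= weights.sum()
        return (weights[:, None] * grads).sum(axis=0)

    # Example: three backends return slightly different gradient estimates;
    # the last backend is currently noisy and contributes little.
    g = [[0.10, -0.31], [0.12, -0.29], [0.40, -0.05]]
    err = [0.01, 0.02, 0.20]
    print(combine_gradients(g, err))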
ISBN (digital): 9781665497473
ISBN (print): 9781665497480
Graph neural networks (GNNs) operate on data represented as graphs and are useful for a wide variety of tasks, from chemical reaction and protein structure prediction to content recommendation systems. However, training on large graphs and improving training performance remain significant challenges. Existing distributed training systems partition a graph among all compute nodes to train on large graphs; however, this incurs a communication overhead that degrades training performance. In this study, to solve these two problems, we propose a scalable data-parallel distributed GNN training system designed to partition a graph redundantly. It is implemented using remote direct memory access (RDMA) and non-blocking active messages to efficiently utilize network performance and to hide communication overhead by overlapping it with the training computation. Experimental results show the strong scalability of the proposed approach, which achieved parallel efficiencies of 0.93 using eight compute nodes for the ogbn-products dataset in the Open Graph Benchmark (OGB) and 0.95 using 32 compute nodes (relative to two compute nodes) for the ogbn-papers100M dataset. The proposed system exhibited training performance 18.9% better than the state-of-the-art DistDGL, even with only a single compute node. The results demonstrate that the proposed approach is a promising method for achieving scalable training performance on large graphs.
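For reference, parallel efficiency as quoted above is usually computed as speedup divided by the increase in node count; a quick back-of-the-envelope check with hypothetical epoch times (not figures from the paper) reproduces a value like 0.93 for eight nodes:

    # efficiency = (baseline_time * baseline_nodes) / (time_on_n_nodes * n_nodes)
    def parallel_efficiency(t_base, n_base, t_n, n):
        return (t_base * n_base) / (t_n * n)

    # Hypothetical epoch times in seconds, chosen only to illustrate the formula:
    print(parallel_efficiency(t_base=800, n_base=1, t_n=107.5, n=8))   # ~0.93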
Modern distributed storage systems with massive data and many storage nodes pose higher requirements on the data placement strategy. Furthermore, with the emergence of new storage devices, heterogeneous storage architectures have become increasingly common and popular. However, traditional strategies expose great limitations in the face of these requirements; in particular, they do not adequately consider the distinct characteristics of heterogeneous storage nodes, which leads to suboptimal performance. In this paper, we present and evaluate RLRP, a deep reinforcement learning (RL) based replica placement strategy. RLRP constructs placement and migration agents through the Deep Q-Network (DQN) model to achieve fair distribution and adaptive data migration. Besides, RLRP provides optimal performance for heterogeneous environments through an attentional Long Short-Term Memory (LSTM) model. Finally, RLRP adopts stagewise training and model fine-tuning to accelerate the training of RL models with large-scale state and action spaces. RLRP is implemented on Park, and the evaluation results indicate that RLRP is a highly efficient data placement strategy for modern distributed storage systems. RLRP can reduce read latency by 10%∼50% in heterogeneous environments compared with existing strategies. In addition, RLRP is used in the real-world system Ceph, improving the read performance of Ceph by 30%∼40%.
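A minimal sketch of how a DQN-style agent can drive replica placement (hypothetical state encoding and network; not the RLRP implementation) is shown below: the state summarizes per-node load and latency, and the action is the index of the node that receives the next replica:

    import random
    import torch
    import torch.nn as nn

    class PlacementQNet(nn.Module):
        def __init__(self, state_dim, num_nodes):
            super().__init__()
            self.net = nn.Sequential(
                nn.Linear(state_dim, 64), nn.ReLU(),
                nn.Linear(64, num_nodes))       # one Q-value per candidate node

        def forward(self, state):
            return self.net(state)

    def choose_node(qnet, state, epsilon=0.1):
        """Epsilon-greedy placement decision over the per-node Q-values."""
        if random.random() < epsilon:
            return random.randrange(qnet.net[-1].out_features)
        with torch.no_grad():
            return int(qnet(state).argmax())

    # Example state: per-node utilization and recent read latency (ms)
    # for a 4-node heterogeneous cluster.
    state = torch.tensor([0.7, 0.2, 0.5, 0.1,
                          2.0, 0.5, 1.0, 8.0])
    qnet = PlacementQNet(state_dim=8, num_nodes=4)
    print("place replica on node", choose_node(qnet, state))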
ISBN (digital): 9781665497473
ISBN (print): 9781665497480
To amortize the cost of MPI communications, distributed parallel HPC applications can overlap network communications with computations in the hope that this improves global application performance. When using this technique, both computations and communications run at the same time. But computation usually also performs some data movement. Since data for computations and data for communications use the same memory system, memory contention may occur when computations are memory-bound and large messages are transmitted through the network at the same time. In this paper we propose a model to predict the memory bandwidth available to computations and to communications when they are executed side by side, according to data locality and taking contention into account. Building the model allowed us to better understand where bottlenecks are located in the memory system and which strategies the memory system applies in case of contention. The model was evaluated on many platforms with different characteristics and showed an average prediction error lower than 4%.
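A deliberately simplified contention model (not the one proposed in the paper) conveys the underlying intuition: when the standalone bandwidth demands of the computation and of the communication exceed what the memory system can sustain, both streams are throttled:

    def shared_bandwidth(compute_demand, comm_demand, peak):
        """All values in GB/s; returns (compute_bw, comm_bw) under contention,
        assuming the available bandwidth is shared proportionally."""
        total = compute_demand + comm_demand
        if total <= peak:
            return compute_demand, comm_demand      # no contention
        scale = peak / total
        return compute_demand * scale, comm_demand * scale

    # Example: a memory-bound kernel (60 GB/s) plus large MPI transfers (30 GB/s)
    # on a socket whose sustainable memory bandwidth is 70 GB/s.
    print(shared_bandwidth(60, 30, 70))             # both streams are slowed down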
The use of accelerators such as GPUs has become mainstream to achieve high performance on modern computing systems. GPUs come with their own (limited) memory and are connected to the main memory of the machine through a bus (with limited bandwidth). When a computation is started on a GPU, the corresponding data needs to be transferred to the GPU before the computation starts. Such data movements may become a bottleneck for performance, especially when several GPUs have to share the communication bus. Task-based runtime schedulers have emerged as a convenient and efficient way to use such heterogeneous platforms. When processing an application, the scheduler has knowledge of all tasks available for processing on a GPU, as well as of their input data dependencies. Hence, it is able to choose which task to allocate to which GPU and to reorder tasks so as to minimize data movements. We focus on this problem of partitioning and ordering tasks that share some of their input data. We present a novel dynamic strategy based on data selection to efficiently allocate tasks to GPUs, together with a custom eviction policy, and compare them to existing strategies using either a well-known graph partitioner or standard scheduling techniques in runtime systems. We also improve an offline scheduler recently proposed for a single GPU by adding load balancing and task stealing capabilities. All strategies have been implemented on top of the StarPU runtime, and we show that our dynamic strategy achieves better performance when scheduling tasks on multiple GPUs with limited memory.
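The data-affinity intuition behind such dynamic strategies can be sketched with a hypothetical helper (this is not StarPU's scheduler API): among the ready tasks, pick the one whose missing inputs would require the least data transfer to the GPU:

    def pick_task(ready_tasks, data_on_gpu, data_size):
        """ready_tasks: {task: set of input data ids}; data_on_gpu: set of data ids
        already resident on this GPU; data_size: {data_id: bytes}.
        Returns the ready task that minimizes the bytes still to be fetched."""
        def missing_bytes(task):
            return sum(data_size[d] for d in ready_tasks[task] - data_on_gpu)
        return min(ready_tasks, key=missing_bytes)

    tasks = {"t1": {"A", "B"}, "t2": {"B", "C"}, "t3": {"C", "D"}}
    on_gpu = {"A", "B"}
    sizes = {"A": 4e6, "B": 4e6, "C": 8e6, "D": 8e6}
    print(pick_task(tasks, on_gpu, sizes))          # -> "t1" (no transfer needed)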
ISBN (print): 9781450382175
Several trends in the IT industry are driving an increasing specialization of the hardware layers. On the one hand, demanding workloads, large data volumes, diversity in data types, etc. are all factors contributing to making general-purpose computing too inefficient. On the other hand, cloud computing and its economies of scale allow vendors to invest in specialized hardware for particular tasks that would otherwise be too expensive or consume resources needed elsewhere. In this talk I will discuss the shift towards hardware acceleration and show with several examples why specialized systems are here to stay and are likely to dominate the computer landscape for years to come. I will also discuss Enzian, an open research platform developed at ETH to enable the exploration of hardware acceleration, and present some preliminary results achieved with it.
Graph streaming has received substantial attention for the past 10+ years to cope with large-scale graph computation. Two major approaches, one using conventional data-streaming tools and the other accessing graph dat...