ISBN (Print): 9781665440660
Stochastic Gradient Descent (SGD) is an essential element in Machine Learning (ML) algorithms. Asynchronous shared-memory parallel SGD (AsyncSGD), including synchronization-free algorithms such as HOGWILD!, has received interest in certain contexts due to reduced overhead compared to synchronous parallelization. Although these algorithms induce staleness and inconsistency, they have shown speedup for problems with smooth, strongly convex targets and gradient sparsity. Recent works take important steps towards understanding the potential of parallel SGD for problems not conforming to these strong assumptions, in particular for deep learning (DL). There is, however, a gap in the current literature in understanding when AsyncSGD algorithms are useful in practice, and in particular how mechanisms for synchronization and consistency play a role. We contribute to answering questions in this gap by studying a spectrum of parallel algorithmic implementations of AsyncSGD, aiming to understand how shared-data synchronization influences the convergence properties in fundamental DL applications. We focus on the impact of consistency-preserving non-blocking synchronization on SGD convergence and on its sensitivity to hyperparameter tuning. We propose Leashed-SGD, an extensible algorithmic framework of consistency-preserving implementations of AsyncSGD, employing lock-free synchronization and effectively balancing throughput and latency. Leashed-SGD features a natural contention-regulating mechanism, as well as dynamic memory management, allocating space only when needed. We argue analytically about the dynamics of the algorithms, memory consumption, the threads' progress over time, and the expected contention. We provide a comprehensive empirical evaluation, validating the analytical claims, benchmarking the proposed Leashed-SGD framework, and comparing to baselines for two prominent DL applications: multilayer perceptrons (MLP) and convolutional neural networks (CNN).
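As a concrete illustration of the synchronization spectrum discussed above, the following minimal sketch (not the paper's Leashed-SGD implementation) contrasts HOGWILD!-style lock-free updates with a lock-based, consistency-preserving variant on a toy least-squares problem; the objective and all names are illustrative assumptions.

```python
# Minimal sketch: asynchronous shared-memory SGD workers on a toy
# least-squares objective. lock=None gives HOGWILD!-style unsynchronized
# updates (stale/inconsistent reads tolerated); passing a lock makes each
# read-update-write atomic at the cost of contention.
import threading
import numpy as np

def sgd_worker(w, X, y, lr, steps, lock=None):
    rng = np.random.default_rng()
    for _ in range(steps):
        i = rng.integers(len(y))
        if lock:
            with lock:
                grad = (X[i] @ w - y[i]) * X[i]   # consistent snapshot of w
                w -= lr * grad
        else:
            grad = (X[i] @ w - y[i]) * X[i]       # possibly stale read of w
            w -= lr * grad                        # racy in-place update

X = np.random.randn(1000, 20)
w_true = np.random.randn(20)
y = X @ w_true
w = np.zeros(20)                                  # shared parameter vector
threads = [threading.Thread(target=sgd_worker, args=(w, X, y, 0.01, 5000))
           for _ in range(4)]
for t in threads: t.start()
for t in threads: t.join()
print("residual:", np.linalg.norm(w - w_true))
```

Swapping in a shared `threading.Lock()` as the `lock` argument yields the consistency-preserving end of the spectrum, making the throughput/consistency trade-off directly observable.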
ISBN (Digital): 9798350364606
ISBN (Print): 9798350364613
In recent years, key-value stores (KV stores) [1]-[3] have begun to gain popularity as storage engines for large-scale data applications. KV stores are fundamentally different from traditional SQL databases: with the key-value data model, they have various advantages, such as ease of use, flexibility, and higher performance. However, some essential features of SQL databases, most notably atomicity, consistency, isolation, and durability (ACID) transaction processing [4], are considered impractical and thus are generally not included in a KV store design. With recent advancements, these features have begun to be integrated into KV stores, giving birth to a brand new class of database management systems called NewSQL databases [5], [6]. The rationale behind NewSQL databases is that with ACID transaction support, KV stores can serve as storage engines for an upper-layer SQL query processor to handle SQL queries [7]-[9]. In this way, NewSQL databases can provide both the scalability of KV stores and the ACID guarantees required for online transaction processing (OLTP), thus providing high performance for different types of workloads in a large-scale data system.
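To make the layering concrete, here is a minimal sketch (not any production engine) of how a KV store's narrow get/put interface can back an upper layer that needs atomic multi-key commits; the write-buffer design and all names are illustrative assumptions.

```python
# Minimal sketch: a dict-backed KV store plus a transaction layer that
# buffers writes and applies them all-or-nothing, the kind of atomicity
# an upper-layer SQL processor requires.
class KVStore:
    def __init__(self):
        self._data = {}
    def get(self, key):
        return self._data.get(key)
    def put(self, key, value):
        self._data[key] = value
    def delete(self, key):
        self._data.pop(key, None)

class Transaction:
    def __init__(self, store):
        self.store, self.writes = store, {}
    def put(self, key, value):
        self.writes[key] = value           # staged, not yet visible
    def commit(self):
        for k, v in self.writes.items():   # single-threaded sketch: no
            self.store.put(k, v)           # isolation/durability machinery

db = KVStore()
txn = Transaction(db)
txn.put("user:1:name", "Ada")
txn.put("user:1:balance", 100)
txn.commit()
print(db.get("user:1:name"))  # -> Ada
```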
ISBN (Print): 9781665435741
Timely and efficient air traffic flow statistics play a significant role in improving the accuracy and intelligence of air traffic flow management (ATFM). The enormous spatio-temporal data collected by location-based services (LBS) intensely aggravate the burden of the statistical tasks. Traditional approaches to such tasks show their weakness in two respects: 1) they fail to capture the features of complicated three-dimensional, time-dependent airspace, and 2) they are not optimized to deal with large volumes of spatio-temporal data with high-dimensional features. Spatio-temporal range queries have advantages in identifying the eligible flow records. Therefore, exploring efficient distributed range query processing methods helps improve the performance of air traffic flow statistics and gain insights into the rationality of the air traffic. To analyze large-scale spatio-temporal aviation data efficiently, we propose two spatio-temporal range query MapReduce algorithms: 1) a spatio-temporal polygon range query, which aims to find all records within a polygonal location in a time interval, and 2) a spatio-temporal k nearest neighbors query, which directly searches the k closest neighbors of the query point. Moreover, we design an air traffic flow statistics strategy to accurately calculate traffic flow in arbitrary airspace based on real-world aviation trajectory datasets. The experimental results demonstrate that our algorithms outperform counterpart algorithms in answering spatio-temporal range queries, reducing average response time by 81%. The evaluation also demonstrates the effectiveness of our algorithms for air traffic flow statistics.
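As a single-node illustration of the filtering predicate behind the spatio-temporal polygon range query (the paper distributes this work with MapReduce), the sketch below combines a ray-casting point-in-polygon test with a time-interval check; the record format (lon, lat, timestamp) is an assumption.

```python
# Minimal sketch: the per-record test of a spatio-temporal polygon range
# query -- is the record inside the polygon AND within the time interval?
def point_in_polygon(x, y, polygon):
    """Ray-casting test: count crossings of a horizontal ray from (x, y)."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        if (y1 > y) != (y2 > y):                     # edge spans the ray
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside

def st_polygon_range_query(records, polygon, t_start, t_end):
    """The 'map-side' filter: keep records inside the polygon whose
    timestamp falls in the query interval."""
    return [r for r in records
            if t_start <= r[2] <= t_end and point_in_polygon(r[0], r[1], polygon)]

flights = [(116.4, 39.9, 1000), (121.5, 31.2, 2000), (116.5, 39.8, 3000)]
airspace = [(116.0, 39.5), (117.0, 39.5), (117.0, 40.5), (116.0, 40.5)]
print(st_polygon_range_query(flights, airspace, 0, 2500))
# -> [(116.4, 39.9, 1000)]
```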
ISBN (Print): 9781665435772
We present a scalable parallel I/O system for a logical-inferencing application built atop a deductive database. Deductive databases can make logical deductions (i.e., conclude additional facts) based on a set of program rules, derived from facts already in the database. Datalog is a language, or family of languages, commonly used to specify rules and queries for a deductive database. Applications built using Datalog range from graph mining (such as computing transitive closure or k-cliques) to program analysis (control- and data-flow analysis). In our previous papers, we presented the first implementation of data-parallel Datalog built using MPI. In this paper, we present a parallel I/O system used to checkpoint and restart applications built on top of our Datalog system. State-of-the-art Datalog implementations, such as Souffle, only support serial I/O, mainly because the implementation itself does not support many-node parallel execution. Computing the transitive closure of a graph is one of the simplest logical-inferencing applications built using Datalog; we use it as a micro-benchmark to demonstrate the efficacy of our parallel I/O system. Internally, we use a nested B-tree data structure to facilitate fast and efficient in-memory access to relational data. Our I/O system therefore involves two steps: converting the application data layout (a nested B-tree) to a stream of bytes, followed by the actual parallel I/O. We explore two popular I/O techniques: POSIX I/O and MPI collective I/O. To extract performance from MPI collective I/O we use adaptive striping, and for POSIX I/O we use file-per-process I/O. We demonstrate the scalability of our system at up to 4,096 processes on the Theta supercomputer at Argonne National Laboratory.
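The following minimal sketch illustrates the two-step file-per-process POSIX checkpoint/restart pattern described above, using mpi4py as an assumed stand-in (the paper's system is MPI-based around a nested B-tree; here a plain Python set and pickle stand in for the relational data and the layout-to-byte-stream step).

```python
# Minimal sketch: checkpoint/restart with file-per-process POSIX I/O.
# Step 1 flattens the in-memory structure to bytes; step 2 does the I/O.
import pickle
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank = comm.Get_rank()

def checkpoint(relation, prefix="ckpt"):
    """Serialize this rank's shard, then write it to the rank's own file."""
    payload = pickle.dumps(relation)          # data layout -> byte stream
    with open(f"{prefix}.{rank}", "wb") as f: # one file per process
        f.write(payload)

def restart(prefix="ckpt"):
    """Each rank reads back only its own shard; no inter-rank traffic."""
    with open(f"{prefix}.{rank}", "rb") as f:
        return pickle.loads(f.read())

# Toy shard of a binary relation (edge tuples) owned by this rank.
edges = {(rank, rank + 1), (rank, rank + 2)}
checkpoint(edges)
assert restart() == edges
```

The MPI collective I/O alternative would instead aggregate all ranks' byte streams into one shared file via collective write calls, which is where striping configuration becomes the dominant tuning knob.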
ISBN (Print): 9781450391993
Scientific computing applications generate enormous datasets that are continuously increasing exponentially in both complexity and volume, making their analysis, archival, and sharing one of the grand challenges of modern big data analytics. Supported by the rise of artificial intelligence and deep learning, such enormous datasets are becoming valuable resources even beyond their original scope, opening new opportunities to learn patterns and extract new knowledge at large scale, potentially without human intervention. However, this leads to an increasing complexity of the workflows that combine traditional HPC simulations with big data analytics and AI applications. An initial wave that opened this direction was the shift from compute-intensive to data-intensive computing, which saw several ideas from big data analytics (in-situ processing, shipping computations close to data, complex and dynamic workflows) fused with the tightly coupled patterns addressed by the AI and high performance computing ecosystems. In a quest to keep up with the complexity of the workflows, the design and operation of the infrastructures capable of running them efficiently at scale have evolved accordingly. Extreme heterogeneity at all levels (combinations of CPUs and accelerators, various types of memories, local storage and network links, parallel file systems and object stores, etc.) is now the norm. Ideas pioneered by cloud and edge computing (aspects related to elasticity, multi-tenancy, geo-distributed processing, stream computing) are also beginning to be adopted in the HPC ecosystem (containerized workflows, on-demand jobs to complement batch jobs, streaming of experimental data from instruments directly to supercomputers, etc.). Thus, modern scientific applications need to be integrated into an entire Compute Continuum, from the edge all the way to supercomputers and large data centers, using flexible infrastructures and middlewares. The 12th workshop on AI and Scientific Computing at
ISBN (Print): 9781665435772
Emerging HPC platforms are becoming more difficult to program as a result of systems with different node architectures, some with a small number of "fat" heterogeneous nodes (consisting of multiple accelerators) and others with a large number of "thin" homogeneous nodes consisting of multi-core CPUs connected with high-speed interconnects. New programming models are emerging to address performance portability of applications, as well as a set of scientific libraries that applications can use to exploit these architectures efficiently. To port applications to new architectures, developers need information about their source code, including static characteristics and dynamic (e.g., performance) data, to refactor the code, understand its data and code structure and library usage, and direct their optimisation efforts and key decisions. In this paper, we describe a tool that combines compiler and profiler information to query program characteristics in a given programming environment. Static and dynamic data about applications are collected and stored together in an SQL database that can later be queried to study application characteristics and patterns. We demonstrate the capabilities of this tool with an application-driven case study that aims at understanding application code and its use of scientific libraries via a real-world example from the molecular simulation application CP2K.
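The following sketch illustrates the core idea with a hypothetical schema (not the tool's actual one): static compiler facts and dynamic profile samples live in one SQL database, so a single join answers questions such as "which time-dominant functions call a given library routine?".

```python
# Minimal sketch: static call-graph facts and dynamic profile samples in
# one SQLite database, queried together. Schema and data are illustrative.
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
CREATE TABLE static_calls(caller TEXT, callee TEXT);  -- from the compiler
CREATE TABLE profile(func TEXT, seconds REAL);        -- from the profiler
INSERT INTO static_calls VALUES ('integrate', 'dgemm'), ('setup', 'fopen');
INSERT INTO profile VALUES ('integrate', 42.0), ('setup', 0.3);
""")
# Which time-dominant functions depend on the BLAS routine dgemm?
rows = con.execute("""
    SELECT p.func, p.seconds
    FROM profile p JOIN static_calls s ON p.func = s.caller
    WHERE s.callee = 'dgemm' AND p.seconds > 1.0
    ORDER BY p.seconds DESC
""").fetchall()
print(rows)  # -> [('integrate', 42.0)]
```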
ISBN (Print): 9781665432818
Data collected by sensors has hidden value that can be used to infer valuable knowledge about the system, such as identifying faults in transmission or malfunctions in various system components. Solutions for exploring and exploiting data need to be developed to extract such knowledge. This paper shows how the identification of transmission regularities can be used to extract knowledge about the overall system state. The focus of this work is defining a methodology for detecting transmission periodicity. In our approach, we evaluated existing strategies, addressed several of their limitations, and assessed their utility on real-world data. We further expand the scope by defining strategies for the identification of transmission gaps and duplicates. Finally, we validate the algorithms on samples of real industrial data obtained from monitoring different parts of home appliances.
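A minimal sketch of one way such a methodology can work from timestamps alone (the tolerance threshold and the median-based period estimate are illustrative assumptions, not the paper's exact method): estimate the dominant period from inter-arrival times, then flag gaps and duplicates against it.

```python
# Minimal sketch: detect transmission periodicity, gaps, and duplicates
# from raw message timestamps.
import statistics

def analyze_transmissions(timestamps, tol=0.1):
    ts = sorted(timestamps)
    deltas = [b - a for a, b in zip(ts, ts[1:])]
    period = statistics.median(deltas)          # robust estimate of the cycle
    gaps = [(a, b) for a, b, d in zip(ts, ts[1:], deltas)
            if d > (1 + tol) * period]          # missing transmissions
    duplicates = [b for b, d in zip(ts[1:], deltas)
                  if d < tol * period]          # near-simultaneous repeats
    return period, gaps, duplicates

# Sensor expected every ~10s: one gap (30 -> 60) and one duplicate at 70.1.
ts = [0, 10, 20, 30, 60, 70, 70.1, 80]
period, gaps, dups = analyze_transmissions(ts)
print(period, gaps, dups)  # -> 10 [(30, 60)] [70.1]
```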
ISBN (Print): 9781665440660
Today's high-performance computing (HPC) applications are producing vast volumes of data, which are challenging to store and transfer efficiently during execution, such that data compression is becoming a critical technique to mitigate the storage burden and data movement cost. Huffman coding is arguably the most efficient entropy coding algorithm in information theory, and it is a fundamental step in many modern compression algorithms such as DEFLATE. On the other hand, today's HPC applications increasingly rely on accelerators such as GPUs on supercomputers, while Huffman encoding suffers from low throughput on GPUs, resulting in a significant bottleneck in the entire data processing pipeline. In this paper, we propose and implement an efficient Huffman encoding approach based on modern GPU architectures, which addresses two key challenges: (1) how to parallelize the entire Huffman encoding algorithm, including codebook construction, and (2) how to fully utilize the high-memory-bandwidth feature of modern GPU architectures. The detailed contribution is fourfold. (1) We develop an efficient parallel codebook construction on GPUs that scales effectively with the number of input symbols. (2) We propose a novel reduction-based encoding scheme that can efficiently merge the codewords on GPUs. (3) We optimize the overall GPU performance by leveraging state-of-the-art CUDA APIs such as Cooperative Groups. (4) We evaluate our Huffman encoder thoroughly using six real-world application datasets on two advanced GPUs and compare with our multi-threaded Huffman encoder. Experiments show that our solution improves the encoding throughput by up to 5.0x and 6.8x on NVIDIA RTX 5000 and V100, respectively, over the state-of-the-art GPU Huffman encoder, and by up to 3.3x over the multi-threaded encoder on two 28-core Xeon Platinum 8280 CPUs.
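For reference, the sketch below shows the codebook-construction step in its classical serial, heap-based form, i.e., the baseline that GPU pipelines like the one above must parallelize; the function name is illustrative, and the GPU version replaces the sequential heap with parallel primitives.

```python
# Minimal serial reference: build the Huffman tree from symbol frequencies
# and derive per-symbol codewords (the codebook).
import heapq
from collections import Counter

def huffman_codebook(data):
    freq = Counter(data)
    # Heap entries: (frequency, tie-breaker, {symbol: partial codeword}).
    heap = [(f, i, {sym: ""}) for i, (sym, f) in enumerate(freq.items())]
    heapq.heapify(heap)
    if len(heap) == 1:                       # degenerate single-symbol input
        return {next(iter(freq)): "0"}
    tie = len(heap)
    while len(heap) > 1:                     # merge two least-frequent trees
        f1, _, left = heapq.heappop(heap)
        f2, _, right = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in left.items()}
        merged.update({s: "1" + c for s, c in right.items()})
        heapq.heappush(heap, (f1 + f2, tie, merged))
        tie += 1
    return heap[0][2]

book = huffman_codebook(b"abracadabra")
encoded = "".join(book[b] for b in b"abracadabra")
print(book, len(encoded), "bits")
```

This greedy merge loop is inherently sequential, which is precisely why parallel codebook construction on GPUs is a non-trivial contribution.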
ISBN (Digital): 9798331527211
ISBN (Print): 9798331527228
In this article we present PARSIR (PARallel SImulation Runner), a package that enables the effective exploitation of shared-memory multi-processor machines for running discrete event simulation models. PARSIR is a compile/run-time environment for discrete event simulation models developed with the C programming language. The architecture of PARSIR has been designed to keep the number of CPU cycles required for running models low. This is achieved via the combination of a set of techniques: 1) causally consistent batch-processing of simulation events at an individual simulation object for caching effectiveness; 2) high likelihood of disjoint-access parallelism; 3) favoring memory accesses on local NUMA (Non-Uniform Memory Access) nodes in the architecture, while still enabling well-balanced workload distribution via work-stealing from remote nodes; and 4) the use of RMW (Read-Modify-Write) machine instructions for fast access to the simulation engine data required by the worker threads for managing the concurrent simulation objects and distributing the workload. Furthermore, any architectural solution embedded in the PARSIR engine is fully transparent to the application-level code implementing the simulation model. We also provide experimental results showing the effectiveness of PARSIR when running the reference PHOLD benchmark on a NUMA shared-memory multi-processor machine equipped with 40 CPUs.
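The following toy sequential sketch illustrates only point 1 above, batch-processing the pending events of one simulation object so its state stays cache-resident instead of alternating between objects event by event; PARSIR's worker threads, NUMA placement, work-stealing, and RMW machinery are not modeled, and all names are illustrative.

```python
# Minimal sketch: drain up to `batch` timestamp-ordered events targeting
# the same simulation object in a row before moving to the next object.
import heapq

class SimObject:
    def __init__(self, oid):
        self.oid, self.state = oid, 0
    def handle(self, event_time, payload):
        self.state += payload               # touch object-local state

def run(objects, events, batch=8):
    """events: heap of (time, obj_id, payload) tuples. Popping only
    heap-minimum events preserves global timestamp order (a stand-in for
    causal consistency in this single-threaded sketch)."""
    heapq.heapify(events)
    while events:
        t, oid, p = heapq.heappop(events)
        objects[oid].handle(t, p)
        taken = 1
        while events and events[0][1] == oid and taken < batch:
            t, _, p = heapq.heappop(events)
            objects[oid].handle(t, p)       # same object: state stays hot
            taken += 1

objs = {0: SimObject(0), 1: SimObject(1)}
evs = [(1.0, 0, 5), (1.1, 0, 3), (2.0, 1, 7), (2.5, 0, 1)]
run(objs, evs)
print(objs[0].state, objs[1].state)  # -> 9 7
```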
With the tendency of running large-scale data-intensive applications on High-Performance Computing (HPC) systems, the I/O workloads of HPC storage systems are becoming more complex, such as the increasing metadata-int...