Search Results - Inner Mongolia University Library

Search query: "Any field = 4th International Symposium on Parallel and Distributed Processing and Applications"

3,800 records in total; showing records 31-40

25th International Workshop on Job Scheduling Strategies for Parallel Processing, JSSPP 2022, held in conjunction with the 36th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2022

ISBN: (print) 9783031226977

The proceedings contain 12 papers. The special focus of this conference is on Job Scheduling Strategies for Parallel Processing. The topics include: Optimization of Execution Parameters of Moldable Ultrasound Workflows Under Incomplete Performance Data; Scheduling of Elastic Message Passing Applications on HPC Systems; Preface; On the Feasibility of Simulation-Driven Portfolio Scheduling for Cyberinfrastructure Runtime Systems; Improving Accuracy of Walltime Estimates in PBS Professional Using Soft Walltimes; Re-making the Movie-Making Machine; Using Kubernetes in Academic Environment: Problems and Approaches; AI-Job Scheduling on Systems with Renewable Power Sources; Toward Building a Digital Twin of Job Scheduling and Power Management on an HPC System; Encoding for Reinforcement Learning Driven Scheduling.

PARSIR: a Package for Effective Parallel Discrete Event Simulation on Multi-processor Machines

28th IEEE/ACM International Symposium on Distributed Simulation and Real Time Applications, DS-RT 2024

Authors: Quaglia, Francesco

Affiliations: DICII - University of Rome Tor Vergata, Italy

ISBN: (print) 9798331527211

In this article we present PARSIR (PARallel SImulation Runner), a package that enables the effective exploitation of shared-memory multi-processor machines for running discrete event simulation models. PARSIR is a compile/run-time environment for discrete event simulation models developed with the C programming language. The architecture of PARSIR has been designed to keep the number of CPU cycles required for running models low. This is achieved by combining a set of techniques: 1) causally consistent batch processing of simulation events at an individual simulation object for caching effectiveness; 2) a high likelihood of disjoint-access parallelism; 3) favoring memory accesses on local NUMA (Non-Uniform Memory Access) nodes, while still enabling well-balanced workload distribution via work-stealing from remote nodes; 4) the use of RMW (Read-Modify-Write) machine instructions for fast access to the simulation engine data that worker threads need for managing the concurrent simulation objects and distributing the workload. Furthermore, any architectural solution embedded in the PARSIR engine is fully transparent to the application-level code implementing the simulation model. We also provide experimental results showing the effectiveness of PARSIR when running the reference PHOLD benchmark on a NUMA shared-memory multi-processor machine equipped with 40 CPUs. © 2024 IEEE.

Keywords: Batch data processing

A Locality-aware Cooperative Distributed Memory Caching for Parallel Data Analytic Applications

36th IEEE International Parallel and Distributed Processing Symposium (IEEE IPDPS)

Authors: Hung, Chia-Ting; Chou, Jerry; Chen, Ming-Hung; Chung, I-Hsin

Affiliations: Natl Tsing Hua Univ, Comp Sci Dept, Hsinchu, Taiwan; IBM TJ Watson Res Ctr, New York, NY, USA

ISBN: (print) 9781665497473

Memory caching has long been used to bridge the performance gap between processor and disk and so reduce the data access time of data-intensive computations. Previous studies on caching mostly focus on optimizing the hit rate of a single machine. In this paper, we argue that the caching decisions of a distributed memory system should be made cooperatively for parallel data analytic applications, which are commonly used by emerging technologies, such as Big Data and AI (Artificial Intelligence), to perform data mining and sophisticated analytics on larger data volumes in a shorter time. A parallel data analytic job consists of multiple parallel tasks. Hence, the completion time of a job is bounded by its slowest task, meaning that the job cannot benefit from caching until all inputs of its tasks are cached. To address this problem, we propose a cooperative caching design that periodically rearranges the cache placement among nodes according to the data access pattern while taking task dependency and network locality into account. Our approach is evaluated with a trace-driven simulator using both synthetic workloads and real-world traces. The results show that we can reduce the average completion time by up to 33% compared to non-collaborative caching policies and by up to 25% compared to other state-of-the-art collaborative caching policies.

Keywords: Parallel Data Processing; Caching Algorithm; Performance; Distributed Systems
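
The abstract's observation that a job is bounded by its slowest task can be illustrated with a minimal sketch. It is not the authors' design; the function name, the one-input-per-task model, and the two access times are assumptions chosen only for illustration.

```python
def job_completion_time(task_inputs, cached, cache_time=1.0, disk_time=10.0):
    """Completion time of a job whose tasks run in parallel.

    Assumption (hypothetical model): each task reads exactly one input and
    takes cache_time if that input is cached, disk_time otherwise.
    The job finishes only when its slowest task finishes.
    """
    return max(cache_time if inp in cached else disk_time for inp in task_inputs)


if __name__ == "__main__":
    inputs = ["a", "b", "c"]
    # Caching two of three inputs does not help: one task still reads from disk.
    print(job_completion_time(inputs, cached={"a", "b"}))
    # Only when every input is cached does the job speed up.
    print(job_completion_time(inputs, cached={"a", "b", "c"}))
```

Under this simple model, partially cached jobs gain nothing, which is the motivation the abstract gives for coordinating cache placement across nodes rather than maximizing each node's hit rate in isolation.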

DAG Scheduling Considering Parallel Execution for High-Load Processing on Clustered Many-core Processors

26th IEEE/ACM International Symposium on Distributed Simulation and Real Time Applications (DS-RT)

Authors: Okamura, Ryo; Azumi, Takuya

Affiliations: Saitama Univ, Grad Sch Sci & Engn, Saitama, Japan

ISBN: (digital) 9781665497992

ISBN: (print) 9781665497992

In recent years, high computational power has been required for computer platforms to support complex systems such as self-driving systems. Clustered many-core processors and directed acyclic graphs (DAGs), which can represent dependencies and parallelism of task processing, have attracted much attention as solutions to this problem. Previous studies on scheduling DAGs on multi-core processors have attempted to reduce the makespan (i.e., the time it takes for a task to complete) by increasing the number of processes that can be executed in parallel. However, in self-driving systems, such as those utilizing clustered many-core processors, it is impossible to sufficiently increase the utilization of processor cores due to high-load processing. In this paper, a scheduling method is proposed to improve the utilization of processor cores by executing high-load processes in parallel across multiple cores. The proposed method can reduce the makespan of DAGs performing high-load processing on clustered many-core processors.

Keywords: clustered many-core processors; DAG; work-conserving schedule; list scheduling

Integration Framework for Online Thread Throttling with Thread and Page Mapping on NUMA Systems

1st International Conference on Smart Energy Systems and Artificial Intelligence (SESAI)

Authors: Schwarzrock, Janaina; Lorenzon, Arthur E.; de Souza, Samuel Xavier; Beck, Antonio Carlos S.

Affiliations: Univ Fed Rio Grande do Sul, Porto Alegre, RS, Brazil; Univ Fed Rio Grande do Norte, Natal, RN, Brazil

ISBN: (print) 9798350364613; 9798350364606

Non-Uniform Memory Access (NUMA) systems are prevalent in HPC, where optimal thread and page placement is crucial for enhancing performance and minimizing energy usage [1]-[3]. Moreover, considering that NUMA systems have hardware support for a large number of hardware threads and many parallel applications have limited scalability, throttling the number of active threads may bring further improvements [4]-[6]. However, the optimal configuration (thread mapping, page mapping, number of threads) for energy and performance, quantified by the Energy-Delay Product (EDP), varies with the system hardware, application, input set, and even during execution [1], [4], [6], [7]. Only online optimization approaches can easily adapt to these changes. © 2024 IEEE.

Keywords: parallel applications; NUMA systems; dynamic concurrency throttling; thread mapping; page mapping
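
As a rough illustration of the metric named in the abstract, the sketch below computes the Energy-Delay Product as energy multiplied by execution time and picks the configuration with the lowest value. The function names, the configuration tuples, and all measurements are hypothetical; the paper's online framework is not reproduced here.

```python
def energy_delay_product(energy_joules, time_seconds):
    """EDP = energy consumed multiplied by execution time (lower is better)."""
    return energy_joules * time_seconds


def best_configuration(measurements):
    """Return the configuration key with the smallest EDP.

    measurements maps a configuration, here a hypothetical
    (threads, thread_mapping, page_mapping) tuple, to (energy_J, time_s).
    """
    return min(
        measurements,
        key=lambda cfg: energy_delay_product(*measurements[cfg]),
    )


if __name__ == "__main__":
    # Made-up numbers: more threads run faster but draw more energy.
    samples = {
        (8, "compact", "first-touch"): (100.0, 10.0),
        (16, "scatter", "interleave"): (120.0, 7.0),
        (32, "scatter", "interleave"): (200.0, 6.0),
    }
    print(best_configuration(samples))
```

Because the best entry can change with the hardware, application, input set, and execution phase, an online approach would repeat this kind of selection as fresh measurements arrive.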

Diverse Adaptive Bulk Search: a Framework for Solving QUBO Problems on Multiple GPUs

37th IEEE International Parallel and Distributed Processing Symposium (IPDPS)

Authors: Nakano, Koji; Takafuji, Daisuke; Ito, Yasuaki; Yazane, Takashi; Yano, Junko; Ozaki, Shiro; Katsuki, Ryota; Mori, Rie

Affiliations: Hiroshima Univ, Grad Sch Adv Sci & Engn, Kagamiyama 1-4-1, Higashihiroshima 7398527, Japan; NTT DATA Corp, Res & Dev Headquarters, Toyosu Ctr Bldg Annex, 3-9 Toyosu 3-chome, Koto ku, Tokyo 1358671, Japan

ISBN: (print) 9798350311990

Quadratic Unconstrained Binary Optimization (QUBO) is a combinatorial optimization problem that seeks an optimal binary solution vector minimizing the energy value defined by a quadratic formula of the binary variables in the vector. As many NP-hard problems can be reduced to QUBO problems, considerable research has gone into developing QUBO solvers running on various computing platforms such as quantum devices, ASICs, FPGAs, GPUs, and optical fibers. This paper presents a framework called Diverse Adaptive Bulk Search (DABS), which has the potential to find optimal solutions of many types of QUBO problems. Our DABS solver employs a genetic algorithm-based search algorithm featuring three diverse strategies: multiple search algorithms, multiple genetic operations, and multiple solution pools. During the execution of the solver, search algorithms and genetic operations that succeeded in finding good solutions are automatically selected to obtain better solutions. Moreover, search algorithms traverse between different solution pools to find good solutions. We have implemented our DABS solver to run on multiple GPUs. Experimental evaluations using eight NVIDIA A100 GPUs confirm that our DABS solver succeeds in finding optimal or potentially optimal solutions for three types of QUBO problems.

Keywords: Quantum annealing; combinatorial algorithms; heuristic algorithms; genetic algorithms; GPGPU
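
To make the QUBO objective in the abstract concrete, here is a minimal sketch that assumes the common matrix form E(x) = sum over i, j of W[i][j] * x[i] * x[j], with every x[i] in {0, 1}. The exhaustive search shown is not DABS; it is only feasible for tiny instances and serves to show what "minimizing the energy" means. All names are illustrative.

```python
from itertools import product


def qubo_energy(weights, x):
    """Energy of binary vector x under weight matrix W (assumed matrix form)."""
    n = len(x)
    return sum(weights[i][j] * x[i] * x[j] for i in range(n) for j in range(n))


def brute_force_qubo(weights):
    """Try all 2^n binary vectors and return (best_vector, best_energy).

    Exponential in n, so this only works for very small problems;
    practical solvers rely on heuristic search instead.
    """
    n = len(weights)
    best = min(product((0, 1), repeat=n), key=lambda x: qubo_energy(weights, x))
    return list(best), qubo_energy(weights, best)


if __name__ == "__main__":
    # Negative diagonal rewards setting a bit; positive coupling penalizes setting both.
    W = [[-1, 2],
         [0, -1]]
    print(brute_force_qubo(W))
```

The search space doubles with each added variable, which is why heuristic solvers such as the one described in the abstract are needed for realistic problem sizes.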

Memory-Disaggregated In-Memory Object Store Framework for Big Data Applications

36th IEEE International Parallel and Distributed Processing Symposium (IEEE IPDPS)

Authors: Abrahamse, Robin; Hadnagy, Akos; Al-Ars, Zaid

Affiliations: Delft Univ Technol, Accelerated Big Data Syst, Delft, Netherlands

ISBN: (print) 9781665497473

The concept of memory disaggregation has recently been gaining traction in research. With memory disaggregation, data center compute nodes can directly access memory on adjacent nodes and are therefore able to overcome local memory restrictions, introducing a new data management paradigm for distributed computing. This paper proposes and demonstrates a memory-disaggregated in-memory object store framework for big data applications by leveraging the newly introduced ThymesisFlow memory disaggregation system. The framework extends the functionality of the pre-existing Apache Arrow Plasma object store framework to distributed systems by enabling clients to easily and efficiently produce and consume data objects across multiple compute nodes. This allows big data applications to increasingly leverage parallel processing at reduced development costs. In addition, the paper includes latency and throughput measurements that indicate only a modest performance penalty is incurred for remote disaggregated memory access as opposed to local (~6.5 vs. ~5.75 GiB/s). The results can be used to guide the design of future systems that leverage memory disaggregation as well as the newly presented framework. This work is open-source and publicly accessible at https://***/10.5281/zenodo.6368998.

Keywords: Memory Disaggregation; Apache Arrow Plasma; ThymesisFlow

A Novel DA-Based Parallel Architecture for Inner-Product of Variable Vectors

IEEE International Symposium on Circuits and Systems (ISCAS)

Authors: Kali, Anil; Sabat, Samrat L.; Mehert, Pramod K.

Affiliations: Univ Hyderabad, CASEST, Hyderabad, India; CV Raman Global Univ, Dept Comp Sci & Engn, Bhubaneswar, India

ISBN: (print) 9798350330991; 9798350331004

Computation of inner products is frequently used in machine learning (ML) algorithms as well as in signal processing and communication applications. Distributed arithmetic (DA) has been frequently employed for area-time efficient inner-product implementations. In conventional DA-based architectures, one of the vectors is constant and known a priori. Hence, the traditional DA architectures are not suitable when both vectors are variable. However, computing the inner product of a pair of variable vectors is frequently needed for matrix multiplication of various forms and for convolutional neural networks. In this paper, we present a novel DA-based architecture for computing the inner product of variable vectors. To derive the proposed architecture, the inner product of any given length is decomposed into a set of short-length inner products, such that the inner product can be computed by successive accumulation of the results of the short-length inner products. We have designed a DA-based architecture for the computation of the short-length inner product of variable vectors and used it in successive clock cycles to compute the whole inner product by successive accumulation. The post-layout synthesis results using Cadence Innovus with a GPDK 90nm technology library show that the proposed DA-based parallel architecture offers significant advantages in area-delay product and energy consumption over the bit-serial DA architecture.

Keywords: parallel distributed arithmetic; Inner-product; Radix-4 modified Booth encoding; Adder tree
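
The decomposition described in the abstract, a long inner product computed by accumulating short-length inner products, can be sketched in software as follows. This mirrors only the block-wise accumulation structure; it does not model distributed arithmetic or the proposed hardware, and the function name and default block length are assumptions.

```python
def blocked_inner_product(a, b, block=4):
    """Inner product of a and b computed as a sum of short-length inner products.

    Each loop iteration stands in for one step in which a short inner
    product of at most `block` element pairs is formed and accumulated.
    """
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    if block < 1:
        raise ValueError("block length must be positive")

    accumulator = 0
    for start in range(0, len(a), block):
        # Short-length inner product over one block of both vectors.
        partial = sum(
            x * y for x, y in zip(a[start:start + block], b[start:start + block])
        )
        # Successive accumulation of the partial results.
        accumulator += partial
    return accumulator


if __name__ == "__main__":
    print(blocked_inner_product(list(range(10)), [2] * 10, block=4))
```

The result is independent of the block length, so the block size can be chosen to suit the short-length unit that is available.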

FaultyRank: A Graph-based Parallel File System Checker

37th IEEE International Parallel and Distributed Processing Symposium (IPDPS)

Authors: Kamat, Saisha; Islam, Abdullah Al Raqibul; Zheng, Mai; Dai, Dong

Affiliations: Univ N Carolina, Dept Comp Sci, Charlotte, NC 28223, USA; Iowa State Univ, Dept Elect & Comp Engn, Ames, IA, USA

ISBN: (print) 9798350337662

Similar to local file system checkers such as e2fsck for Ext4, a parallel file system (PFS) checker ensures the file system's correctness. The basic idea of file system checkers is straightforward: important metadata are stored redundantly in separate places for cross-checking; inconsistent metadata will be repaired or overwritten by its 'more correct' counterpart, which is defined by the developers. Unfortunately, implementing the idea for PFSes is non-trivial due to the system complexity. Although many popular parallel file systems already contain dedicated checkers (e.g., LFSCK for Lustre, BeeGFS-FSCK for BeeGFS, mmfsck for GPFS), the existing checkers often cannot detect or repair inconsistencies accurately due to one fundamental limitation: they rely on a fixed set of consistency rules predefined by developers, which cannot cover the various failure scenarios that may occur in practice. In this study, we propose a new graph-based method to build PFS checkers. Specifically, we model important PFS metadata as graphs, then generalize the logic of cross-checking and repairing into graph analytic tasks. We design a new graph algorithm, FaultyRank, to quantitatively calculate the correctness of each metadata object. By leveraging the calculated correctness, we are able to recommend the most promising repairs to users. Based on the idea, we implement a prototype of FaultyRank on Lustre, one of the most widely used parallel file systems, and compare it with Lustre's default file system checker LFSCK. Our experiments show that FaultyRank can achieve the same checking and repairing logic as LFSCK. Moreover, it is capable of detecting and repairing complicated PFS consistency issues that LFSCK cannot handle. We also show the performance advantage of FaultyRank compared with LFSCK. Through this study, we believe FaultyRank opens a new opportunity for building PFS checkers effectively and efficiently.

Keywords: checker; file system checker; graph; Lustre

Communication-efficient Massively Distributed Connected Components

36th IEEE International Parallel and Distributed Processing Symposium (IEEE IPDPS)

Authors: Lamm, Sebastian; Sanders, Peter

Affiliations: Karlsruhe Inst Technol, Inst Theoret Informat, Karlsruhe, Germany

ISBN: (print) 9781665481069

Finding the connected components of an undirected graph is one of the most fundamental graph problems. Connected components are used in a wide spectrum of applications including VLSI design, machine learning and image analysis. Sequentially, one can easily find all connected components in linear time using breadth-first traversal. However, in a massively distributed setting, finding connected components in a scalable way becomes much harder due to data irregularities and the overhead associated with the increased need for communication. In this work, we present a communication-efficient distributed graph algorithm for finding connected components that scales to massively parallel machines. Our algorithm is based on a recent linear-work shared-memory parallel algorithm by Blelloch et al. [1] and refines it for a distributed memory setting. This includes a communication-efficient graph contraction procedure, as well as a distributed variant of the low diameter decomposition by Miller et al. [2]. We tackle the data irregularities introduced by high degree vertices by using an efficient procedure for distributing their incident edges. Our experimental evaluation on up to 16,384 cores indicates good weak scaling behavior that outperforms current state-of-the-art algorithms.

Keywords: graph algorithms; distributed algorithms; communication efficiency; graph connectivity
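
The sequential baseline mentioned in the abstract, finding all connected components in linear time with breadth-first traversal, can be sketched as below. This is the textbook sequential method, not the distributed algorithm the paper proposes; the function name and the edge-list input format are assumptions.

```python
from collections import deque


def connected_components(num_vertices, edges):
    """Return a list mapping each vertex to its component id.

    Runs in O(V + E) time: every vertex is enqueued once and every
    edge is inspected twice (once from each endpoint).
    """
    adjacency = [[] for _ in range(num_vertices)]
    for u, v in edges:
        adjacency[u].append(v)
        adjacency[v].append(u)

    label = [-1] * num_vertices  # -1 marks an unvisited vertex
    next_id = 0
    for start in range(num_vertices):
        if label[start] != -1:
            continue
        # Breadth-first traversal labels everything reachable from start.
        label[start] = next_id
        queue = deque([start])
        while queue:
            u = queue.popleft()
            for v in adjacency[u]:
                if label[v] == -1:
                    label[v] = next_id
                    queue.append(v)
        next_id += 1
    return label


if __name__ == "__main__":
    print(connected_components(6, [(0, 1), (1, 2), (3, 4)]))
```

The traversal depends on a single queue and a shared label array, which is straightforward on one machine but does not carry over directly to a distributed setting where vertices and edges are spread across many nodes.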
