Search Results - Inner Mongolia University Library


Search query: "Any field = 30th Symposium on Principles and Practice of Parallel Programming"

31 records in total; records 11-20 are shown below.


30th International Conference on Principles and Practice of Constraint Programming, CP 2024

ISBN: (print) 9783959773362

The proceedings contain 38 papers. The topics discussed include: solving patience and solitaire games with good old fashioned AI; the complexity of symmetry breaking beyond Lex-Leader; certifying without loss of generality reasoning in solution-improving maximum satisfiability; ParLS-PBO: a parallel local search solver for pseudo Boolean optimization; deep cooperation of local search and unit propagation techniques; cumulative scheduling with calendars and overtime; pseudo-Boolean reasoning about states and transitions to certify dynamic programming and decision diagram algorithms; anytime weighted model counting with approximation guarantees for probabilistic inference; and a multi-stage proof logging framework to certify the correctness of CP solvers.


Constraint Programming Model for Assembly Line Balancing and Scheduling with Walking Workers and Parallel Stations

30th International Conference on Principles and Practice of Constraint Programming, CP 2024

Authors: Pucel, Xavier; Roussel, Stéphanie

Affiliations: ONERA, ONERA DTIS, Université de Toulouse, Toulouse, France

ISBN: (print) 9783959773362

In the context of aircraft assembly lines, increasing the production rate and decreasing the operating costs are two important, and sometimes contradictory, objectives. In small assembly lines, sharing production resources across workstations is a simple and efficient way to reduce operating costs. Therefore, workers are not assigned to a unique workstation but can walk between them. On the other hand, paralleling workstations is an efficient way to increase the production rate. However, the combination of both strategies creates complex conditions for tasks to access the production resources. This paper addresses the problem of allocating tasks to workstations and scheduling them in an assembly line where workers can freely walk across workstations, and where workstations can be organized in parallel. We model this novel problem with Constraint Programming. We evaluate it on real-world industrial use cases coming from aircraft manufacturers, as well as synthetic use cases adapted from the literature. © Xavier Pucel and Stéphanie Roussel.

Keywords: Constraint programming


ParLS-PBO: A Parallel Local Search Solver for Pseudo Boolean Optimization

30th International Conference on Principles and Practice of Constraint Programming, CP 2024

Authors: Chen, Zhihan; Lin, Peng; Hu, Hao; Cai, Shaowei

Affiliations: State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, Beijing, China; School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing, China

ISBN: (print) 9783959773362

As a broadly applied technique in numerous optimization problems, local search has recently been employed to solve the Pseudo-Boolean Optimization (PBO) problem. A representative local search solver for PBO is LS-PBO. In this paper, we first improve LS-PBO with a dynamic scoring mechanism, which dynamically strikes a balance between the score on hard constraints and the score on the objective function. Moreover, on top of this improved LS-PBO, we develop the first parallel local search PBO solver. The main idea is to share good solutions among different threads to guide the search, by maintaining a pool of feasible solutions. For evaluating solutions when updating the pool, we propose a function that considers both the solution quality and the diversity of the pool. Furthermore, we calculate the polarity density in the pool to enhance the scoring function of local search. Our empirical experiments show clear benefits of the proposed parallel approach, making it competitive with the parallel version of the famous commercial solver Gurobi. © Zhihan Chen, Peng Lin, Hao Hu, and Shaowei Cai.

Keywords: Local search (optimization)


CP for Bin Packing with Multi-Core and GPUs

30th International Conference on Principles and Practice of Constraint Programming, CP 2024

Authors: Tardivo, Fabio; Michel, Laurent; Pontelli, Enrico

Affiliations: Department of Computer Science, New Mexico State University, Las Cruces, NM, United States; Synchrony Chair in Cybersecurity, School of Computing, University of Connecticut, Storrs, CT, United States

ISBN: (print) 9783959773362

The BinPacking constraint models the requirements of many logistics, resource allocation, and production scheduling applications. This paper explores new avenues based on the impressive computational power of modern GPUs to propagate the BinPacking constraint. This work showcases how the perspective of massive parallelization can lead to novel approaches, such as the use of a portfolio of lower bounds, to enhance the pruning of the BinPacking constraints. It delivers insights into the design choices and challenges presented by the GPU platform for constraint propagation. The paper evaluates a GPU-accelerated propagator against both sequential and parallel CPU versions, as well as state-of-the-art approaches. Comparisons across various benchmarks from the literature show strong performance with respect to both CPU versions and the standard pruning approach. When compared to techniques based on Linear Programming, our approach proves valuable for large instances or when spending extensive time to obtain the best possible bound is not convenient. © Fabio Tardivo, Laurent Michel, and Enrico Pontelli.

Keywords: Linear programming


Minimizing speculation overhead in a parallel recognizer for regular texts

Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming

Authors: Angelo Borsotti; Luca Breveglieri; Angelo Morzenti; Stefano Crespi Reghizzi

Affiliations: Politecnico di Milano, Milano, Italy; Politecnico di Milano and CNR-IEIIT, Milano, Italy

ISBN: (print) 9798400714436

Speculative data-parallel algorithms for language recognition have been widely experimented with for various types of finite-state automata (FA), deterministic (DFA) and nondeterministic (NFA), often derived from regular expressions (RE). Such an algorithm cuts the input string into chunks, independently recognizes each chunk in parallel by means of identical FAs, and at last joins the chunk results and checks the overall consistency. In chunk recognition, it is necessary to speculatively start the FAs in any state, thus causing an overhead that reduces the speedup over a serial algorithm. The existing data-parallel DFA-based recognizers suffer from an excessive number of starting states, and the NFA-based ones suffer from the number of nondeterministic transitions.

Keywords: data-parallel recognition algorithm
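The chunk-and-join scheme that the abstract above describes can be illustrated with a minimal Python sketch. It is not the authors' recognizer: the DFA (even number of `a` symbols), the names `run_chunk` and `recognize`, and the use of a thread pool are all assumptions made for illustration. Each chunk is run from every state, which is exactly the speculation overhead the abstract refers to.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical DFA over {a, b}: accepts strings with an even number of 'a'.
STATES = (0, 1)
START = 0
ACCEPTING = {0}
DELTA = {(0, "a"): 1, (0, "b"): 0, (1, "a"): 0, (1, "b"): 1}


def run_chunk(chunk):
    """Speculatively run the DFA on one chunk from every possible start state.

    Returns a dict mapping each start state to the state reached at chunk end.
    """
    result = {}
    for start in STATES:
        state = start
        for symbol in chunk:
            state = DELTA[(state, symbol)]
        result[start] = state
    return result


def recognize(text, num_chunks=4):
    """Split the text into chunks, process them concurrently, then join."""
    size = max(1, -(-len(text) // num_chunks))  # ceiling division, at least 1
    chunks = [text[i:i + size] for i in range(0, len(text), size)]
    with ThreadPoolExecutor() as pool:
        mappings = list(pool.map(run_chunk, chunks))
    # Join phase: thread the true start state through the per-chunk mappings.
    state = START
    for mapping in mappings:
        state = mapping[state]
    return state in ACCEPTING


def recognize_serial(text):
    """Plain serial DFA run, used as a reference for the chunked version."""
    state = START
    for symbol in text:
        state = DELTA[(state, symbol)]
    return state in ACCEPTING
```

In CPython the thread pool only shows the structure of the computation, since the interpreter lock prevents a real speedup for this kind of loop. The cost of each chunk grows with the number of speculative start states, which is the quantity the abstract says existing DFA-based recognizers suffer from.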


A Scalable Hybrid Total FETI Method for Massively Parallel FEM Simulations

28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, PPoPP 2023

Authors: Lin, Kehao; Zhou, Chunbao; Zeng, Yan; Nie, Ningming; Wang, Jue; Li, Shigang; Feng, Yangde; Wang, Yangang; Yao, Kehan; Yao, Tiechui; Zhang, Jilin; Wan, Jian

Affiliations: Hangzhou Dianzi University, Hangzhou, China; Computer Network Information Center, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China; School of Computer Science, Beijing University of Posts and Telecommunications, Beijing, China

ISBN: (print) 9798400700156

The Hybrid Total Finite Element Tearing and Interconnecting (HTFETI) method plays an important role in solving large-scale and complex engineering problems. This method needs to handle numerous matrix-vector multiplications. Directly calling the vendor-optimized library for general matrix-vector multiplication (gemv) on GPU leads to low performance, since it does not consider optimizations for different matrix sizes in HTFETI, i.e., different row and column sizes. In addition, state-of-the-art graph partitioning methods cannot guarantee load balancing for HTFETI, since the matrix size is determined by the length of the subdomain boundary. To solve the problems above, we first port gemv to the multi-stream pipeline scheme and develop a new batched kernel function on GPU, which bring a 15% to 30% throughput improvement and a 37% average GFLOPs improvement, respectively. We also propose a multi-grained load-balancing scheme based on graph repartitioning and work-stealing, which brings the load imbalance ratio down from 1.5 to between 1.05 and 1.09. We have successfully applied the scalable HTFETI method to simulate the whole core assembly of the China Experimental Fast Reactor (CEFR) for steady-state analysis, and the efficiencies of weak scalability and strong scalability reach 78% and 72% on 12,288 GPUs, respectively. As far as we know, this is the first time that HTFETI has been used in large-scale and high-fidelity whole core assembly simulation. © 2023 Owner/Author.

Keywords: Scalability


Magneto: Accelerating Parallel Structures in DNNs via Co-Optimization of Operators

Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming

Authors: Zhanyuan Di; Leping Wang; Ziyi Ren; En Shao; Jie Zhao; Siyuan Feng; Dingwen Tao; Guangming Tan; Ninghui Sun

Affiliations: SKLP, Institute of Computing Technology, CAS; University of Chinese Academy of Sciences; SKLP, Institute of Computing Technology, CAS; Hunan University; Shanghai Jiao Tong University

ISBN: (print) 9798400714436

Deep neural networks (DNNs) increasingly rely on parallel structures to enhance performance and efficiency. However, existing machine learning compilers (MLCs) face challenges in optimizing these structures due to limited parallel fusion scopes and insufficient consideration of intra-operator information. This paper introduces Magneto, a novel framework designed to accelerate parallel structures in DNNs through the co-optimization of parallel operators. By expanding the scope of parallel operator fusion and introducing a dedicated co-tuning algorithm, Magneto unlocks new opportunities for co-optimization. Experimental results demonstrate that Magneto outperforms NVIDIA TensorRT and AMD MIGraphX, achieving speedups of 3.02× and 4.19×, respectively.

Keywords: DNN


Parallel k-Core Decomposition with Batched Updates and Asynchronous Reads

29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, PPoPP 2024

Authors: Liu, Quanquan C.; Shun, Julian; Zablotchi, Igor

Affiliations: Yale University, United States; MIT CSAIL, United States; Mysten Labs, Switzerland

ISBN: (print) 9798400704352

Maintaining a dynamic k-core decomposition is an important problem that identifies dense subgraphs in dynamically changing graphs. Recent work by Liu et al. [SPAA 2022] presents a parallel batch-dynamic algorithm for maintaining an approximate k-core decomposition. In their solution, both reads and updates need to be batched, and therefore each type of operation can incur high latency waiting for the other type to finish. To tackle most real-world workloads, which are dominated by reads, this paper presents a novel hybrid concurrent-parallel dynamic k-core data structure where asynchronous reads can proceed concurrently with batches of updates, leading to significantly lower read latencies. Our approach is based on tracking causal dependencies between updates, so that causally related groups of updates appear atomic to concurrent readers. Our data structure guarantees linearizability and liveness for both reads and updates, and maintains the same approximation guarantees as prior work. Our experimental evaluation on a 30-core machine shows that our approach reduces read latency by orders of magnitude compared to the batch-dynamic algorithm, up to a (4.05 × 10^5)-factor. Compared to an unsynchronized (non-linearizable) baseline, our read latency overhead is only up to a 3.21-factor greater, while improving accuracy of coreness estimates by up to a factor of 52.7. © 2024 Copyright held by the owner/author(s).

Keywords: Data structures


TensorMD: Molecular Dynamics Simulation with Ab Initio Accuracy of 50 Billion Atoms

Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming

Authors: Yucheng Ouyang; Ying Liu; Honghui Shang; Zhenchuan Chen; Jiahao Shan; Huimin Cui; Xiaobing Feng; Xin Chen; Xingyu Gao; Lifang Wang; Haifeng Song; Rongfen Lin; Fang Li

Affiliations: Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; National Research Center of Parallel Computer Engineering and Technology, Beijing, China; Institute of Applied Physics and Computational Mathematics, Beijing, China

ISBN: (print) 9798400714436

Molecular dynamics simulation is emerging as an important area in which HPC+AI helps to investigate physical properties, using machine-learning interatomic potentials (MLIPs). General-purpose machine-learning (ML) tools have been leveraged in MLIPs, but the two are not perfectly matched, since ML tools miss many optimization opportunities in MLIPs. This inefficiency arises from the fact that HPC+AI applications involve far more computational complexity than pure AI scenarios. This paper develops an MLIP, named TensorMD, independently of any ML tool. TensorMD has been evaluated on two supercomputers and scaled to 51.8 billion atoms, i.e., ~3× compared with the state of the art.

Keywords: GPU


Types for Complexity of Parallel Computation in Pi-Calculus

30th European Symposium on Programming (ESOP), Held as Part of the 24th European Joint Conferences on Theory and Practice of Software (ETAPS)

Authors: Baillot, Patrick; Ghyselen, Alexis

Affiliations: Univ Claude Bernard Lyon 1, LIP, Univ Lyon, CNRS, ENS Lyon, F-69342 Lyon 07, France

ISBN: (digital) 9783030720193

ISBN: (print) 9783030720193; 9783030720186

Type systems as a technique to analyse or control programs have been extensively studied for functional programming languages. In particular, some systems allow one to extract from a typing derivation a complexity bound on the program. We explore how to extend such results to parallel complexity in the setting of the pi-calculus, considered as a communication-based model for parallel computation. Two notions of time complexity are given: the total computation time without parallelism (the work) and the computation time under maximal parallelism (the span). We define operational semantics to capture those two notions, and present two type systems from which one can extract a complexity bound on a process. The type systems are inspired both by size types and by input/output types, with additional temporal information about communications.

Keywords: Type Systems; Pi-calculus; Process Calculi; Complexity Analysis; Implicit Computational Complexity; Size Types
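The two complexity notions named in the abstract above, work and span, can be made concrete on a toy process shape. The sketch below is an assumption for illustration only: it uses a simple fork-join tree (`Step`, `Seq`, `Par` are hypothetical names), not the pi-calculus semantics or the type systems of the paper. Work adds up every step, while span adds sequential parts and takes the maximum over parallel branches.

```python
from dataclasses import dataclass
from typing import Tuple, Union


@dataclass(frozen=True)
class Step:
    """A single computation step with a fixed cost."""
    cost: int


@dataclass(frozen=True)
class Seq:
    """Parts executed one after another."""
    parts: Tuple["Proc", ...]


@dataclass(frozen=True)
class Par:
    """Parts executed in parallel."""
    parts: Tuple["Proc", ...]


Proc = Union[Step, Seq, Par]


def work(p: Proc) -> int:
    """Total computation time without parallelism: every step is counted."""
    if isinstance(p, Step):
        return p.cost
    return sum(work(q) for q in p.parts)


def span(p: Proc) -> int:
    """Computation time under maximal parallelism (longest dependency chain)."""
    if isinstance(p, Step):
        return p.cost
    if isinstance(p, Seq):
        return sum(span(q) for q in p.parts)
    return max((span(q) for q in p.parts), default=0)


# One step, then two parallel branches (costs 3 and 2 + 2), then one step.
example = Seq((Step(1), Par((Step(3), Seq((Step(2), Step(2))))), Step(1)))
```

For `example`, `work(example)` counts all five steps, while `span(example)` follows only the longer parallel branch, so the span is never larger than the work.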


235 Daxue West Street, Saihan District, Hohhot, Inner Mongolia Autonomous Region. Postal code: 010021
