Details
ISBN:
(Print) 9783031396977; 9783031396984
Looking closely at the Top500 list of high-performance computers (HPC) in the world, it becomes clear that computing power is not the only number that has been growing in the last three decades. The amount of power required to operate such massive computing machines has been steadily increasing, earning HPC users a higher than usual carbon footprint. While the problem is well known in academia, the exact energy requirements of hardware and software, and how to optimize them, are hard to quantify. To tackle this issue, we need tools to understand the software and its relationship with power consumption in today's high-performance computers. With that in mind, we present perun, a Python package and command line interface to measure energy consumption based on hardware performance counters and selected physical measurement sensors. This enables accurate energy measurements on various scales of computing, from a single laptop to an MPI-distributed HPC application. We include an analysis of the discrepancies between these sensor readings and hardware performance counters, with particular focus on the power draw of usually overlooked non-compute components such as memory. One of our major insights is their significant share of the total energy consumption. We have equally analyzed the runtime and energy overhead perun generates when monitoring common HPC applications, and found it to be minimal. Finally, an analysis of the accuracy of different measuring methodologies when applied at large scales is presented.
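As an illustration of the counter-based measurement principle described above, the following is a minimal Python sketch that samples an Intel RAPL energy counter around a workload on a Linux host. It is not perun's API; the file paths assume a system exposing /sys/class/powercap, and the helper names are ours.

# Minimal sketch: sample an Intel RAPL energy counter around a workload.
# Assumes a Linux host exposing /sys/class/powercap/intel-rapl:0/energy_uj;
# this only illustrates the counter-based principle, not perun's own API.
import time

RAPL = "/sys/class/powercap/intel-rapl:0/energy_uj"
MAX_RANGE = "/sys/class/powercap/intel-rapl:0/max_energy_range_uj"

def read_uj(path):
    with open(path) as f:
        return int(f.read())

def measure(workload):
    start = read_uj(RAPL)
    t0 = time.time()
    workload()
    elapsed = time.time() - t0
    end = read_uj(RAPL)
    if end < start:                      # counter wrapped around
        end += read_uj(MAX_RANGE)
    joules = (end - start) / 1e6
    return joules, joules / elapsed      # energy [J], mean power [W]

if __name__ == "__main__":
    energy, power = measure(lambda: sum(i * i for i in range(10_000_000)))
    print(f"energy = {energy:.2f} J, mean power = {power:.1f} W")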
Details
ISBN:
(Print) 9783031396977; 9783031396984
Cholesky factorization is a method for solving linear systems involving symmetric, positive-definite matrices, and can be an attractive choice in applications where a high degree of numerical stability is needed. One such application is mathematical optimization, where direct methods for solving linear systems are widely used and often a significant performance bottleneck. An example where this is the case, and the specific type of optimization problem motivating this work, is radiation therapy treatment planning, where mathematical optimization is used to create individual treatment plans for patients. To address this bottleneck, we propose a task-based multi-threaded method for Cholesky factorization of banded matrices with medium-sized bands. We implement our algorithm using OpenMP tasks and compare our performance with state-of-the-art libraries such as Intel MKL. Our performance measurements show performance that is on par with or better than Intel MKL (up to ~26% better on a single CPU socket) for a wide range of matrix bandwidths on two different Intel CPU systems.
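To make the task structure concrete, here is a small NumPy sketch of a blocked right-looking Cholesky factorization, showing the per-block factorize/solve/update steps whose dependencies a task-based runtime such as OpenMP tasks can exploit. It operates on a dense SPD matrix and does not reproduce the paper's banded OpenMP implementation; the block size and names are illustrative.

# Sketch: blocked right-looking Cholesky in NumPy, exposing the block
# dependencies (factorize / panel solve / trailing update) that a
# task-based banded variant exploits. Illustrative only.
import numpy as np

def blocked_cholesky(A, nb=64):
    A = A.copy()
    n = A.shape[0]
    for k in range(0, n, nb):
        ke = min(k + nb, n)
        # factorize the diagonal block (potrf-like task)
        A[k:ke, k:ke] = np.linalg.cholesky(A[k:ke, k:ke])
        L_kk = A[k:ke, k:ke]
        if ke < n:
            # solve the panel below the diagonal block (trsm-like tasks)
            A[ke:, k:ke] = np.linalg.solve(L_kk, A[ke:, k:ke].T).T
            # update the trailing matrix (syrk/gemm-like tasks); for a banded
            # matrix only blocks inside the band would be touched here
            A[ke:, ke:] -= A[ke:, k:ke] @ A[ke:, k:ke].T
    return np.tril(A)

A = np.random.rand(512, 512)
A = A @ A.T + 512 * np.eye(512)          # symmetric positive definite
L = blocked_cholesky(A)
print(np.allclose(L @ L.T, A))           # True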
Details
ISBN:
(Print) 9781665440660
GPUs are readily available in cloud computing and personal devices, but their use for data processing acceleration has been slowed down by their limited integration with common programming languages such as Python or Java. Moreover, using GPUs to their full capabilities requires expert knowledge of asynchronous programming. In this work, we present a novel GPU runtime scheduler for multi-task GPU computations that transparently provides asynchronous execution, space-sharing, and transfer-computation overlap without requiring any information about the program dependency structure in advance. We leverage the GrCUDA polyglot API to integrate our scheduler with multiple high-level languages and provide a platform for fast prototyping and easy GPU acceleration. We validate our work on 6 benchmarks created to evaluate task parallelism and show an average 44% speedup against synchronous execution, with no execution time slowdown compared to hand-optimized host code written using the C++ CUDA Graphs API.
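The scheduler's key ingredient is inferring dependencies between GPU tasks without user annotations. The sketch below shows one way to derive a dependency DAG from the arrays each kernel reads and writes; the data structures and task names are hypothetical, and this is not the GrCUDA scheduler itself.

# Sketch: inferring a dependency DAG for GPU tasks from the arrays each
# kernel reads and writes, so independent tasks can run on separate streams.
# Hypothetical data structures; not the GrCUDA runtime.
from collections import defaultdict

class Task:
    def __init__(self, name, reads, writes):
        self.name, self.reads, self.writes = name, set(reads), set(writes)
        self.deps = set()

def build_dag(tasks):
    last_writer = {}                 # array -> task that last wrote it
    readers = defaultdict(list)      # array -> tasks that read it since then
    for t in tasks:
        for a in t.reads | t.writes:             # read/write-after-write
            if a in last_writer:
                t.deps.add(last_writer[a])
        for a in t.writes:                       # write-after-read
            t.deps.update(readers[a])
            last_writer[a] = t
            readers[a] = []
        for a in t.reads:
            readers[a].append(t)
    return tasks

tasks = build_dag([
    Task("k1", reads=["x"], writes=["y"]),
    Task("k2", reads=["x"], writes=["z"]),       # independent of k1
    Task("k3", reads=["y", "z"], writes=["out"]),
])
for t in tasks:
    print(t.name, "depends on", sorted(d.name for d in t.deps))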
Details
ISBN:
(Print) 9781665435741
Traditional studies on jamming effectiveness and propagation over the wireless channel assume ideal theoretical models, such as Friis and Rician. However, the cited models have hardly been validated by on-field assessments in real jamming scenarios. To the best of our knowledge, we are the first to fill the highlighted gap. In particular, our objective is to provide a realistic jamming propagation model, taking into account heterogeneous operating frequencies and technologies. Our findings, supported by an extensive experimental campaign on outdoor jamming propagation, show that, independently of the communication frequency, the jamming power received at a given distance from the jamming source (fast fading) can be best modelled through a t location-scale distribution, while the power of the received jamming decays with increasing distance from the jamming source (slow fading) following a power law. As reference applications of the derived experimental model, we describe and demonstrate its usage in two different use cases, i.e., jamming source localization and dead-reckoning navigation, showing that our model outperforms traditional and state-of-the-art propagation models when dealing with real jamming scenarios. All the acquired data have been released as open source, to foster experimental research activities on jamming propagation models and their applications.
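As a hedged illustration of the reported model, the following snippet fits a t location-scale distribution (Student's t with location and scale) to fast-fading samples with SciPy and estimates a power-law path-loss exponent for slow fading via a log-log linear fit. The data here are synthetic and the parameter values are not the paper's measurements.

# Sketch: t location-scale fit for fast fading, power-law fit for slow fading.
# Synthetic data; all numbers are illustrative, not the measured model.
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# fast fading: fluctuation of received jamming power at a fixed distance
fluct_db = stats.t.rvs(df=4, loc=0.0, scale=2.0, size=2000, random_state=rng)
df, loc, scale = stats.t.fit(fluct_db)
print(f"t location-scale fit: df={df:.2f}, loc={loc:.2f} dB, scale={scale:.2f} dB")

# slow fading: mean received power decays with distance as P(d) = P0 * d**(-n);
# the exponent n is estimated by linear regression in log-log space
d = np.linspace(5, 200, 40)
p = 1e-3 * d ** -2.7 * rng.lognormal(0, 0.05, size=d.size)
slope, intercept = np.polyfit(np.log(d), np.log(p), 1)
print(f"path-loss exponent n = {-slope:.2f}")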
Details
ISBN:
(Print) 9781665440660
Python has become a widely used programming language for research, not only for small one-off analyses, but also for complex application pipelines running at supercomputer scale. Modern parallel programming frameworks for Python present users with a more granular unit of management than traditional Unix processes and batch submissions: the Python function. We review the challenges involved in running native Python functions at scale, and present techniques for dynamically determining a minimal set of dependencies and for assembling a lightweight function monitor (LFM) that captures the software environment and manages resources at the granularity of single functions. We evaluate these techniques in a range of environments, from campus cluster to supercomputer, and show that our advanced dependency management planning and dynamic resource management methods provide superior performance and utilization relative to coarser-grained management approaches, achieving a several-fold decrease in execution time for several large Python applications.
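A minimal sketch of per-function dependency discovery, in the spirit of the analysis described above: it inspects the global names a Python function references and reports the top-level modules involved. The real system is far more thorough (closures, transitive imports, data files); the helper name is ours.

# Sketch: discover the modules a single Python function actually touches.
# Stdlib only; illustrative, not the paper's dependency-management system.
import sys
from types import ModuleType

def function_dependencies(func):
    """Top-level modules referenced by a function's global names."""
    mods = set()
    for name in func.__code__.co_names:          # names the bytecode looks up
        obj = func.__globals__.get(name)
        if isinstance(obj, ModuleType):
            mods.add(obj.__name__)
        elif obj is not None and getattr(obj, "__module__", None):
            mods.add(obj.__module__)
    return sorted({m.split(".")[0] for m in mods})

import json, math

def analyze(x):
    return json.dumps({"root": math.sqrt(x)})

print(function_dependencies(analyze))            # -> ['json', 'math']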
Details
ISBN:
(Print) 9781665435741
With the explosive increase of various user equipments, access latency has become a paramount QoS metric in multi-access edge computing (MEC). At the same time, cost expenditure affects and restrains the reduction of latency. To cope with these issues, a two-stage replica management mechanism (TRMM) for latency-aware applications in MEC is proposed. First, we design the system architecture of TRMM in the MEC environment and construct a novel mathematical model that describes the replica placement decision problem as a dual-objective problem with both latency and cost constraints. Subsequently, in the replica recommendation stage, we present a file prospective popularity model based on user mobility and a replica recommendation algorithm; in the replica placement rule learning stage, we construct a Q-Learning model, in which a new reward function is defined in terms of data access latency and replica placement cost, and the replica placement rule is defined in terms of a 0-1 matrix. Finally, numerical results demonstrate that TRMM outperforms other replica placement schemes.
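To illustrate the rule-learning stage, the sketch below runs tabular Q-Learning with a reward that trades off access latency against replica placement cost and a binary place/do-not-place action. The environment, state space and weights are toy values, not the paper's model.

# Sketch: tabular Q-Learning with a latency/cost reward for replica placement.
# Purely illustrative environment; not TRMM itself.
import numpy as np

N_STATES, N_ACTIONS = 16, 2           # action 1 = place replica, 0 = do not
ALPHA, GAMMA, EPS = 0.1, 0.9, 0.1
W_LAT, W_COST = 0.7, 0.3              # weights of the dual objective

Q = np.zeros((N_STATES, N_ACTIONS))
rng = np.random.default_rng(1)

def reward(latency, cost):
    # higher reward for lower latency and lower cost (both normalised to [0, 1])
    return -(W_LAT * latency + W_COST * cost)

def step(state, action):
    # toy environment: placing a replica reduces latency but incurs storage cost
    latency = rng.uniform(0.1, 0.3) if action == 1 else rng.uniform(0.5, 1.0)
    cost = 0.4 if action == 1 else 0.0
    return rng.integers(N_STATES), reward(latency, cost)

state = 0
for _ in range(5000):
    action = rng.integers(N_ACTIONS) if rng.random() < EPS else int(Q[state].argmax())
    next_state, r = step(state, action)
    Q[state, action] += ALPHA * (r + GAMMA * Q[next_state].max() - Q[state, action])
    state = next_state

print("learned placement rule (0/1 per state):", Q.argmax(axis=1))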
Details
ISBN:
(Print) 9781665435772
Use of Deep Learning (DL) in commercial applications such as image classification, sentiment analysis and speech recognition is increasing. When training DL models with a large number of parameters and/or large datasets, the cost and speed of training can become prohibitive. Distributed DL training solutions that split a training job into subtasks and execute them over multiple nodes can decrease training time. However, the cost of current solutions, built predominantly for cluster computing systems, can still be an issue. In contrast to cluster computing systems, Volunteer Computing (VC) systems can lower the cost of computing, but applications running on VC systems have to handle fault tolerance, variable network latency and heterogeneity of compute nodes, and current solutions are not designed to do so. We design a distributed solution that can run DL training on a VC system by using a data-parallel approach. We implement a novel asynchronous SGD scheme called VC-ASGD suited for VC systems. In contrast to traditional VC systems that lower cost by using untrustworthy volunteer devices, we lower cost by leveraging preemptible computing instances on commercial cloud platforms. By using preemptible instances that require applications to be fault tolerant, we lower cost by 70-90% and improve data security.
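A minimal sketch of the data-parallel pattern that asynchronous schemes such as VC-ASGD build on: workers pull parameters, compute gradients on their own data, and push updates to a shared parameter server without synchronising with each other. The objective and the threading setup are toy choices, and none of the paper's fault-tolerance or preemption handling is shown.

# Sketch: asynchronous SGD with a shared parameter server and threaded workers.
# Toy quadratic objective; not the VC-ASGD algorithm.
import threading
import numpy as np

class ParameterServer:
    def __init__(self, dim, lr=0.05):
        self.w = np.zeros(dim)
        self.lr = lr
        self.lock = threading.Lock()

    def pull(self):
        with self.lock:
            return self.w.copy()

    def push(self, grad):
        with self.lock:                     # apply updates as they arrive
            self.w -= self.lr * grad

def worker(ps, data, steps=200):
    rng = np.random.default_rng()
    for _ in range(steps):
        w = ps.pull()
        x = data[rng.integers(len(data))]
        ps.push(2 * (w - x))                # gradient of ||w - x||^2

target = np.array([3.0, -1.0])
data = target + np.random.default_rng(0).normal(0, 0.1, size=(256, 2))
ps = ParameterServer(dim=2)
threads = [threading.Thread(target=worker, args=(ps, data)) for _ in range(4)]
for t in threads: t.start()
for t in threads: t.join()
print("estimated parameters:", ps.w.round(2))   # close to [3.0, -1.0]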
Details
ISBN:
(Print) 9781665435741
Sleep staging is an important method to diagnose and treat insomnia, sleep apnea, and other sleep disorders. Compared with multi-channel automatic sleep staging systems, the single-channel EEG signal contains less information, and traditional single-domain feature extraction algorithms cannot meet the accuracy requirements of sleep stage classification. To solve this problem, we propose an automatic sleep staging method based on the combination of time-domain and frequency-domain features of single-channel EEG signals. Empirical mode decomposition is used to decompose the EEG signal in the time domain to obtain decomposed signals at different time scales, and multiple local features are extracted from each decomposed signal. The frequency-domain features of the EEG signal are obtained by decomposing it into its various rhythms in the frequency domain. The time-domain and frequency-domain features are combined into feature vectors and selected for sleep staging. The experimental results show that the proposed sleep staging method, using time-frequency domain features of single-channel EEG signals, can approach the accuracy of multi-channel sleep staging on the same data set and is superior to sleep staging methods using the same single-channel EEG signals.
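As an illustration of the frequency-domain side of such a pipeline, the sketch below extracts band powers of the classical EEG rhythms from one single-channel epoch with Welch's method. The sampling rate, epoch and band edges are assumptions, and the paper's EMD-based time-domain features are not reproduced.

# Sketch: frequency-domain features (rhythm band powers) for one EEG epoch.
# Sampling rate, band edges and the toy signal are illustrative assumptions.
import numpy as np
from scipy.signal import welch

FS = 100                                   # assumed sampling rate in Hz
BANDS = {"delta": (0.5, 4), "theta": (4, 8),
         "alpha": (8, 13), "beta": (13, 30)}

def band_powers(epoch, fs=FS):
    freqs, psd = welch(epoch, fs=fs, nperseg=fs * 4)
    df = freqs[1] - freqs[0]
    feats = {}
    for name, (lo, hi) in BANDS.items():
        mask = (freqs >= lo) & (freqs < hi)
        feats[name] = float(np.sum(psd[mask]) * df)   # power in the band
    return feats

# 30-second toy epoch: a 10 Hz alpha rhythm plus noise
t = np.arange(0, 30, 1 / FS)
epoch = np.sin(2 * np.pi * 10 * t) + 0.5 * np.random.default_rng(0).normal(size=t.size)
print(band_powers(epoch))                  # alpha should dominate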
Details
ISBN:
(Digital) 9798331542856
ISBN:
(Print) 9798331542863
This paper presents an innovative analog neuron circuit design, subtly implementing the ReLU activation function. This analog neuron mainly consists of two parts. The first part handles linear processing and is responsible for the weighted summation of the input signals; in our design, the hardware circuit makes it possible to assign the desired weight to signals from different inputs. The other part is the activation function circuit, for which we designed four different circuits. Proteus simulation results indicate that all four circuits can implement the ReLU function well. With the precision rectifier circuit selected, the output voltage has a strong ReLU relationship with the input, while the other circuits have their own advantages and disadvantages. Additionally, the diode clamp circuit approximately realizes the ELU function, which provides a new idea for the design of analog neurons.
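A behavioural reference for the circuit, assuming ideal components: weighted summation of the inputs followed by an ideal ReLU, which can be compared against the simulated transfer curve. The weights and input values are illustrative only.

# Behavioural sketch of the analog neuron: weighted summation followed by an
# ideal ReLU, as a numerical reference for the circuit's transfer curve.
def relu(v):
    return v if v > 0 else 0.0

def neuron(inputs, weights, bias=0.0):
    s = sum(w * x for w, x in zip(weights, inputs))   # weighted summation stage
    return relu(s + bias)                             # activation stage

print(neuron(inputs=[0.2, -0.5, 1.0], weights=[1.5, 0.8, 0.3]))   # -> 0.2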