检索结果-内蒙古大学图书馆

40th IEEE International Conference on distributed Computing systems (ICDCS)

作者： Xu, Lijie Ye, Xingtong Kang, Kai Guo, Tian Dou, Wensheng Wang, Wei Wei, Jun Chinese Acad Sci Inst Software State Key Lab Comp Sci Guangzhou Peoples R China Univ Chinese Acad Sci Beijing Peoples R China Chinese Acad Sci Inst Software Technol Nanjing Peoples R China Worcester Polytech Inst Worcester MA 01609 USA

ISBN: (纸本)9781728170022

Stream clustering is an important data mining technique to capture the evolving patterns in real-time data streams. Today's data streams, e.g., IoT events and Web clicks, are usually high-speed and contain dynamically-changing patterns. Existing stream clustering algorithms usually follow an online-offline paradigm with a one-record-at-a-time update model, which was designed for running in a single machine. These stream clustering algorithms, with this sequential update model, cannot be efficiently parallelized and fail to deliver the required high throughput for stream clustering. In this paper, we present DistStream, a distributed framework that can effectively scale out online-offline stream clustering algorithms. To parallelize these algorithms for high throughput, we develop a mini-batch update model with efficient parallelization approaches. To maintain high clustering quality, DistStream's mini-batch update model preserves the update order in all the computation steps during parallel execution, which can reflect the recent changes for dynamically-changing streaming data. We implement DistStream atop Spark Streaming, as well as four representative stream clustering algorithms based on DistStream. Our evaluation on three real-world datasets shows that DistStream-based stream clustering algorithms can achieve sublinear throughput gain and comparable (99%) clustering quality with their single-machine counterparts.

关键词： Stream clustering data stream scalability

来源：评论

学校读者我要写书评

暂无评论

An OpenMP parallel Genetic Algorithm for Design Space Exploration of Heterogeneous Multi-processor Embedded systems 2020

An OpenMP Parallel Genetic Algorithm for Design Space Explor...

引用

11th workshop on parallel Programming and Run-time Management Techniques for Many-Core Architectures / 9th workshop on Design Tools and Architectures for Multicore Embedded Computing Platforms, PARMA-DITAM 2020

作者： Muttillo, Vittoriano Giammatteo, Paolo Fiorilli, Giuseppe Pomante, Luigi University of L’Aquila L’Aquila Italy

ISBN: (纸本)9781450375450

Heterogeneous multiprocessor platforms are becoming widespread in the embedded system domain, mainly for the opportunity to improve timing performance and to minimize energy/power consumption and costs. Therefore, when using such platforms, it is important to adopt a Design Space Exploration (DSE) strategy that considers compromises among different objectives. Existing DSE approaches are generally based on evolutionary algorithms to solve Multi-Objective Optimization Problems (MOOPs) by minimizing a linear combination of weighted cost functions (i.e., Weighted Sum Method, WSM). In this way, the main issues are related to reduce timing execution while trying to improve the evolutionary algorithm performance, introducing strategies that attempt to bring better solutions. Code parallelization is one of the most used approaches in this field, but no standard methods have been released since different aspects could affect the performance. This approach leads to exploit parallel and distributed processing elements in order to implement evolutionary algorithms. In the latter case, if we consider genetic algorithms, it is possible to talk about parallel Genetic Algorithms (PGA). Considering this context, this paper focuses on DSE for heterogeneous multi-processor embedded systems and introduces an improvement that reduces execution time using parallel programming languages (i.e., OpenMP) inside the main genetic algorithm approach, while trying to lead to better partitioning solutions. The descriptions of the adopted DSE activities and the OpenMP implementation, validated by means of a case study, represent the core of the paper. © 2020 Association for Computing Machinery.

关键词： Multiobjective optimization

来源：评论

学校读者我要写书评

暂无评论

1st International workshop on distributed Computing for Emerging Smart Networks, DiCES-N 2019

1st International Workshop on Distributed Computing for Emer...

引用

1st International workshop on distributed Computing for Emerging Smart Networks, DiCES-N 2019

ISBN: (纸本)9783030401306

The proceedings contain 9 papers. The special focus in this conference is on distributed Computing for Emerging Smart Networks. The topics include: A Comparative Study of Vehicle Detection Methods in a Video Sequence;energy Efficient Handshake Algorithm for Wireless Sensor Networks;Inter-slice Mobility Management in the Context of SDN/NFV Networks;on a New Quantization Algorithm for Secondary User Scheduling in 5G Network;an Efficient Fault-Tolerant Scheduling Approach with Energy Minimization for Hard real-time Embedded systems;using Dynamic Bayesian Networks to Solve Road Traffic Congestion in the Sfax City;energy Efficient Target Coverage in Wireless Sensor Networks Using Adaptive Learning.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Toward Ultra-large Scale Neural Spike Sorting with distributed Sorting Channels and Unsupervised Training

Toward Ultra-large Scale Neural Spike Sorting with Distribut...

引用

IEEE International Symposium on Circuits and systems

作者： Junhong Sun Tianhao Li Tongtong Guo Yongfu Li Changyun Fu Yan Liu Deren of Micro-Nno Eecronics Shnghi Jio Tong Universiy ChinDeren of Micro-Nno Eecronics Shnghi Jio Tong Universiy ChinDeren of Micro-Nno Eecronics Shnghi Jio Tong Universiy ChinDeren of Micro-Nno Eecronics Shnghi Jio Tong Universiy ChinDeren of Micro-Nno Eecronics Shnghi Jio Tong Universiy ChinDeren of Micro-Nno Eecronics Shnghi Jio Tong Universiy Chin

ISBN: (数字)9781665484855

ISBN: (纸本)9781665484862

Brain machine interface systems will require recording thousands of neural channels in parallel to acquire large scale neuronal activity. High bandwidth action potential signal will overload the data communication bandwidth, and on-site spike sorting can extract essential information, however, requires extensive computational resources to achieve high classification accuracy. This demands for high resources consuming, especially in large-scale real-time sorting systems. In this work, a customized unsupervised training engine incorporated with distributed and optimized sorting channels is presented in order to reduce the hardware complexity without compromising the accuracy of spike sorting. A mixed-domain feature set is extracted in each channel, followed by feature based sorting. Each channel will constantly monitor the sorting accuracy and will request training engine intervention when in need. The proposed system is implemented in a 180 nm CMOS process, consuming only 0.33 W/channel with a clock of 25 kHz and power supply of 1.8 V, and in-channel sorting occupies 0.0023 mm~2, with training engines occupying 1.956 mm~2, which can be shared by all the channels.

关键词： Neural spike sorting unsupervised training feature extraction

来源：评论

学校读者我要写书评

暂无评论

Analysis of parallel M5P and Random Forest Regression for Visualization of Traffic Behavior 1

引用

2nd International Conference on Computational Intelligence in Pattern Recognition (CIPR)

作者： Mudali, Prateek Roopa, J. Raju, M. Govinda Yadav, Akhilesh RV Coll Engn Dept Elect & Commun Engn Bengaluru Karnataka India Aptus Data Lab Bengaluru 560103 Karnataka India

ISBN: (数字)9789811524493

ISBN: (纸本)9789811524493;9789811524486

Traffic data collection and information extraction have been a wide area of study for various objectives. One such objective is to predict the nature of traffic in a particular road region followed by its visualization. The primary objective of this paper is to analyze the traffic big data using two comparative parallel algorithms of M5P rules and random forest regression for determining the average journey time based on other parameters related to nature of traffic such as flow, time of the day. These algorithms have been implemented in a distributed computing environment in Spark clusters using Apache Mesos resource management. The secondary objective of the paper is to visualize the correlation of average journey time with the flow of traffic and plotting comparative graphs for the real and predicted values of the average journey time. Based on root-mean-square error, mean absolute error, and other performance parameters like correlation coefficient, this paper concludes that parallel algorithms fared better in terms of prediction accuracy and error rates than traditional regression methods.

关键词： Traffic big data M5P Random forest regression

来源：评论

学校读者我要写书评

暂无评论

Model-based Code Generation Framework for parallel and distributed Embedded systems

Model-based Code Generation Framework for Parallel and Distr...

引用

作者： EunJin Jeong Seoul National University

学位级别：博士

While various software development methodologies have been proposed to increase the design productivity and maintainability of software, they usually focus on the develop- ment of application software running on a single processing element, without concern about the non-functional requirements of an embedded system such as latency and re- source requirements. In this thesis, we present a model-based software development method for paral- lel and distributed embedded systems. An application is specified as a set of tasks that follow a set of given rules for communication and synchronization in a hierarchical fash- ion, independently of the hardware platform. Having such rules enables us to perform static analysis to check some software errors at compile time to reduce the verification difficulty. Platform-specific program is synthesized automatically after mapping of tasks onto processing elements is determined. The program synthesizer is also proposed to generate codes which satisfies platform requirements for parallel and distributed embedded systems. As multiple models which can express dynamic behaviors can be depicted hierarchically, the synthesizer supports to manage multiple task graphs with a different hierarchy to run tasks with parallelism. Also, the synthesizer shows methods of managing codes for heterogeneous platforms and generating various communication methods. The viability of the proposed software de- velopment method is verified with a real-life surveillance application that runs on six pro- cessing elements with three remote communication methods, and remote deep learning example is conducted to use heterogeneous multiprocessing components on distributed systems. Also, supporting a new platform and network requires a small effort by measur- ing and estimating development costs. Since tolerance to unexpected errors is a required feature of many embedded sys- tems, we also support an automatic fault-tolerant code generation. Fault tolerance can be ap

关键词：

来源：评论

学校读者我要写书评

暂无评论

气象格点数算一体空间分析库的设计与实现

引用

应用气象学报 2025年第1期36卷 121-128页

作者：王舒徐拥军何文春吴焕萍高峰刘媛媛刘北吕冠儒倪学磊国家气象信息中心北京100081 国家气候中心北京100081 湖南省气象信息中心长沙410118

气象格点数据通常以文件形式存储在分布式文件库中,业务系统在使用过程中需要将文件下载到本地,对文件解析后再进行分析计算。这种方式导致数据检索困难、响应时间长、无法满足业务在线计算及交互式应用需求。为此,2022年底国家气象信... 详细信息

气象格点数据通常以文件形式存储在分布式文件库中,业务系统在使用过程中需要将文件下载到本地,对文件解析后再进行分析计算。这种方式导致数据检索困难、响应时间长、无法满足业务在线计算及交互式应用需求。为此,2022年底国家气象信息中心基于天擎空间分析库研发完成了分布式环境下气象格点数据与计算集成的数算一体数据库——Post Grid,该数据库包含数据层和算子层。数据层将气象格点数据在要素、起报、预报、空间、层次、样本等维度上的拆分后统一规范化存储,提高数据库的数据读取和分析效率。算子层通过数据库中的SQL函数实现,支持在数据库内部对格点数据进行各种操作,且算子支持分布式并行计算。性能测试和业务应用结果表明:Post Grid数据库能将传统的聚合计算服务时效由分钟级提升至毫秒级,极大提高了气象格点数据服务的性能、灵活性和数算一体能力,具有广泛应用价值。

关键词：数算一体气象格点数据 Post Grid 并行计算分布式

来源：评论

学校读者我要写书评

暂无评论

Pattern Sampling in distributed Databases 24th

Pattern Sampling in Distributed Databases

引用

24th East-European Conference on Advances in Databases and Information systems/24th International Conference on Theory and Practice of Digital Libraries/16th workshop on Business Intelligence and Big Data (ADBIS/TPDL/EDA)

作者： Diop, Lamine Diop, Cheikh Talibouya Giacometti, Arnaud Soulet, Arnaud Univ Tours Tours France Univ Gaston Berger St Louis St Louis Senegal

ISBN: (纸本)9783030548315;9783030548322

Many applications rely on distributed databases. However, only few discovery methods exist to extract patterns without centralizing the data. In fact, this centralization is often less expensive than the communication of extracted patterns from the different nodes. To circumvent this difficulty, this paper revisits the problem of pattern mining in distributed databases by benefiting from pattern sampling. Specifically, we propose the algorithm DDSAMPLING that randomly draws a pattern from a distributed database with a probability proportional to its interest. We demonstrate the soundness of DDSAMPLING and analyze its time complexity. Finally, experiments on benchmark datasets highlight its low communication cost and its robustness. We also illustrate its interest on real-world data from the Semantic Web for detecting outlier entities in DBpedia and Wikidata.

关键词： Database systems

来源：评论

学校读者我要写书评

暂无评论

FJoin:一种基于FPGA的流连接并行加速器

引用

中国科学：信息科学 2022年第2期52卷 314-333页

作者：林力韬陈汉华金海华中科技大学计算机科学与技术学院大数据技术与系统国家地方联合工程研究中心服务计算技术与系统教育部重点实验室集群与网格计算湖北省重点实验室武汉430074

流连接广泛应用于提取多源流数据之间的关键信息,是大数据处理的重要支撑技术.但连接两条大数据流时大规模的连接谓词计算,使其易成为性能瓶颈.为提高处理性能,流连接系统常采用并行和分布式两种方式扩展.然而,采用多核并行的流连接系... 详细信息

流连接广泛应用于提取多源流数据之间的关键信息,是大数据处理的重要支撑技术.但连接两条大数据流时大规模的连接谓词计算,使其易成为性能瓶颈.为提高处理性能,流连接系统常采用并行和分布式两种方式扩展.然而,采用多核并行的流连接系统的扩展性受到CPU核数限制,无法应对大规模数据流.采用分布式扩展的流连接系统由于引入分布式框架运行的开销,导致硬件处理效率严重下降.为实现高效大规模扩展,本文提出一种利用FPGA加速器外设向上扩展的流连接系统FJoin.加速器可进行高并行的流动连接,载入多个流元组后,连接窗口中的数据流经一次即可完成所有连接计算.对于逻辑容易在FPGA实现的连接谓词,通过大量基本连接单元串联构成深度连接流水线,实现大规模并行.通过主机CPU和FPGA设备协同进行连接控制,将连续的流连接计算划分为独立的小批量任务,高效地保证并行化流连接的完整性.在装备FPGA加速卡的平台实现了FJoin,基于大规模真实数据集的测试结果表明,对比部署在40个节点集群上的目前最好的分布式流连接系统,本文提出的流连接加速器FJoin可在单一FPGA加速卡上将连接计算速度提升16倍,达到5倍的系统吞吐,且时延满足实时流处理要求.

关键词：流连接 FPGA 流处理硬件加速并行计算

来源：评论

学校读者我要写书评

暂无评论

Requirements for control strategies of grid-connected converters in the future power system

引用

IET RENEWABLE POWER GENERATION 2020年第8期14卷 1288-1295页

作者： Emanuel, Hanna Brombach, Johannes Rosso, Roberto Pierros, Konstantinos ENERCON GmbH Sales Grid Integrat Bremen Germany WRD GmbH Control Syst Aurich Germany

The increasing penetration of converter-based generation in many power systems around the world has sparked a discussion about how to operate these power systems with the usual levels of efficiency, reliability and cost-effectiveness. Current grid-following converter-based generators have proven to run stably in parallel to one another, even if there are thousands of them connected in a power system, and even in very small isolated power systems with extremely low system inertia. Discussions around the necessity of additional converter performance, usually under the 'grid-forming' and 'Virtual Synchronous Machines' concepts, have recently been transferred from the academic sphere to national and international industry fora. Formal discussions have started in Great Britain, in Germany and at ENTSO-E level. However, there is still a lot of uncertainty about the real and not simulated performance of grid-forming converters, whilst the needs case for requiring this radically different control method has not been adequately justified. With the present paper we raise key questions that will serve towards an objective discussion about power system needs, grid infeed technologies and their interaction.

关键词： distributed power generation power grids power convertors synchronous machines converter-based generation isolated power systems low system inertia grid-forming converters grid-connected converters virtual synchronous machines grid-forming concepts Great Britain Germany ENTSO-E level grid infeed technology grid-forming concept

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：