检索结果-内蒙古大学图书馆

Detecting faults by tracing companion states in cloud computing systems

Jisuanji Xuebao/Chinese Journal of Computers 2012年第5期35卷 856-870页

作者： Rao, Xiang Wang, Huai-Min Chen, Zhen-Bang Zhou, Yang-Fan Cai, Hua Zhou, Qi Sun, Ting-Tao National Key Laboratory for Parallel and Distributed Processing National University of Defense Technology Changsha 410073 China Shenzhen Research Institute The Chinese University of Hong Kong Shenzhen China Department of Computing Platform Alibaba Cloud Computing Company Hangzhou 310011 China

A common way to construct a fault model is injecting the fault into the system and observing the subsequent symptoms, e. g. event logs. However, fault features would vary during the propagation period, and present different symptoms at different stage of the fault propagation process. The exiting detection window based feature extraction methods can only identify the early symptoms of a fault, but fail to detect the latter symptoms and cause false alarms. To solve the problem, we present a fault feature extraction method, called Companion State Tracer (CSTracer), which consists of 3 integrated steps: (1) pre-process logs to remove the unrelated logs;(2) construct a general identifier for the early symptoms of a fault;(3) construct a finite state machine model for the fault to trace the latter symptoms. CSTracer can persistently monitor a fault after the fault has been identified. We have justified the effectiveness of CSTracer in an enterprise cloud system. Compared with the existing, the results show that CSTracer has a better detection accuracy.

关键词： Fault detection

来源：评论

学校读者我要写书评

暂无评论

Short fragment sequence alignment on the HP-SEE infrastructure

Short fragment sequence alignment on the HP-SEE infrastructu...

引用

Proceedings of the International Convention MIPRO

作者： Miklos Kozlovszky Gergely Windisch Ákos Balaskó Laboratory of Parallel and Distributed Computing MTA SZTAKI Budapest Hungary John von Neumann Faculty of Informatics Óbuda University Budapest Hungary

The recently used deep sequencing techniques represent a new data processing challenge: mapping short fragment reads to open-access eukaryotic genomes at the scale of several hundred thousand. This problem is solvable by BLAST, BWA and similar sequence alignment tools. BLAST is one of the most frequently used tool in bioinformatics and BWA is a relative new fast light-weighted tool that aligns effectively short sequences. Local installations of these algorithms are typically not able to handle large problem size therefore the sequence alignment process runs slowly, while web based implementations cannot accept high number of queries. HP-SEE infrastructure allows accessing massively parallel supercomputing infrastructure. With gUSE/WS-PGRADE we have created successfully an online Bioinformatics eScience Gateway, which is capable to serve the short fragment sequence alignment demand of the regional bioinformatics communities within the SEE region. Using workflows we have ported algorithms (BLAST and BWA) to the massively parallel HP-SEE infrastructure. In this paper we describe the created Bioinformatics eScience Gateway, and show as case study how we have implemented the ported BLAST workflow using parameter study. With our online service, researchers can do high throughput sequence alignments against the eukaryotic genomes to search for regulatory mechanisms controlled by short fragments on HP-SEE's supercomputing infrastructure.

关键词： Bioinformatics Logic gates Europe Communities Portals Graphical user interfaces Educational institutions

来源：评论

学校读者我要写书评

暂无评论

Campaign scheduling

Campaign scheduling

引用

International Conference on High Performance computing

作者： Vinicius Pinheiro Krzysztof Rzadca Denis Trystram Laboratory for Parallel and Distributed Computing University of São Paulo Brazil Institute of Informatics University of Warsaw Poland Institut Universitaire de France France

We study the problem of scheduling in parallel systems with many users. We analyze scenarios with many submissions issued over time by several users. These submissions contain one or more jobs; the set of submissions are organized in successive campaigns. Jobs belonging to a single campaign are sequential and independent, but any job from a campaign cannot start until all the jobs from the previous campaign are completed. Each user's goal is to minimize the sum of flow times of his campaigns. We define a theoretical model for Campaign scheduling and show that, in the general case, it is NP-hard. For the single-user case, we show that an ρ-approximation scheduling algorithm for the (classic) parallel job scheduling problem is also an ρ-approximation for the Campaign scheduling problem. For the general case with k users, we establish a fairness criterion inspired by time sharing. We propose FAIRCAMP, a scheduling algorithm which uses campaign deadlines to achieve fairness among users between consecutive campaigns. We prove that FAIRCAMP increases the flow time of each user by a factor of at most k ρ compared with a machine dedicated to the user. We also prove that FAIRCAMP is a ρ-approximation algorithm for the maximum stretch. By simulation, we compare FAIRCAMP to the First-Come-First-Served (FCFS). We show that, compared with FCFS, FAIRCAMP reduces the maximum stretch by up to 3.4 times. The difference is significant in systems used by many (k > 5) users. Our results show that, rather than just individual, independent jobs, campaigns of jobs can be handled by the scheduler efficiently and fairly.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Evaluation and comparison of cell nuclei detection algorithms

Evaluation and comparison of cell nuclei detection algorithm...

引用

IEEE International Conference on Intelligent Engineering Systems (INES)

作者： Sándor Szénási Zoltán Vámossy Miklós Kozlovszky Doctoral School of Applied Informatics Óbuda University Budapest Hungary John von Neumann Faculty of Informatics Óbuda University Budapest Hungary Laboratory of Parallel and Distributed Computing MTA SZTAKI Budapest Hungary

The processing of microscopic tissue images and especially the detection of cell nuclei is nowadays done more and more using digital imagery and special immunodiagnostic software products. Since several methods (and applications) were developed for the same purpose, it is important to have a measuring number to determine which one is more efficient than the others. The purpose of the article is to develop a generally usable measurement number that is based on the “gold standard” tests used in the field of medicine and that can be used to perform an evaluation using any of image segmentation algorithms. Since interpreting the results themselves can be a pretty time consuming task, the article also contains a recommendation for the efficient implementation and a simple example to compare three algorithms used for cell nuclei detection.

关键词： Accuracy Algorithm design and analysis Image segmentation Time measurement Gold Standards Classification algorithms

来源：评论

学校读者我要写书评

暂无评论

Preparing initial population of genetic algorithm for region growing parameter optimization

Preparing initial population of genetic algorithm for region...

引用

International Symposium on Logistics and Industrial Informatics, LINDI

The processing of microscopic tissue images is nowadays done more and more using special immunodiagnostic-evaluation software products. Often to evaluate the samples, the first step is determining the number and location of cell nuclei. To do this, one of the most promising methods is the region growing, but this algorithm is very sensitive to the appropriate setting of different parameters. Due to the large number of parameters and due to the big set of possible values setting those parameters manually is a quite hard task, so we developed a genetic algorithm to optimize these values. The first step of the development is the statistical analysis of the parameters, and the determination of the important features, to extract valuable information for a to-be-implemented genetic algorithm that will perform the optimization.

关键词： Optimization Genetic algorithms Gold Standards Medical services Shape Databases

来源：评论

学校读者我要写书评

暂无评论

A science gateway getting ready for serving the international molecular simulation community 2

A science gateway getting ready for serving the internationa...

引用

2012 EGI Community Forum / EMI 2nd Technical Conference, EGICF-EMITC 2012

作者： Gesing, Sandra Herres-Pawlis, Sonja Birkenheuer, Georg Brinkmann, André Grunzke, Richard Kacsuk, Peter Kohlbacher, Oliver Kozlovszky, Miklos Krüger, Jens Müller-Pfefferkorn, Ralph Schäfer, Patrick Steinke, Thomas Center for Bioinformatics Department of Computer Science University of Tübingen Sand 14 Tübingen72076 Germany Department of Chemistry Ludwig-Maximilians-University Munich Butenandtstr. 5-13 München81377 Germany Paderborn Center for Parallel Computing University of Paderborn Warburger Str. 100 Paderborn33089 Germany Johannes Gutenberg-University Mainz Mainz55099 Germany Center for Information Services and High Performance Computing Technische Universität Dresden Zellescher Weg 12-14 Germany Laboratory of Parallel and Distributed Systems MTA SZTAKI Kende Street 13-17 Budapest1111 Hungary Zuse Institute Berlin Takustraße 7 Berlin14195 Germany

The project MoSGrid (Molecular Simulation Grid) has been developing a web-based science gateway supporting the community with various services for quantum chemistry, molecular modeling, and docking. Users gain access to distributed computing infrastructures (DCIs) via intuitive user interfaces for sophisticated tools, specialized workflows, and distributed repositories. Currently, the MoSGrid community consists of about 120 users from a number of fields related to chemistry and bioinformatics located in Germany. However, the underlying security infrastructure is generally applicable and can be deployed in arbitrary projects. MoSGrid intends to address the international community by participating in the EU-projects SCI-BUS (Scientific gateway Based User Support) and ER-flow (Building an European Research Community through Interoperable Workflows and Data), and collaborating with the EU-project EDGI (European Desktop Grid Initiative). © Copyright owned by the author(s) under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike Licence.

关键词： Quantum chemistry

来源：评论

学校读者我要写书评

暂无评论

Performance problems online detection in cloud computing systems via analyzing request execution paths

Performance problems online detection in cloud computing sys...

引用

Workshop on Proactive Failure Avoidance, Recovery, and Maintenance

作者： Mi, Haibo Wang, Huaimin Yin, Gang Cai, Hua Zhou, Qi Sun, Tingtao National Laboratory for Parallel and Distributed Processing National University of Defense Technology Changsha China Computing Platform Alibaba Cloud Computing Company Hangzhou China

ISBN: (纸本)9781457703751

It is quite a headache for developers to online detect performance problems in large-scale cloud computing systems. The behavior and the hidden connections among the huge amount of runtime request execution paths in cloud computing systems usually contain useful information for performance problem detection. In this paper, we propose an approach to rapidly diagnose the source of performance degradation in large-scale non-stop cloud computing systems. The approach first groups the user requests into categories with a fast clustering algorithm;then applies the principal components analysis to extract the primary methods;finally compares the normal and abnormal behaviors of the primary methods to localize the main cause of performance problems. We conduct extensive experiments over a real-world enterprise system providing services for the public. The results show that our approach can locate the prime causes of performance problems accurately and efficiently.1 © 2011 IEEE.

关键词： Cloud computing

来源：评论

学校读者我要写书评

暂无评论

Identifying faults in large-scale distributed systems by filtering noisy error logs

Identifying faults in large-scale distributed systems by fil...

引用

Workshop on Proactive Failure Avoidance, Recovery, and Maintenance

作者： Rao, Xiang Wang, Huaimin Shi, Dianxi Chen, Zhenbang Cai, Hua Zhou, Qi Sun, Tingtao National Laboratory for Parallel and Distributed Processing National University of Defense Technology Changsha 410073 China Computing Platform Alibaba Cloud Computing Corporation Hangzhou China

ISBN: (纸本)9781457703751

Extracting fault features with the error logs of fault injection tests has been widely studied in the area of large scale distributed systems for decades. However, the process of extracting features is severely affected by a large amount of noisy logs. While the existing work tries to solve the problem by compressing logs in temporal and spatial views or removing the semantic redundancy between logs, they fail to consider the co-existence of other noisy faults that generate error logs instead of injected faults, for example, random hardware faults, unexpected bugs of softwares, system configuration faults or the error rank of a log severity. During a fault feature extraction process, those noisy faults generate error logs that are not related to a target fault, and will strongly mislead the resulted fault features. We call an error log that is not related to a target fault a noisy error log. To filter out noisy error logs, we present a similarity-based error log filtering method SBF, which consists of three integrated steps: (1) model error logs into time series and use haar wavelet transform to get the approximate time series;(2) divide the approximate time series into sub time series by valleys;(3) identify noisy error logs by comparing the similarity between the sub time series of target error logs and the template of noisy error logs. We apply our log filtering method in an enterprise cloud system and show its effectiveness. Compared with the existing work, we successfully filter out noisy error logs and increase the precision and the recall rate of fault feature extraction.1 © 2011 IEEE.

关键词： Errors

来源：评论

学校读者我要写书评

暂无评论

Network and service management and diagnostics solution of a remote patient monitoring system

Network and service management and diagnostics solution of a...

引用

3rd IEEE International Symposium on Logistics and Industrial Informatics, LINDI 2011

作者： Kozlovszky, Miklos Meixner, Zsolt Windisch, Gergely Márton, Judit Ács, Sándor Bogdanov, Pál Boruzs, Anikó Kotcauer, Péter Ferenczi, János Kozlovszky, Viktor MTA SZTAKI Laboratory of Parallel and Distributed Computing Budapest Hungary Óbuda University John von Neumann Faculty of Informatics Budapest Hungary

ISBN: (纸本)9781457718410

We have developed a combined network and service management and diagnostics solution for our in-house developed remote patient monitoring system. The developed system has included into the ALPHA eHealth/remote patient monitoring system and was successfully used in our large Living Lab infrastructure operating in three different Hungarian regions with 40 patients. In this paper we will identify the key elements of the combined Network and Service Management solution of the remote patient monitoring system. © 2011 IEEE.

关键词： Telemedicine

来源：评论

学校读者我要写书评

暂无评论

GPGPU-based data parallel region growing algorithm for cell nuclei detection

GPGPU-based data parallel region growing algorithm for cell ...

引用

12th IEEE International Symposium on Computational Intelligence and Informatics, CINTI 2011

作者： Szénási, Sándor Vámossy, Zoltán Kozlovszky, Miklós Óbuda University John Von Neumann Faculty of Informatics Budapest Hungary MTA SZTAKI Laboratory of Parallel and Distributed Computing Budapest Hungary

ISBN: (纸本)9781457700453

Nowadays microscopic analysis of tissue samples is done more and more by using digital imagery and special immunodiagnostic software. These are typically specific applications developed for one distinct field, but some subroutines are commonly repeated, for example several applications contain steps that can detect cell nuclei in a sample image. The aim of our research is developing a new data parallel algorithm that can be implemented even in a GPGPU environment and that is capable of counting hematoxylin eosin (HE) stained cell nuclei and of identifying their exact locations and sizes (using a variation of the region growing method). Our presentation contains the detailed description of the algorithm, the peculiarity of the CUDA implementation, and the evaluation of the created application (regarding its accuracy and the decrease in the execution time). © 2011 IEEE.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：