Electronic sports, or pro gaming, have become very popular in this millennium, and the increased value of this new industry is attracting investors with various interests. One of these interests is game betting, which requ...
Astrophysical databases have used proprietary formats (especially the FITS format) to represent measured data and related metadata. The design of the FITS format was influenced by punch cards, thus it is extremely ina...
During the tools have been also influenced by this evolutionary fashion. In this paper, we give a broader view of the application of techniques inspired by nature to hardware design and parallel architectures problem ...
In spite of the wealth of existing data distribution methods, most parallel programming languages support only some form of cyclic blockwise distribution. The main reason why only this single method is supported is that it is relatively simple to implement. However, it is as yet unclear whether cyclic blockwise distribution is sufficiently powerful for a wide class of distribution problems. In this paper the method will be analysed, showing that for a wide range of problems it is indeed sufficient. It will also be shown in which cases cyclic blockwise distribution can be expected to fail. From this analysis, it is possible to formulate practical guidelines to assist programmers in choosing the cycle frequency for cyclic blockwise distribution that leads to an optimal result.
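As a rough illustration of what a cyclic blockwise distribution does, the sketch below maps global array indices to processors. It is a minimal example under assumptions, not the analysis from the paper: the function names and the `block_size` parameter are hypothetical, and `block_size` is used here as a stand-in for the tunable cycle parameter the abstract mentions.

```python
def block_cyclic_owner(index, block_size, num_procs):
    """Return the processor that owns a global index.

    Consecutive blocks of `block_size` elements are dealt out to the
    processors in round-robin order (assumed convention).
    """
    return (index // block_size) % num_procs


def local_index(index, block_size, num_procs):
    """Return the position of a global index within its owner's local array."""
    block = index // block_size
    return (block // num_procs) * block_size + index % block_size


def distribute(n, block_size, num_procs):
    """List, per processor, the global indices it owns for an array of length n."""
    owned = [[] for _ in range(num_procs)]
    for i in range(n):
        owned[block_cyclic_owner(i, block_size, num_procs)].append(i)
    return owned
```

With `block_size=1` the scheme degenerates to a purely cyclic distribution, and with one block per processor it becomes a plain block distribution, so the single parameter spans both extremes.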
ISBN: (print) 9781450307048
We consider the problem of optimizing the execution of data-intensive scientific workflows in the Cloud. We address the problem under the following scenario. The tasks of the workflows communicate through files; the output of a task is used by another task as an input file, and if these tasks are assigned to different execution sites, a file transfer is necessary. The output files are to be stored at a site. Each execution site is to be assigned a certain percentage of the files and tasks. These percentages, called target weights, are pre-determined and reflect either user preferences or the storage capacity and computing power of the sites. The aim is to place the data files at, and assign the tasks to, the execution sites so as to reduce the cost associated with the file transfers, while complying with the target weights. To do this, we model the workflow as a hypergraph and, with a hypergraph-partitioning-based formulation, we propose a heuristic which generates data placement and task assignment schemes simultaneously. We report simulation results on a number of real-life and synthetically generated scientific workflows. Our results show that the proposed heuristic is fast, and can find mappings and assignments which reduce file transfers, while respecting the target weights. Copyright 2011 ACM.
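To make the objective concrete, the sketch below evaluates a given placement: it sums the file transfers a placement implies and reports each site's share of the tasks or files for comparison against the target weights. This is only an illustration under assumptions, not the heuristic or the exact hypergraph model from the paper. The cost rule (a file is moved once to every other site that runs a task touching it) and all names are hypothetical.

```python
from collections import Counter


def transfer_cost(file_users, file_size, task_site, file_site):
    """Sum the sizes of all file copies that must cross site boundaries.

    file_users: file -> tasks that produce or consume it
    file_size:  file -> size (any consistent unit)
    task_site:  task -> execution site the task is assigned to
    file_site:  file -> site where the file is stored
    """
    cost = 0
    for f, tasks in file_users.items():
        # Sites other than the file's home that run a task touching the file
        remote_sites = {task_site[t] for t in tasks} - {file_site[f]}
        cost += file_size[f] * len(remote_sites)
    return cost


def site_shares(assignment):
    """Return the fraction of items (tasks or files) placed on each site."""
    counts = Counter(assignment.values())
    total = len(assignment)
    return {site: n / total for site, n in counts.items()}
```

For example, with tasks `t1` and `t2` on site `A`, task `t3` on site `B`, file `f1` (used by `t1`, `t2`) stored on `A`, and file `f2` (used by `t2`, `t3`) stored on `B`, only `f2` has to be transferred, and the task shares are two thirds for `A` and one third for `B`.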
The vast majority of nuclear and particle physicists in the world are currently using the ROOT framework (developed at CERN) as a software platform for simulations and data evaluations. Some of the simulations and exper...
External sorting methods, which are designed to order large amounts of data stored in persistent memory, have been well known for decades. These methods were originally designed for systems with a small amount of operating (internal) memory and magnetic tapes used as external memory. Data on magnetic tapes has to be accessed in a strictly serial manner, and this limitation shaped the external sorting algorithms. In time, magnetic tapes were replaced with hard drives, which are now being replaced with solid state drives. Furthermore, the amount of operating memory in mainstream servers has increased by orders of magnitude, and the future may hold even more impressive innovations such as non-volatile memories. As a result, most of the assumptions of the external sorting algorithms are no longer valid, and these methods need to be updated to better reflect the hardware of the day. In this work, we critically evaluate the original assumptions in an empirical manner and propose possible improvements.
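For readers unfamiliar with the classical scheme being re-examined, here is a minimal sketch of a textbook external merge sort: sorted runs that fit in memory are written to temporary files and then combined in a single k-way merge. It is not the improved method proposed in the paper. The function names and the `run_size` parameter are hypothetical, and integers stored one per line stand in for real records.

```python
import heapq
import os
import tempfile


def _write_run(sorted_values):
    """Write one sorted run to a temporary file and return its path."""
    fd, path = tempfile.mkstemp(suffix=".run")
    with os.fdopen(fd, "w") as f:
        f.writelines(f"{v}\n" for v in sorted_values)
    return path


def _read_run(path):
    """Stream the integers of a run back from disk, one at a time."""
    with open(path) as f:
        for line in f:
            yield int(line)


def external_sort(values, run_size):
    """Yield the integers of `values` in ascending order.

    At most `run_size` values are held in memory while runs are created;
    the merge phase reads every run strictly sequentially.
    """
    run_paths = []
    buffer = []
    for v in values:
        buffer.append(v)
        if len(buffer) >= run_size:
            run_paths.append(_write_run(sorted(buffer)))
            buffer = []
    if buffer:
        run_paths.append(_write_run(sorted(buffer)))
    try:
        # k-way merge of all runs using a heap of the current run heads
        yield from heapq.merge(*(_read_run(p) for p in run_paths))
    finally:
        for p in run_paths:
            os.remove(p)
```

The strictly sequential access pattern in the merge phase is the tape-era assumption the abstract refers to; on hardware with cheap random access and large memory, choices such as the run size and the merge fan-in can be revisited.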
ISBN: (print) 9788001054826
Current web technologies have been leaping forward, especially since the introduction of HTML5. The web browsers of the day implement various APIs for client-side scripts, such as elaborate data storage or advanced network connectivity. We propose a way to combine these technologies to create distributed data storage using the web environment. We have implemented a prototype framework as a proof of concept and explored the most problematic issues, which require further research. Systems that use this storage may benefit from the fact that users do not need to install or configure any type of client application, since the only thing required is the web browser. Web-based distributed data storage can be used as an extension of server storage, for data caching, and in various other applications.
In the past decade, automated astronomical observatories have collected huge amounts of data which can no longer be explored by astronomers individually. In our case, we deal with optical spectra produced by multi-object l...
ISBN: (print) 9781515120650
Distributed computing and the cloud phenomenon have become intensively studied topics in the past decade. These technologies have been wrapped in attractive business models, where the customer pays only for the resources or services which have actually been utilized. Even though this popularity led to rapid development of distributed algorithms, virtualization platforms, and various cloud services, many issues are still waiting to be solved. One of these issues is the question of power efficiency. In this paper, we investigate the possibilities of applying single-board computers as a platform for distributed systems and cloud computing. These small devices (such as the Raspberry Pi) are quite power efficient and relatively cheap, so they may reduce the overall cost of cloud services. Furthermore, they may be employed to create small clusters that could replace traditional enterprise servers and achieve lower cost and better robustness for some tasks.