This paper identifies the high value to researchers in many disciplines of having web-based graphical editors for scientific workflows and draws attention to two technological transitions: good quality editors can now...
详细信息
ISBN:
(纸本)9781450325028
This paper identifies the high value to researchers in many disciplines of having web-based graphical editors for scientific workflows and draws attention to two technological transitions: good quality editors can now run in a browser and workflow enactment systems are emerging that manage multiple workflow languages and support multi-lingual workflows. We contend that this provides a unique opportunity to introduce multi-lingual graphical workflow editors which in turn would yield substantial benefits: workflow users would find it easier to share and combine methods encoded in multiple workflow languages, the common framework would stimulate conceptual convergence and increased workflow component sharing, and the many workflow communities could share a substantial part of the effort of delivering good quality graphical workflow editors in browsers. The paper examines whether such a common framework is feasible and presents an initial design for a web-based editor, tested with a preliminary prototype. It is not a fait accompli but rather an urgent rallying cry to explore collaboratively a generic web-based framework before investing in many divergent individual implementations.
Over the last decade, interest in parallel programming has grown tremendously. Hardware systems that contain many different levels of parallelism have become mainstream. At one end of the spectrum, computer system tha...
Over the last decade, interest in parallel programming has grown tremendously. Hardware systems that contain many different levels of parallelism have become mainstream. At one end of the spectrum, computer system that contain many processing cores, each capable of running multiple hardware threads, are becoming commonplace. It is common to find laptop and desktop systems that contain a small number of these Shared-Memory Processor (SMP) chips. Furthermore, high-end computing systems can now contain hundreds of these SMP chips, resulting in machines capable of running an incredibly large number of hardware threads simultaneously. As processor speeds begin to stagnate, software developers are being forced to exploit the parallelism that is available in these systems in order to improve the performance of their *** the other end of the spectrum, as commodity hardware prices fall it is becoming increasingly affordable to build large-scale multi-node distributed machines. A survey of the top 10 supercomputers in the world (***) shows that the systems contain an average of about 400,000 cores running at an average frequency of 2.5GHz. Since the average clock frequency of these machines is fairly low, the full potential of these systems must be exploited through efficient use of the parallelism provided by the thousands of processors they *** types of heterogeneous parallel computing system have begun to emerge. These systems contain multiple types of processors - typically a powerful CPU core and some type of Graphics Processing Unit (GPU) or hardware accelerator unit. This type of heterogeneous system provides a new set of challenges for software developers in terms of how to distribute work among the different units to maximize their utilization, based on their capabilities. Many advances in heterogeneous system design are still being realized as people continue to explore how to combine existing hardware in new and novel ways. Others sti
In linear algebra, Cholesky factorization is useful in solving a system of equations with a symmetric positive definite coefficient matrix. Cholesky factorization is roughly twice as fast relative to LU factorization ...
详细信息
ISBN:
(纸本)9781467345651;9780769549033
In linear algebra, Cholesky factorization is useful in solving a system of equations with a symmetric positive definite coefficient matrix. Cholesky factorization is roughly twice as fast relative to LU factorization which applies to general matrices. In recent years, with advances in technology, a Fermi GPU card can accommodate hundreds of cores compared to the small number of 8 or 16 cores on CPU. Therefore a trend is seen to use the graphics card as a general purpose graphics processing unit (GPGPU) for parallel computation. In this work, Volkov's hybrid implementation of Cholesky factorization is evaluated on the new Fermi GPU with others and then some improvement strategies were proposed. After experiments, compared to the CPU version using Intel Math Kernel Library (MKL), our proposed GPU improvement strategy can achieve a speedup of 3.85x on Cholesky factorization of a square matrix of dimension 10,000.
Recent technological advances are putting increased pressure on CPU scheduling. On one hand, processors have more cores. On the other hand, I/O systems have become more complex. Intensive research has been conducted o...
详细信息
ISBN:
(纸本)9780769546759
Recent technological advances are putting increased pressure on CPU scheduling. On one hand, processors have more cores. On the other hand, I/O systems have become more complex. Intensive research has been conducted on multi/many-core scheduling;however, most of the studies follow the conventional approach and focus on the utilization and load balance of the cores. In this study, we focus on increasing data locality by bringing source information from I/O into the core interrupt scheduling process. The premise is to group interrupts associated for the same I/O request together on the same core, and prove that data locality is more important than core utilization for many applications. Based on this idea, a source-aware affinity interrupt-scheduling scheme is introduced and a prototype system, SAIs, is implemented. Experiment results show that SAIs is feasible and promising;bandwidth shows a 23.57% improvement in a 3-Gigabit NIC environment and in the optimal case without the NIC bottleneck, the bandwidth improvement increases to 53.23%.
Over the past 15 years, data warehousing and OLAP technologies have matured to the point whereby they have become a cornerstone for the decision making process in organizations of all sizes. With the underlying databa...
详细信息
ISBN:
(纸本)9780769546759
Over the past 15 years, data warehousing and OLAP technologies have matured to the point whereby they have become a cornerstone for the decision making process in organizations of all sizes. With the underlying databases growing enormously in size, parallel DBM systems have become a popular target platform. Perhaps the most "obvious" approach to scalable warehousing is to combine a small collection of conventional relational DBMSs into a loosely connected parallel DBMS. Such systems, however, benefit little, if at all, from advances in OLAP indexing, storage, compression, modeling, or query optimization. In the current paper, we discuss a parallel analytics server that has been designed from the ground up as a high performance OLAP query engine. Moreover, its indexing and query processing model directly exploits an OLAP-specific algebra that enables performance optimizations beyond the reach of simple relational DBMS clusters. Taken together, the server provides class-leading query performance with the scalability of shared nothing databases and, perhaps most importantly, achieves this balance with a modest physical architecture.
The proceedings contain 40 papers. The topics discussed include: fair and efficient dead reckoning-based update dissemination for distributed virtual environments;multi-level parallelism for time- and cost-efficient p...
ISBN:
(纸本)9780769547145
The proceedings contain 40 papers. The topics discussed include: fair and efficient dead reckoning-based update dissemination for distributed virtual environments;multi-level parallelism for time- and cost-efficient parallel discrete event simulation on GPUs;dynamically adjusting core frequencies to accelerate time warp simulations in many-core processors;HLA-based parallel simulation: a case study;cloud-based simulation: the state-of-the-art computer simulation paradigm;vector time management based on topology information for HLA/RTI;a simplified belonging tree for optimizing information transmission on large-scale distributed simulations;performance analysis of a multithreaded PDEs simulator on multicore clusters;enhancement of collaborative interest management mechanism for P2P networked virtual environment;parallel simulation of large-scale artificial society on CPU/GPU mixed architecture;and SEMSim: a distributed architecture for multi-scale traffic simulation.
We restate the objective of the first workshop on Collaboration and Cloud Computing" in WETICE 2009;"to analyze current trends in Cloud Computing and identify long-term research themes and facilitate collabo...
详细信息
ISBN:
(纸本)9780769547176
We restate the objective of the first workshop on Collaboration and Cloud Computing" in WETICE 2009;"to analyze current trends in Cloud Computing and identify long-term research themes and facilitate collaboration in future research in the field that will ultimately enable global advancements in the field that are not dictated or driven by the prototypical short term profit driven motives of a particular corporate entity." We are glad to report that the discussions started in 2009 have directly resulted in an alternative approach to self-managing distributed computing systems totally different from current industry trend showing a way to eliminate the complexity of virtual machines and Hypervisors. If this approach is proven to be theoretically sound (as a paper in this conference is investigating) and extend its usefulness (demonstrated through their feasibility in the form of two proofs of concepts in the last conference) to mission critical environments, the DIME network architecture may yet prove to be an important contribution to computer science. In addition to the two papers related to the DIME computing model, there are nine other long papers and two short papers selected out of twenty two submissions discussing various aspects of clouds, grids and their management.
Developing complex technical systems requires a systematic exploration of the given design space in order to identify optimal system configurations. However, studying the effects and interactions of even a small numbe...
详细信息
The proceedings contain 119 papers. The topics discussed include: efficient computation of the Pareto boundary for the two-user single-stream MIMO interference channel;does non-orthogonal spectrum sharing in the same ...
ISBN:
(纸本)9781467309714
The proceedings contain 119 papers. The topics discussed include: efficient computation of the Pareto boundary for the two-user single-stream MIMO interference channel;does non-orthogonal spectrum sharing in the same cell improve the sum-rate of wireless operators?;feedback-efficient agile scheduling-beamforming for network MIMO OFDMA systems with realistic channel models;multi-cell coordination: a stochastic geometry approach;robust detection and optimization with decentralized parallel sensor networks;mapping optimization for the map turbo-detector with channel estimation;rate prediction and receding horizon power minimization in block-fading broadcast channels;supermodular game for energy efficient TOA-based positioning;a Chernoff convexification for chance constrained MIMO training sequence design;and flexible codebook design for limited feedback downlink systems via smooth optimization on the Grassmannian manifold.
暂无评论