the research of a parallelization efficiency of a batch pattern training algorithm of a multilayer perceptron on computational clusters is presented in this paper. the multilayer perceptron model and the usual sequent...
详细信息
this paper presents an efficient approach to parallel pricing of multi-dimensional financial derivatives based on the Black-Scholes Partial Differential Equation (BS-PDE). One of the main challenges for such multi-dim...
详细信息
this paper deals withthe electromagnetic modeling of large and complex electrical structures by means of large scale parallel systems, such as Grid computing and supercomputer. Transmission-Line Matrix modeling metho...
详细信息
An application of extremal optimization algorithm for mapping Java program components on clusters of Java Virtual Machines (JVMs) is presented. Java programs are represented as Directed Acyclic Graphs in which tasks c...
详细信息
ISBN:
(纸本)9783642281501;9783642281518
An application of extremal optimization algorithm for mapping Java program components on clusters of Java Virtual Machines (JVMs) is presented. Java programs are represented as Directed Acyclic Graphs in which tasks correspond to methods of distributed active Java objects that communicate using the RMI mechanism. the presented probabilistic extremal optimization approach is based on the local fitness function composed of two sub-functions in which elimination of delays of task execution after reception of required data and the imbalance of tasks execution in processors are used as heuristics for improvements of extremal optimization solutions. the evolution of an extremal optimization solution is governed by task clustering supported by identification of the dominant path in the graph. the applied task mapping is based on dynamic measurements of current loads of JVMs and inter-JVM communication link bandwidth. the JVM loads are approximated by observation of the average idle time that threads report to the OS. the current link bandwidth is determined by observation of the performed average number of RMI calls per second.
distributed Virtual Environments (DVEs), like Massively Multiplayer Online Games(MMOG) are attracting millions of users from all over the world. However, as the number of simultaneous users keeps growing, the current ...
详细信息
the QR algorithm computes the Schur form of a matrix and is by far the most popular approach for solving dense nonsymmetric eigenvalue problems. Multishift and aggressive early deflation (AED) techniques have led to s...
详细信息
ISBN:
(纸本)9783642281501;9783642281518
the QR algorithm computes the Schur form of a matrix and is by far the most popular approach for solving dense nonsymmetric eigenvalue problems. Multishift and aggressive early deflation (AED) techniques have led to significantly more efficient sequential implementations of the QR algorithm during the last decade. More recently, these techniques have been incorporated in a novel parallel QR algorithm on hybrid distributed memory HPC systems. While leading to significant performance improvements, it has turned out that AED may become a computational bottleneck as the number of processors increases. In this paper, we discuss a two-level approach for performing AED in a parallel environment, where the lower level consists of a novel combination of AED withthe pipelined QR algorithm implemented in the ScaLAPACK routine PDLAHQR. Numerical experiments demonstrate that this new implementation further improves the performance of the parallel QR algorithm.
New applications are motivating and informing the design of sensor/actuator networks, and, more broadly, research in cyber-physical systems (CPS). Our knowledge of many physical systems is uncertain, so that sensing a...
详细信息
ISBN:
(纸本)9780769548654;9781467351461
New applications are motivating and informing the design of sensor/actuator networks, and, more broadly, research in cyber-physical systems (CPS). Our knowledge of many physical systems is uncertain, so that sensing and actuation must be mediated by inference of the structure and parameters of physical-system models. One CPS application domain of growing interest is ecological systems, motivated by the need to understand plant survival and growth as a function of genetics, environment, and climate change. For this effort to be successful, we must be able to infer coupled, data-driven predictive models of plant growth dynamics in response to climate drivers that allow incorporation of uncertainty. We are developing an architecture and implementation for precise fine-scale control of irrigation in an array of geographically-distributed outdoor gardens on an elevational gradient of over 1500 m, allowing design of experiments that combine control of temperature and water availability. this paper describes a system architecture and implementation for this class of cyber-eco systems, including sensor/actuator node design, site-level networking, data assimilation, inference, and distributed control. Among its innovations are a modular, parallel-processing node hardware design allowing real-time processing and heterogeneous nodes, energy-aware hardware/software design, and a networking protocol that builds in trade-offs between energy conservation and latency. throughout, we emphasize the changes in system architecture required as missions evolve from sensing-only to sensing, inference, and control. We also describe our developmental implementation of the architecture and its planned deployment. Future extensions will likely add negative control of precipitation using active rain-out shelters and additional plant-level control of air or soil temperature.
the advent of Cloud computing has given to researchers the ability to access resources that satisfy their growing needs, which could not be satisfied by traditional computing resources such as PCs and locally managed ...
详细信息
ISBN:
(纸本)9783642328206
the advent of Cloud computing has given to researchers the ability to access resources that satisfy their growing needs, which could not be satisfied by traditional computing resources such as PCs and locally managed clusters. On the other side, such ability, has opened new challenges for the execution of their computational work and the managing of massive amounts of data into resources provided by different private and public infrastructures. COMP Superscalar (COMPSs) is a programming framework that provides a programming model and a runtime that ease the development of applications for distributed environments and their execution on a wide range of computational infrastructures. COMPSs has been recently extended in order to be interoperable with several cloud technologies like Amazon, OpenNebula, Emotive and other OCCI compliant offerings. this paper presents the extensions of this interoperability layer to support the execution of COMPSs applications into the Windows Azure Platform. the framework has been evaluated through the porting of a data mining workflow to COMPSs and the execution on an hybrid testbed.
Petascale plasma physics simulations have recently entered the regime of simulating trillions of particles. these unprecedented simulations generate massive amounts of data, posing significant challenges in storage, a...
详细信息
ISBN:
(纸本)9781467308052;9781467308045
Petascale plasma physics simulations have recently entered the regime of simulating trillions of particles. these unprecedented simulations generate massive amounts of data, posing significant challenges in storage, analysis, and visualization. In this paper, we present parallel I/O, analysis, and visualization results from a VPIC trillion particle simulation running on 120,000 cores, which produces similar to 30TB of data for a single timestep. We demonstrate the successful application of H5Part, a particle data extension of parallel HDF5, for writing the dataset at a significant fraction of system peak I/O rates. To enable efficient analysis, we develop hybrid parallel FastQuery to index and query data using multi-core CPUs on distributed memory hardware. We show good scalability results for the FastQuery implementation using up to 10,000 cores. Finally, we apply this indexing/query-driven approach to facilitate the first-ever analysis and visualization of the trillion particle dataset.
暂无评论