The understanding of biological processes, e.g. those related to cardiovascular disease and treatment, can be significantly improved by numerical simulation. In this paper, we present an approach for a multiscale simulation environment, applied to the prediction of in-stent restenosis. Our focus is on the coupling of distributed, heterogeneous hardware to take into account the different computing-power requirements of the coupled sub-systems. For such a concept, which is an extension of the standard multiscale computing approach, we want to apply the term Distributed Multiscale Computing.
We prove that the error estimates of a large class of nonconforming finite elements are dominated by their approximation errors, which means that the well-known Céa's lemma is still valid for these nonconforming finite element methods. Furthermore, we derive the error estimates in both energy and $L^2$ norms under the regularity assumption $u \in H^{1+s}(\Omega)$ with any $s > 0$. Extensions to other related problems are possible.
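For context, the classical statements that this abstract refers to can be written as follows. This is standard textbook background, not the paper's own result; the symbols $a(\cdot,\cdot)$, $V_h$, $M$, $\alpha$, $a_h$, $f$ and $\|\cdot\|_h$ are notation assumed here, since the abstract does not fix any. For a conforming space $V_h \subset V$ and a bilinear form that is continuous with constant $M$ and coercive with constant $\alpha$, Céa's lemma bounds the error by the best approximation error:

$$
\|u - u_h\|_V \;\le\; \frac{M}{\alpha}\, \inf_{v_h \in V_h} \|u - v_h\|_V .
$$

For nonconforming elements, the usual starting point is Strang's second lemma, which adds a consistency term:

$$
\|u - u_h\|_h \;\le\; C \left( \inf_{v_h \in V_h} \|u - v_h\|_h \;+\; \sup_{0 \ne w_h \in V_h} \frac{|a_h(u, w_h) - (f, w_h)|}{\|w_h\|_h} \right).
$$

Read against this bound, the abstract's claim is that for the class of elements considered the error is controlled by the approximation term.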
The two-sided rank-one (TR1) update method was introduced by Griewank and Walther (2002) for solving nonlinear equations. It generates dense approximations of the Jacobian and thus is not applicable to large-scale sparse problems. To overcome this difficulty, we propose sparse extensions of the TR1 update and give some convergence analysis. The numerical experiments show that some of our extensions are superior to the TR1 update method.
In this paper, we investigate new active-set-type methods for $\ell_1$-regularized linear regression that overcome some difficulties of existing active set methods. By showing a relationship between $\ell_1$-regularized linear regression and the linear complementarity problem with bounds, we present a fast active-set-type method, called block principal pivoting. This method accelerates computation by allowing exchanges of several variables among working sets. We further provide an improvement of this method, discuss its properties, and also explain a connection to the structure learning of Gaussian graphical models. Experimental comparisons on synthetic and real data sets show that the proposed method is significantly faster than existing active set methods and competitive with recently developed iterative methods.
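As a reminder of the problem being solved, the standard formulation of $\ell_1$-regularized linear regression and its optimality conditions can be sketched as below. The abstract does not state its formulation, so the symbols $A$, $b$, $x$, $\lambda$ and $g$ are assumed notation for the usual textbook form rather than the authors' own.

$$
\min_{x \in \mathbb{R}^n} \; \tfrac{1}{2}\,\|Ax - b\|_2^2 + \lambda \|x\|_1 , \qquad \lambda > 0 .
$$

With $g = A^{\mathsf T}(Ax - b)$, a point $x$ is optimal if and only if, for every index $i$,

$$
g_i = -\lambda\,\operatorname{sign}(x_i) \quad \text{if } x_i \ne 0, \qquad |g_i| \le \lambda \quad \text{if } x_i = 0 .
$$

Conditions of this kind pair each variable with a bound on a gradient component, and active-set-type methods work by deciding which indices belong to which of the two cases.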
ISBN: 9780769545981 (print)
Massively parallel computations consist of a mixture of computation, communication, and I/O. Of these three components, implementing an effective parallel I/O solution has often been overlooked by application scientists and has typically been added to large-scale simulations only when existing serial techniques have failed. As scientists' teams scaled their codes to run on hundreds of processors, it was common to call on an I/O expert to implement a set of more scalable I/O routines. These routines were easily separated from the calculations and communication, and in many cases, an I/O kernel was derived from the application which could be used for testing I/O performance independent of the application. These I/O kernels developed a life of their own and were used as a broad measure for comparing different I/O techniques. Unfortunately, as years passed and computation and communication changes required changes to the I/O, the separate I/O kernel used for benchmarking remained static, no longer providing an accurate indicator of the I/O performance of the simulation, and making I/O research less relevant for the application scientists. In this paper we describe a new approach to this problem where I/O kernels are replaced with skeletal I/O applications that are automatically generated from an abstract set of simulation I/O parameters. We realize this abstraction by leveraging the ADIOS [1] middleware's XML I/O specification with additional runtime parameters. Skeletal applications offer all of the benefits of I/O kernels, including allowing I/O optimizations to focus on useful I/O patterns. Moreover, since they are automatically generated, it is easy to produce an updated I/O skeleton whenever the simulation's I/O changes. In this paper we analyze the performance of automatically generated I/O skeletal applications for the S3D and GTS codes. We show that these skeletal applications achieve performance comparable to that of the production applications. We wrap up the paper with
The multi-symplectic Runge-Kutta (MSRK) methods and multi-symplectic Fourier spectral (MSFS) methods are employed to solve the fourth-order Schrödinger equations with trapped term. Using the idea of the split-step numerical method and the MSRK methods, we devise a new kind of multi-symplectic integrator, called split-step multi-symplectic (SSMS) methods. The numerical experiments show that the proposed SSMS methods are more efficient than the conventional multi-symplectic integrators with respect to numerical accuracy and conservation-preserving properties.
CHIMERA is a multi-dimensional radiation hydrodynamics code designed to study core-collapse supernovae. The code is made up of three essentially independent parts: a hydrodynamics module, a nuclear burning module, and a neutrino transport solver combined within an operator-split approach. Given CHIMERA's complexity and pace of ongoing development, a new support system, Bellerophon, has been designed and implemented to perform automated verification, visualization and management tasks while integrating with other workflow systems utilized by CHIMERA's development group. In order to achieve these goals, a multitier approach has been adopted. By integrating supercomputing platforms, visualization clusters, a dedicated web server and a client-side desktop application, this system attempts to provide an encapsulated, end-to-end solution to these needs.
We present an interactive visual analytics system for classification, iVisClassifier, based on a supervised dimension reduction method, linear discriminant analysis (LDA). Given high-dimensional data and associated cl...
Fast facial points fitting plays an important role in applications such as Human-Computer Interaction, entertainment, surveillance, and is highly relevant to the techniques of facial expression analysis, face recognit...
When large sparse symmetric systems of linear equations are solved by the Cholesky factorization, nonzero elements can be generated at positions where the original matrix contains zero elements. This phenomenon is called fill-in and it is often crucial in large-scale problems. The symbolic Cholesky factorization solely takes into account the nonzero structure of a sparse matrix to determine the nonzero structure of its Cholesky factor. Sequences of elimination graphs are typically used to model this combinatorial problem. We propose an interactive educational module to visualize and explore the symbolic Cholesky factorization in terms of both elimination graphs and matrix representation. We describe the design and implementation of this interactive module that is intended to be used in a face-to-face learning environment.
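The elimination-graph view of symbolic Cholesky factorization described above can be illustrated with a minimal sketch. It is not the educational module from the abstract; the function name `symbolic_cholesky` and the edge-list input are hypothetical choices made for this example. Eliminating vertex k connects all of its not-yet-eliminated neighbours pairwise, and each newly created edge corresponds to one fill-in entry.

```python
def symbolic_cholesky(n, edges):
    """Symbolic Cholesky factorization via elimination graphs.

    n     -- matrix dimension; vertices 0..n-1 are eliminated in this order
    edges -- pairs (i, j), i != j, marking off-diagonal nonzeros of the
             symmetric matrix (hypothetical input format for this sketch)

    Returns (fill, structure): `fill` is the set of fill-in positions
    (a, b) with a < b, and `structure[k]` lists the row indices below the
    diagonal that are nonzero in column k of the Cholesky factor.
    """
    # Adjacency sets of the graph of the original matrix.
    adj = {v: set() for v in range(n)}
    for i, j in edges:
        if i != j:
            adj[i].add(j)
            adj[j].add(i)

    fill = set()
    for k in range(n):
        # Neighbours of k that have not been eliminated yet.
        higher = sorted(v for v in adj[k] if v > k)
        # Eliminating k makes these neighbours pairwise adjacent.
        for pos, a in enumerate(higher):
            for b in higher[pos + 1:]:
                if b not in adj[a]:
                    adj[a].add(b)
                    adj[b].add(a)
                    fill.add((a, b))  # new edge = one fill-in entry

    # Column k of the factor: original plus fill neighbours with larger index.
    structure = {k: sorted(v for v in adj[k] if v > k) for k in range(n)}
    return fill, structure


if __name__ == "__main__":
    # "Arrow" matrix with the dense row/column first: heavy fill-in.
    print(symbolic_cholesky(4, [(0, 1), (0, 2), (0, 3)]))
    # Same matrix with the dense row/column last: no fill-in.
    print(symbolic_cholesky(4, [(0, 3), (1, 3), (2, 3)]))
```

The two calls use the same arrow-shaped matrix under two orderings, which shows how strongly the amount of fill-in depends on the elimination order.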