In recent years, high-performance computing has undergone a deep transformation. In this paper, we review the state of parallel computation, with detailed discussion of current and future research issues in parallel architectures and compilation methods, instruction-level parallelism, and optimization methods that improve the performance of the memory hierarchy.
Myocardial ischemia is usually identified by echocardiography using qualitative analysis of regional left ventricular (LV) endocardial excursion. This is often hampered by sub-optimal endocardial visualization and a lack of quantitative standards and measurement techniques. Ischemia changes the temporal pattern of LV contraction, and a method for quantifying changes in this temporal sequence would have clinical applicability. The authors developed and tested a computer-assisted technique using Synthetic M-mode (SMM) to measure the temporal variations in regional LV contraction. Using an LV short-axis image, 6 SMM images were created through a user-defined LV center. The time to peak (t) contraction in 6 LV segments was then measured using the R-wave as a timing reference point. In the authors' study of 15 closed-chest dogs with induced regional ischemia, t was significantly prolonged only in those LV segments subtended by the occluded artery. Temporal SMM is a promising technique for defining temporal contraction patterns indicative of regional ischemia.
ISBN: 9780897919012 (print)
Deeply pipelined, superscalar processors require accurate branch prediction to achieve high performance. Two-level branch predictors have been shown to achieve high prediction accuracy. It has also been shown that branch interference is a major contributor to the number of branches mispredicted by two-level predictors. This paper presents a new method, called agree prediction, that reduces the interference problem by lowering the chance that two branches aliasing to the same PHT entry will interfere negatively. We evaluate the performance of this scheme using full traces (both user and supervisor) of the SPECint95 benchmarks. The result is a reduction in the misprediction rate of gcc ranging from 8.62% with a 64K-entry PHT up to 33.3% with a 1K-entry PHT.
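The idea behind agree prediction can be illustrated with a minimal sketch. The abstract does not specify the indexing function or how the per-branch biasing bit is chosen, so the code below assumes gshare-style indexing (branch address XOR global history) and a biasing bit set to the branch's first observed outcome; the class and parameter names are hypothetical. The PHT counters record whether a branch agrees with its biasing bit rather than whether it is taken, so two aliasing branches with opposite directions can both push the shared counter toward "agree".

```python
class AgreePredictor:
    """Illustrative agree predictor (assumed design, not the paper's exact one)."""

    def __init__(self, pht_bits=10, history_bits=10):
        self.pht_size = 1 << pht_bits
        # 2-bit saturating counters; values >= 2 mean "agree with the bias".
        self.pht = [2] * self.pht_size
        self.history = 0
        self.history_mask = (1 << history_bits) - 1
        # Per-branch biasing bit (assumption: set to the first outcome seen).
        self.bias = {}

    def _index(self, pc):
        # Assumed gshare-style index: branch address XOR global history.
        return (pc ^ self.history) % self.pht_size

    def predict(self, pc):
        bias = self.bias.get(pc, True)  # assumption: unseen branches biased taken
        agree = self.pht[self._index(pc)] >= 2
        return bias if agree else not bias

    def update(self, pc, taken):
        bias = self.bias.setdefault(pc, taken)
        i = self._index(pc)
        if taken == bias:
            self.pht[i] = min(3, self.pht[i] + 1)  # outcome agreed with bias
        else:
            self.pht[i] = max(0, self.pht[i] - 1)  # outcome disagreed
        # Shift the outcome into the global history register.
        self.history = ((self.history << 1) | int(taken)) & self.history_mask


# Two branches with opposite behavior that alias to the same PHT entry.
p = AgreePredictor(pht_bits=2, history_bits=0)
for _ in range(4):
    p.update(0, True)    # branch at address 0: always taken
    p.update(4, False)   # branch at address 4: never taken, same PHT index
```

In a conventional two-level predictor these two branches would drive the shared counter in opposite directions; here both reinforce the same "agree" state, which is the negative interference the abstract describes avoiding.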
ISBN: 081868271X (print)
An underlying assumption of the N-version programming technique is that independently developed versions fail in a statistically independent manner. However, empirical studies have demonstrated that common mode failures can occur even for independently developed versions, and that common mode failures degrade system reliability. In this paper, we demonstrate that weakest precondition analysis is effective in determining the input spaces that lead to common mode failures. We applied the weakest precondition to the Launch Interceptor Programs, which were used in several other experiments related to the N-version programming technique. We detected 13 out of 18 fault pairs that are known to cause common mode failures. These faults were due to logical flaws in program design. Although weakest precondition analysis may be labor-intensive because it is applied manually, our results convincingly demonstrate that it is effective for identifying input spaces causing common mode failures and for further improving the reliability of N-version software.
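For orientation, the standard weakest precondition rules (Dijkstra's calculus) are summarized below. The last line is an illustrative formulation, not taken from the abstract: it assumes two versions V_1 and V_2 with hypothetical failure conditions F_1 and F_2 on their final states, and characterizes the common mode failure region as the set of inputs from which both versions are guaranteed to end in a failing state.

```latex
\begin{align*}
  wp(x := e,\; Q) &= Q[e/x] \\
  wp(S_1; S_2,\; Q) &= wp(S_1,\; wp(S_2,\; Q)) \\
  wp(\textbf{if } B \textbf{ then } S_1 \textbf{ else } S_2,\; Q)
    &= (B \Rightarrow wp(S_1, Q)) \land (\lnot B \Rightarrow wp(S_2, Q)) \\[4pt]
  % Assumed formulation: inputs on which both versions fail together.
  \mathit{CommonModeInputs} &= wp(V_1, F_1) \land wp(V_2, F_2)
\end{align*}
```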
This paper describes several loop splitting methods for exploiting parallelism from single loops, and also proposes a generalized and optimal loop transformation technique for exploiting parallelism from single loops with nonuniform dependencies. The proposed algorithm partitions a serial loop using the size of the dependence distance, which varies between different instances of the dependence. It outperforms the two methods proposed by C.D. Polychronopoulos (see Compiler optimizations for enhancing parallelism and their impact on architecture design, IEEE Trans. Comput., vol. 37, no. 8, pp. 991-1004, 1988).
ISBN: 0818679190 (print)
We propose a stochastic lexicon model that represents pronunciation variations so as to work optimally with a continuous speech recognizer. In this lexicon model, the baseforms of words are represented as a hidden Markov model, by subword states and the probability distribution of subwords. The proposed approach can also be applied to a system employing non-linguistic recognition units, and the lexicon is trained automatically from training utterances. In speaker-independent speech recognition tests using a 3000-word continuous speech database, the proposed system improves the word accuracy by about 27.8% and the sentence accuracy by about 22.4%.
FeRAM (Ferroelectric Random Access Memory) is a promising nonvolatile memory device. This paper presents a dual plane FeRAM (DFeRAM) architecture for recovery in a shared disk parallel database system, together with a recovery algorithm based on a shadow paging method that uses the DFeRAM architecture. We compare the performance of the log-based method with that of our method using simulation results, which show that our method outperforms the log-based method when both use nonvolatile memory.
The draft international standard ITU-T H.263 is closely related to the well-known and widely used ITU-T Recommendation H.261. However, H.263 provides the same subjective image quality at less than half the bit-rate. In this paper we investigate to what extent individual enhancements of H.263 contribute to this performance gain, and consider the trade-off between quality and complexity. Based on typical test sequences, H.263 with its various modes is compared to H.261 on the basis of rate distortion curves at bit-rates up to 128 kbps. At 64 kbps, the performance gain of H.263 in its default mode compared to H.261 is approximately 2 dB. This improvement is achieved with only a small increase in complexity, and is mainly due to more accurate motion compensation with half-pel accuracy. Considering the trade-off between quality and complexity, the combination of the optional coding modes "Advanced prediction mode" and "PB-frames mode" is a good compromise, resulting in an additional performance gain of 1.5 dB PSNR at 64 kbps. The "Syntax-based arithmetic coding mode", on the other hand, offers only a very small performance gain (0.1 dB at 64 kbps) for its increased computational complexity. Results from profiling an H.263 software codec are presented to support the complexity considerations for the optional coding modes.
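The gains above are quoted in dB of PSNR. As a reminder of what that unit measures, here is a minimal sketch of the standard PSNR definition, 10·log10(peak² / MSE). It assumes 8-bit samples (peak value 255) passed as flat sequences; the abstract does not say which picture components were measured, and the function names are hypothetical.

```python
import math


def psnr(reference, decoded, peak=255):
    """Peak signal-to-noise ratio in dB between two equal-length sample sequences."""
    if len(reference) != len(decoded) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error between the original and the decoded samples.
    mse = sum((r - d) ** 2 for r, d in zip(reference, decoded)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals: PSNR is unbounded
    return 10 * math.log10(peak ** 2 / mse)


def mse_ratio_for_gain(gain_db):
    """Factor by which MSE shrinks for a given PSNR gain in dB."""
    return 10 ** (gain_db / 10)
```

Because PSNR is logarithmic, a gain of g dB at a fixed bit-rate corresponds to the mean squared error shrinking by a factor of 10^(g/10), which `mse_ratio_for_gain` computes.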
Scaling accounts for a kind of dependence between the local (microscopic, or very small) and global (macroscopic) properties of the system in question. Fractal dimension and its generalizations, as critical exponents in a power-law scaling, have been chosen as a tool for reconsidering mass transport through membranes and the process of casting (DLA membranes). The passage from diffusion on prototype structures (e.g. the Sierpinski gasket) to diffusion on real structures is also shown. Copyright (C) 1996 Elsevier Science Ltd