Matrix multiplication is an essential building block of many linear algebra operations and applications. this paper presents parallelalgorithms with shared A or B matrix in the memory for the special massively multit...
详细信息
this article introduces a C++ template library dedicated at vectorizing algorithms for different target architectures: Multi-Target parallel Skeleton (MTPS). Skeletons describing the data structures and algorithms are...
详细信息
Erasure codes can improve the availability of distributed storage in comparison with replication systems. In this paper, we focus on investigating how to map systematically the Reed-Solomon and Cauchy Reed-Solomon era...
详细信息
ISBN:
(纸本)9783642281440;9783642281457
Erasure codes can improve the availability of distributed storage in comparison with replication systems. In this paper, we focus on investigating how to map systematically the Reed-Solomon and Cauchy Reed-Solomon erasure codes onto the Cell/B.E. and GPU multicore architecture. A method for the systematic mapping of computation kernels of encoding/decoding algorithms onto the Cell/B.E. architecture is proposed. this method takes into account properties of the architecture on all three levels of its parallelprocessing hierarchy. the performance results are shown to be very promising. the possibility of using GPUs is studied as well, based on the Cauchy version of Reed-Solomon codes.
In order to take full advantage of multi-core resources to enhance the parallel performance, we study the architecture of multi-core processor and point out that the heterogeneous multi-core processor is the mainstrea...
详细信息
the study proposes an automated procedure for linking an identification algorithm implemented in a general-purpose environment (MatLab) with a commercial finite-element code for magnetic field analysis (VF-Opera). the...
详细信息
the study proposes an automated procedure for linking an identification algorithm implemented in a general-purpose environment (MatLab) with a commercial finite-element code for magnetic field analysis (VF-Opera). the method is used to identify the B-H curves of anisotropic magnetic laminations in the direction normal to the sheet surface withthe aid of a multiprocessor computer to benefit from parallel computing. the proposed approach, based on a non-deterministic algorithm, makes it possible to determine the entire B-H curve and is therefore an advancement on the commonly used assumption of constant permeability.
Next Generation Sequencing (NGS) is gaining interests due to the increased requirements and the decreased sequencing cost. the important and prerequisite step of most NGS applications is the mapping of short sequences...
详细信息
ISBN:
(纸本)9780769546766
Next Generation Sequencing (NGS) is gaining interests due to the increased requirements and the decreased sequencing cost. the important and prerequisite step of most NGS applications is the mapping of short sequences, called reads, to the template reference sequences. Boththe explosion of NGS data with over billions of reads generated each day and the data-intensive computations pose great challenges to the capability of existing computing systems. In this paper, we take a hash-index based algorithm (PerM) as an example to investigate the optimization approaches for accelerating NGS reads mapping on multi-core architectures. First, we propose a new parallel algorithm that reorders bucket access in hash index among multiple threads so that data locality in shared cache is improved. Second, in order to reduce the number of empty hash bucket, we propose a serialized hash index compression algorithm, which coincides withthe sequential access nature of our new parallel algorithm. With reduced hash index size, it also becomes possible for us to use longer hash keys, which alleviates the hash conflicts and improves the query performance. Our experiment on an 8-socket 8-cores Intel Xeon X7550 SMP with 128 GB memory shows that the new parallel algorithm reduces LLC miss ratio to be 8% similar to 15% of the original algorithm and the overall performance is improved by 4 similar to 11 times (6 times avg.).
In this paper, a new parallel phase algorithm for parallel turbo decoder is proposed. Traditional sliding window turbo algorithm exchanges extrinsic information phase by phase, it will induce long decoding latency. th...
详细信息
the proceedings contain 32 papers. the topics discussed include: the impact of crowdsourcing post-editing withthe collaborative translation framework;translation of quantifiers in Japanese-Chinese machine translation...
ISBN:
(纸本)9783642339820
the proceedings contain 32 papers. the topics discussed include: the impact of crowdsourcing post-editing withthe collaborative translation framework;translation of quantifiers in Japanese-Chinese machine translation;toward practical use of machine translation;phrase-level pattern-based machine translation based on analogical mapping method;parallel texts extraction from multimodal comparable corpora;a reliable communication system to maximize the communication quality;exploiting a web-based encyclopedia as a knowledge base for the extraction of multilingual terminology;segmenting long sentence pairs to improve word alignment in English-Hindi parallel corpora;shallow syntactic preprocessing for statistical machine translation;linguistic rules based approach for automatic restoration of accents on French texts;word clustering for Persian statistical parsing;and building a lexically and semantically-rich resource for paraphrase processing.
We show that developing an optimal parallelization of the two-list algorithm is much easier than we once thought. All it takes is to observe that the steps of the search phase of the two-list algorithm are closely rel...
详细信息
In this paper, we introduce a novel parallel corpus of music and lyrics, annotated with emotions at line level. We first describe the corpus, consisting of 100 popular songs, each of them including a music component, ...
详细信息
ISBN:
(纸本)9782951740877
In this paper, we introduce a novel parallel corpus of music and lyrics, annotated with emotions at line level. We first describe the corpus, consisting of 100 popular songs, each of them including a music component, provided in the MIDI format, as well as a lyrics component, made available as raw text. We then describe our work on enhancing this corpus with emotion annotations using crowdsourcing. We also present some initial experiments on emotion classification using the music and the lyrics representations of the songs, which lead to encouraging results, thus demonstrating the promise of using joint music-lyric models for song processing.
暂无评论