The implementation of a complex, large vocabulary, speech recognition application on a modern graphic processors (GPUs) is presented. The parallelsingleinstruction, multipledata (SIMD) architecture is effectively e...
详细信息
ISBN:
(纸本)9781457704345
The implementation of a complex, large vocabulary, speech recognition application on a modern graphic processors (GPUs) is presented. The parallelsingleinstruction, multipledata (SIMD) architecture is effectively exploited by performing various optimizations to expose the algorithmic parallelism. The work addresses particularly the realization of the Gaussian calculation, a key function. The result is an implementation that runs 3.75 faster than real-time and gives a tenfold speedup when compared to a highly optimized sequential CPU-based implementation. The work is also compared with some earlier work involved in building the same system on a Virtex 5-based, Alpha data XRC-5T1 reconfigurable computer.
暂无评论