Search results - Inner Mongolia University Library

BPL-PLC Voice Communication System for the Oil and Mining Industry

Energies, 2020, vol. 13, no. 18, p. 4763

Authors: Debita, Grzegorz; Falkowski-Gilski, Przemyslaw; Habrych, Marcin; Wisniewski, Grzegorz; Miedzinski, Bogdan; Jedlikowski, Przemyslaw; Waniewska, Agnieszka; Wandzio, Jan; Polnik, Bartosz

Affiliations: Gen Tadeusz Kosciuszko Mil Univ Land Forces, Fac Econ, Czajkowskiego St 109, PL-51147 Wroclaw, Poland; Gdansk Univ Technol, Fac Elect Telecommun & Informat, Narutowicza St 11-12, PL-80233 Gdansk, Poland; Wroclaw Univ Sci & Technol, Fac Elect Engn, Wybrzeze Wyspianskiego St 27, PL-50370 Wroclaw, Poland; Wroclaw Univ Sci & Technol, Fac Elect, Wybrzeze Wyspianskiego St 27, PL-50370 Wroclaw, Poland; KGHM Polska Miedz SA, Sklodowskiej Curie St 48, PL-59301 Lubin, Poland; KOMAG Inst Min Technol, Pszczynska St 37, PL-44101 Gliwice, Poland

Application of a high-efficiency voice communication system based on broadband over power line-power line communication (BPL-PLC) technology in medium voltage networks, including hazardous areas (such as the oil and mining industry), as a redundant means of wired communication (apart from traditional fiber optics and electrical wires) can be beneficial. Because it can use existing electrical infrastructure, it can significantly reduce deployment costs. Additionally, it can be applied under difficult conditions, thanks to battery-powered devices. During an emergency situation (e.g., after a coal dust explosion), the medium voltage cables are resistant to mechanical damage, providing a potentially life-saving communication link between the supervisor, rescue team, paramedics, and the trapped personnel. The assessment of such a system requires a comprehensive and accurate examination that takes a number of factors into account. Therefore, various models were tested, considering different transmission paths and types of coupling (inductive and capacitive), as well as various lengths of transmitted data packets. Next, a subjective quality evaluation study was carried out, considering speech signals from a number of languages (English, German, and Polish). Based on the obtained results, including both simulations and measurements, appropriate practical conclusions were formulated. Results confirmed the applicability of BPL-PLC technology as an efficient voice communication system for the oil and mining industry.

Keywords: audio coding digital systems electrical engineering ICT Industry 4.0 IoT power cable QoS reliability voice communication

Towards a Perceptual Loss: Using a Neural Network Codec Approximation as a Loss for Generative Audio Models

27th ACM International Conference on Multimedia (MM)

Authors: Ananthabhotla, Ishwarya; Ewert, Sebastian; Paradiso, Joseph A.

Affiliations: MIT Media Lab, Cambridge, MA 02139, USA; Spotify Inc, London, England

ISBN: (print) 9781450368896

Generative audio models based on neural networks have led to considerable improvements across fields including speech enhancement, source separation, and text-to-speech synthesis. These systems are typically trained in a supervised fashion using simple element-wise L1 or L2 losses. However, because they do not capture properties of the human auditory system, such losses encourage modelling perceptually meaningless aspects of the output, wasting capacity and limiting performance. Additionally, while adversarial models have been employed to encourage outputs that are statistically indistinguishable from ground truth and have resulted in improvements in this regard, such losses do not need to explicitly model perception as their task; furthermore, training adversarial networks remains an unstable and slow process. In this work, we investigate an idea fundamentally rooted in psychoacoustics. We train a neural network to emulate an MP3 codec as a differentiable function. Feeding the output of a generative model through this MP3 function, we remove signal components that are perceptually irrelevant before computing a loss. To further stabilize gradient propagation, we employ intermediate layer outputs to define our loss, as found useful in image domain methods. Our experiments using an autoencoding task show an improvement over standard losses in listening tests, indicating the potential of psychoacoustically motivated models for audio generation.

Keywords: perceptual loss function perception neural networks audio audio coding

On the Efficiency Difference Between Range and Huffman Coding on CELT Layer of Opus Audio Coder

IEEE International Conference on Consumer Electronics-Taiwan (ICCE-TW)

Authors: Shingchern D. You, Po-Yueh Lai

Affiliation: National Taipei University of Technology, Taipei, Taiwan

This paper compares the coding efficiency of the range coder in the Opus coder with that of the Huffman coder used in MP3 (MPEG-1 Layer 3) and MPEG-2 AAC. The results show that the range coder has an efficiency advantage of about 9% at a rate of 128 kbps. The simulation, in a sense, indicates that transcoding from the Opus format to the MP3 or AAC format will lead to quality degradation.

Keywords: Speech coding Bit rate Transform coding Transcoding audio coding ISO Standards
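As a rough illustration of why a range coder can beat a Huffman coder, the sketch below compares the average Huffman code length with the Shannon entropy for a skewed source. The four-symbol distribution is an invented toy example, not data from the paper, and the result says nothing about the 9% figure reported in the abstract. Huffman codes spend at least one whole bit per symbol, whereas a range coder can approach the entropy.

```python
import heapq
import math


def huffman_code_lengths(probs):
    """Return the Huffman code length in bits for each symbol probability."""
    # Heap entries: (probability, unique tie breaker, symbol indices in subtree).
    heap = [(p, i, [i]) for i, p in enumerate(probs)]
    heapq.heapify(heap)
    lengths = [0] * len(probs)
    counter = len(probs)
    while len(heap) > 1:
        p1, _, s1 = heapq.heappop(heap)
        p2, _, s2 = heapq.heappop(heap)
        # Every symbol under the merged node gets one bit deeper.
        for s in s1 + s2:
            lengths[s] += 1
        heapq.heappush(heap, (p1 + p2, counter, s1 + s2))
        counter += 1
    return lengths


def entropy_bits(probs):
    """Shannon entropy in bits per symbol."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


# Hypothetical, highly skewed source (assumption for illustration only).
probs = [0.9, 0.05, 0.03, 0.02]
lengths = huffman_code_lengths(probs)
avg_huffman = sum(p * n for p, n in zip(probs, lengths))
h = entropy_bits(probs)

print("Huffman code lengths:", lengths)
print("Average Huffman length: %.3f bits/symbol" % avg_huffman)
print("Entropy (lower bound):  %.3f bits/symbol" % h)
```

For this toy source the Huffman code needs about 1.15 bits per symbol while the entropy is about 0.62 bits, which is the kind of gap an arithmetic or range coder can close.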

GMM-Based Iterative Entropy Coding for Spectral Envelopes of Speech and Audio

IEEE International Conference on Acoustics, Speech and Signal Processing

Authors: Srikanth Korse, Guillaume Fuchs, Tom Backstrom

Affiliations: Fraunhofer IIS, Erlangen, Germany; Aalto University, Helsinki, Finland

ISBN: (print) 9781538646595

Spectral envelope modelling is a central part of speech and audio codecs and is traditionally based on either vector quantization or scalar quantization followed by entropy coding. To combine the coding performance of vector quantization with the low complexity of the scalar case, we propose an iterative approach for entropy coding the spectral envelope parameters. For each parameter, a univariate probability distribution is derived from a Gaussian mixture model of the joint distribution, with the previously quantized parameters used as a priori information. Parameters are then iteratively and individually scalar quantized and entropy coded. Unlike vector quantization, the complexity of the proposed method does not increase exponentially with dimension and bitrate. Moreover, the coding resolution and dimension can be adaptively modified without retraining the model. Experimental results show that these important advantages do not impair coding efficiency compared to a state-of-the-art vector quantization scheme.

Keywords: Entropy coding Gaussian mixture models Envelope Modelling Speech coding audio coding Entropy coding audio coding Vector quantization audio Speech Mixture models envelopes entropy speech coding scalar quantization dimensions ENVELOPE
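The step of deriving a univariate distribution from a joint Gaussian mixture, as the abstract above describes, can be sketched for the simplest case of two parameters. This is a minimal illustration using the standard Gaussian conditioning formulas, not the authors' implementation; the function names and the example mixture are assumptions made for this sketch.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a univariate Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2 * math.pi * var)


def conditional_mixture(components, x1):
    """Condition a 2-D Gaussian mixture on its first coordinate.

    components: list of (weight, (m1, m2), (v11, v12, v22)) tuples.
    Returns a list of (weight, mean, variance) describing p(x2 | x1).
    """
    cond = []
    for w, (m1, m2), (v11, v12, v22) in components:
        # Unnormalised responsibility of this component for the observed x1.
        resp = w * gaussian_pdf(x1, m1, v11)
        # Standard conditional mean and variance of a bivariate Gaussian.
        mean = m2 + v12 / v11 * (x1 - m1)
        var = v22 - v12 * v12 / v11
        cond.append((resp, mean, var))
    total = sum(r for r, _, _ in cond)
    return [(r / total, m, v) for r, m, v in cond]


# Hypothetical two-component mixture (assumption for illustration only).
mixture = [
    (0.5, (0.0, 0.0), (1.0, 0.5, 1.0)),
    (0.5, (4.0, 4.0), (1.0, 0.5, 1.0)),
]
# x1 plays the role of an already quantized parameter.
result = conditional_mixture(mixture, x1=0.0)
for weight, mean, var in result:
    print("weight=%.4f mean=%.2f var=%.2f" % (weight, mean, var))
```

In a codec along these lines, the resulting univariate mixture would supply the probabilities for entropy coding the next scalar-quantized parameter; that stage is omitted here.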

Predictive Vector Quantized Variational AutoEncoder for Spectral Envelope Quantization

International Conference on Electronics, Information and Communications (ICEIC)

Authors: Tanasan Srikotr, Kazunori Mano

Affiliation: Division of Functional Control System, Graduate School of Engineering and Science, Shibaura Institute of Technology, Japan

ISBN: (digital) 9781728162898

ISBN: (print) 9781728162904

The Predictive Vector Quantized Variational AutoEncoder is proposed to reduce the reconstruction error of the conventional VQ-VAE. The proposed model can predict the current data from the previous data. The performance is evaluated on the quantized spectral envelope parameters of the high-quality 48 kHz WORLD vocoder. The results indicate that the Predictive Vector Quantized Variational AutoEncoder has a lower distortion at four target bitrates in terms of log-spectral distortion, compared with the conventional VQ-VAE.

Keywords: audio coding distortion spectral analysis vector quantisation vocoders Vocoders Vector quantization audio coding Spectrum Analysis distortion reconstruction error abnormal shapes envelopes Predictive Current data
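The abstract above reports results in terms of log-spectral distortion. One common definition is the root-mean-square difference, in dB, between the reference and reconstructed power spectra; the sketch below assumes that definition, which may differ in detail from the one used in the paper. The function name and the `eps` guard are choices made for this illustration.

```python
import math


def log_spectral_distortion(ref_power, est_power, eps=1e-12):
    """RMS difference in dB between two power spectra of equal length."""
    if len(ref_power) != len(est_power):
        raise ValueError("spectra must have the same number of bins")
    squared = [
        # eps avoids log10(0) for empty bins.
        (10 * math.log10(r + eps) - 10 * math.log10(e + eps)) ** 2
        for r, e in zip(ref_power, est_power)
    ]
    return math.sqrt(sum(squared) / len(squared))


# Toy spectra (assumption): the estimate is 10 times the reference power,
# so every bin differs by 10 dB.
reference = [1.0, 2.0, 4.0]
estimate = [10.0, 20.0, 40.0]
lsd_same = log_spectral_distortion(reference, reference)
lsd_scaled = log_spectral_distortion(reference, estimate)
print("identical spectra: %.2f dB" % lsd_same)
print("10x power offset:  %.2f dB" % lsd_scaled)
```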

CODING OF FINE GRANULAR AUDIO SIGNALS USING HIGH RESOLUTION ENVELOPE PROCESSING (HREP)

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: Ghido, Florin; Disch, Sascha; Herre, Juergen; Reutelhuber, Franz; Adami, Alexander

Affiliations: Fraunhofer Inst Integrierte Schaltungen IIS, Wolfsmantel 33, D-91058 Erlangen, Germany; Int Audio Labs Erlangen, Wolfsmantel 33, D-91058 Erlangen, Germany

ISBN: (print) 9781509041176

High Resolution Envelope Processing (HREP) is a new tool for improved perceptual coding of audio signals that predominantly consist of many dense transient events, such as applause or raindrop sounds. These signals have traditionally been very difficult for perceptual audio codecs to code, particularly at low bit rates. Based on the gain control principle, HREP acts as a pre-/post-processor pair to perceptual audio codecs and preserves the temporal fine structure and subjective quality of applause-like signals. Subjective tests have shown a significant improvement in audio quality of around 12 MUSHRA points by HREP processing at 48 kbps stereo when used together with an MPEG-H 3D Audio codec. The new coding tool has been adopted as part of MPEG-H 3D Audio Second Edition.

Keywords: audio coding Gain Control Envelope Simultaneous Masking Applause

COMPASS: CODING AND MULTIDIRECTIONAL PARAMETERIZATION OF AMBISONIC SOUND SCENES

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Authors: Politis, Archontis; Tervo, Sakari; Pulkki, Ville

Affiliation: Aalto Univ, Sch Elect Engn, Dept Signal Proc & Acoust, Espoo 02150, Finland

ISBN: (print) 9781538646588

Current methods for immersive playback of spatial sound content aim at flexibility in terms of encoding and decoding, abstracting the two from the recording or playback setup. Ambisonics is one such method. It is, however, signal-independent, and at low spatial resolutions it fails to provide appropriate spatialization cues to the listener, with potentially severe colouration effects and localization ambiguity. We present a new signal-dependent method for parametric analysis and synthesis of ambisonic sound scenes that takes advantage of the flexibility of Ambisonics as a spatial audio format, while improving reproduction. The proposed approach considers a more general acoustic model than previous proposals, with multiple source signals and a non-isotropic ambient component. According to a listening test using headphones, the method is perceived as closer to binaural reference sound scenes than ambisonic playback.

Keywords: spatial audio acoustic scene analysis audio coding Ambisonics

Ultra-low latency audio coding based on DPCM and block companding

International Workshop on Image Analysis for Multimedia Interactive Services, WIAMIS

Authors: Gediminas Simkus, Martin Halters, Udo Zölzer

Affiliations: Department of Signal Processing and Communications, University of the Federal Armed Forces, Hamburg, Germany; Helmut-Schmidt-Universitat, Universitat der Bundeswehr Hamburg, Hamburg, DE

A low-delay audio coding scheme with good perceptual audio quality for a desired limited bit rate is presented. The proposed audio coding scheme is based on differential pulse code modulation (DPCM) and block companded (BC) quantization. Prediction is realized as an FIR filter in lattice structure. The DPCM operates in a feedback manner, so no transmission of prediction filter coefficients is needed. The incorporation of BC quantization in the DPCM relies on a prediction error recalculation scheme. The use of BC quantization in the DPCM allows the prediction error signal to be followed accurately. This improves the perceptual audio quality significantly compared to a plain DPCM with an adaptive quantizer. The short fixed block length of the BC quantizer introduces an algorithmic delay below half a millisecond and an overhead of less than half a bit per sample. Therefore, a real-time bidirectional audio application is achievable.

Keywords: Quantization (signal) Delays Lattices Decoding audio coding Bit rate
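The feedback principle mentioned in the abstract above, where the decoder mirrors the encoder's predictor so that no predictor side information has to be sent, can be sketched with a deliberately simplified DPCM loop. This sketch assumes a trivial first-order predictor (the previous reconstructed sample) and a fixed uniform quantizer; it does not implement the lattice FIR predictor or the block companded quantizer of the paper, and all names are hypothetical.

```python
def dpcm_encode(samples, step):
    """Closed-loop DPCM: quantize the error against the reconstructed signal."""
    indices = []
    prediction = 0.0
    for x in samples:
        error = x - prediction
        q = round(error / step)  # uniform quantizer index
        indices.append(q)
        # Track the decoder's reconstruction so both sides stay in sync.
        prediction = prediction + q * step
    return indices


def dpcm_decode(indices, step):
    """Rebuild the signal from quantizer indices with the same predictor."""
    output = []
    prediction = 0.0
    for q in indices:
        reconstructed = prediction + q * step
        output.append(reconstructed)
        prediction = reconstructed
    return output


# Toy input signal and step size (assumptions for illustration only).
signal = [0.0, 0.3, 0.55, 0.4, -0.1, -0.62]
step = 0.1
codes = dpcm_encode(signal, step)
decoded = dpcm_decode(codes, step)
max_error = max(abs(a - b) for a, b in zip(signal, decoded))
print("indices:", codes)
print("max reconstruction error: %.3f" % max_error)
```

Because the encoder predicts from reconstructed rather than original samples, the quantization error does not accumulate and stays within half a quantizer step per sample.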

A NOVEL SCALABLE AUDIO CODING SCHEME

IEEE International Conference on Acoustics, Speech, and Signal Processing

Authors: Huan Zhou, Haiyan Shu, Rongshan Yu, Haibin Huang, Susanto Rahardja

Affiliation: Signal Processing Department, Institute for InfoComm Research, Singapore

ISBN: (print) 9781479903573

A new scalable audio coding scheme is introduced in this paper. Its core idea is to create one additional scalability dimension during the encoding process in order to generate a plurality of scalable sub-bitstreams. Based on the multiple sub-streams, a smart truncator is designed that can truncate these sub-bitstreams with an optimal rate-distortion (R-D) trade-off. Benefiting from the flexible R-D trade-off, the proposed scheme could, within a wide bitrate range, outperform traditional scalable coding schemes, which usually provide a fixed R-D relationship designed at a specified bitrate. To verify the performance, the proposed scheme is further implemented on top of a prior-art scalable audio codec. Significant quality improvement from the new codec is observed in a series of subjective listening tests.

Keywords: audio coding Scalability audio Bit rate Quality Improvement dimensional relationship tradeoffs Scheme Scalable coding

Scalable audio coding using watermarking

IEEE International Conference on Multimedia and Expo (ICME)

Authors: Mahmood Movassagh, Peter Kabal

Affiliation: Department of Electrical and Computer Engineering, McGill University, Montreal, Canada

ISBN: (print) 9781479900145

A scalable audio coding method is proposed using a technique, Quantization Index Modulation, borrowed from watermarking. Some of the information of each layer output is embedded (watermarked) in the previous layer. This approach leads to a saving in bitrate while keeping the distortion almost unchanged, which makes the scalable coding system more efficient in rate-distortion terms. The results show that the proposed method outperforms scalable audio coding based on reconstruction error quantization, which is used in practical systems such as MPEG-4 AAC.

Keywords: Quantization (signal) Entropy Bit rate Watermarking Indexes audio coding
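Quantization Index Modulation, named in the abstract above, can be illustrated in its basic scalar form: one bit selects which of two interleaved uniform quantizers is applied to a value, and the receiver recovers the bit by checking which quantizer lattice the received value is closer to. This is a generic textbook sketch, not the layered coder of the paper; the function names and the step size are assumptions.

```python
def qim_embed(x, bit, step):
    """Quantize x on the lattice selected by bit (0 or 1)."""
    # The bit-1 lattice is the bit-0 lattice shifted by half a step.
    offset = bit * step / 2.0
    return round((x - offset) / step) * step + offset


def qim_extract(y, step):
    """Recover the embedded bit as the index of the nearer lattice."""
    d0 = abs(y - qim_embed(y, 0, step))
    d1 = abs(y - qim_embed(y, 1, step))
    return 0 if d0 <= d1 else 1


# Toy host values and payload bits (assumptions for illustration only).
step = 0.5
host = [0.12, -0.7, 1.33, 2.05]
payload = [1, 0, 1, 1]
marked = [qim_embed(x, b, step) for x, b in zip(host, payload)]
recovered = [qim_extract(y, step) for y in marked]
print("marked values:", marked)
print("recovered bits:", recovered)
```

Embedding moves each value by at most half a step, so the hidden bit costs a bounded amount of extra distortion, which is the trade-off the abstract refers to when it says distortion stays almost unchanged.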
