检索结果-内蒙古大学图书馆

Morphological waveform coding for writer identification

PATTERN RECOGNITION 2000年第3期33卷 385-398页

作者： Zois, EN Anastassopoulos, V Univ Patras Dept Phys Elect Lab Patras 26500 Greece

Writer identification is carried out using handwritten text. The feature vector is derived by means of morphologically processing the horizontal profiles (projection functions) of the words. The projections are derived and processed in segments in order to increase the discrimination efficiency of the Feature vector. Extensive study of the statistical properties of the feature space is provided. Both Bayesian classifiers and neural networks are employed to lest the efficiency of the proposed feature. The achieved identification success using a long word exceeds 95%. (C) 2000 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.

关键词： writer identification person verification morphological features waveform coding

来源：评论

学校读者我要写书评

暂无评论

Complex Autoencoder Approach to Constant Envelope waveform coding 18

Complex Autoencoder Approach to Constant Envelope Waveform C...

引用

IEEE 18th Annual Consumer Communications and Networking Conference (CCNC)

作者： Gorday, Paul Erdol, Nurgun Zhuang, Hanqi Florida Atlantic Univ Dept Comp & Elect Engn & Comp Sci Boca Raton FL 33431 USA

ISBN: (纸本)9781728197944

This paper proposes a new complex autoencoder suitable for learning spectrally efficient, constant envelope waveform coding. In contrast to prior work, we model the encoder output layer as a phase modulation layer with a complex exponential activation function. In addition, we model the decoder with a complex-valued feature detection layer that may be coherent or noncoherent. The complex topology leads to noncoherent waveform coding methods not obtained in prior studies. The paper provides a mathematical framework for training the proposed autoencoder along with illustrative examples that demonstrate its ability to learn improved spectral efficiency relative to traditional orthogonal and biorthogonal modulations.

关键词： Machine Learning Autoencoder Complex-valued waveform coding Modulation Communications

来源：评论

学校读者我要写书评

暂无评论

Recent and current research on very low bit-rate video coding in Japan

引用

IEICE TRANSACTIONS ON COMMUNICATIONS 1996年第10期E79B卷 1415-1424页

作者： Kaneko, M Department of Information and Communication Engineering School of Engineering University of Tokyo Tokyo 113 Japan

This paper presents an overview of research activities in Japan in the field of very low bit-rate video coding. Related research based on the concept of ''intelligent image coding'' started in the mid-1980's. Although this concept originated from the consideration of a new type of image coding, it can also be applied to other interesting applications such as human interface and psychology. On the other hand, since the beginning of the 1990's, research on the improvement of waveform coding has been actively performed to realize very low bit-rate video coding. Key techniques employed here are improvement of motion compensation and adoption of region segmentation. In addition to the above, we propose new concepts of image coding, which have the potential to open up new aspects of image coding, e.g. ideas of interactive image coding, integrated 3-D visual communication and coding of multimedia information considering mutual relationship amongst various media.

关键词： very low bit-rate video coding intelligent image coding model-based coding waveform coding motion compensation region segmentation integrated 3-D visual communication

来源：评论

学校读者我要写书评

暂无评论

Scalable and Efficient Neural Speech coding: A Hybrid Design

引用

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING 2022年 30卷 12-25页

作者： Zhen, Kai Sung, Jongmo Lee, Mi Suk Beack, Seungkwon Kim, Minje Indiana Univ Dept Comp Sci Bloomington IN 47408 USA Indiana Univ Cognit Sci Program Bloomington IN 47408 USA Elect & Telecommun Res Inst Daejeon 34129 South Korea Indiana Univ Dept Intelligent Syst Engn Bloomington IN 47408 USA

We present a scalable and efficient neural waveform coding system for speech compression. We formulate the speech coding problem as an autoencoding task, where a convolutional neural network (CNN) performs encoding and decoding as a neural waveform codec (NWC) during its feedforward routine. The proposed NWC also defines quantization and entropy coding as a trainable module, so the coding artifacts and bitrate control are handled during the optimization process. We achieve efficiency by introducing compact model components to NWC, such as gated residual networks and depthwise separable convolution. Furthermore, the proposed models are with a scalable architecture, cross-module residual learning (CMRL), to cover a wide range of bitrates. To this end, we employ the residual coding concept to concatenate multiple NWC autoencoding modules, where each NWC module performs residual coding to restore any reconstruction loss that its preceding modules have created. CMRL can scale down to cover lower bitrates as well, for which it employs linear predictive coding (LPC) module as its first autoencoder. The hybrid design integrates LPC and NWC by redefining LPC's quantization as a differentiable process, making the system training an end-to-end manner. The decoder of proposed system is with either one NWC (0.12 million parameters) in low to medium bitrate ranges (12 to 20 kbps) or two NWCs in the high bitrate (32 kbps). Although the decoding complexity is not yet as low as that of conventional speech codecs, it is significantly reduced from that of other neural speech coders, such as a WaveNet-based vocoder. For wide-band speech coding quality, our system yields comparable or superior performance to AMR-WB and Opus on TIMIT test utterances at low and medium bitrates. The proposed system can scale up to higher bitrates to achieve near transparent performance.

关键词： Speech coding Bit rate Encoding Decoding Vocoders Complexity theory Speech codecs Neural speech coding waveform coding representation learning model complexity

来源：评论

学校读者我要写书评

暂无评论

Rate-distortion optimized quantization in multistage audio coding

引用

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING 2006年第1期14卷 311-320页

作者： Vafin, R Kleijn, WB Royal Inst Technol KTH Dept Signals Sensors & Syst Tallinn Estonia Royal Inst Technol Dept Signals Sensors & Syst S-10044 Stockholm Sweden

In this work, we develop a new method for quantization in multistage audio coding. Given a (perceptual) distortion measure and a bit-rate constraint, we analytically derive the optimal rate distribution between subcoders (stages) and the corresponding optimal quantizers using high-rate theory. The analytical solutions for optimal quantizers allow a coder to easily adapt to changes in bit-rate requirements. As an illustration of the new method, we consider quantization in a two-stage sinusoidal/wave form coder that is a widely used combination in audio coding. We show that at low total rates most of the rate should be assigned to the sinusoidal (model-based, subspace) subcoder, while at high total rates most of the rate should be assigned to the waveform (full-space) subcoder. We compare the new method to a reference quantization method that does not use rate-distortion optimization. A significantly higher performance of the new method is shown by means of a listening test.

关键词： audio coding high-rate theory modified discrete cosine transform (MDCT) multistage coding quantization rate-distortion optimization sinusoidal coding waveform coding

来源：评论

学校读者我要写书评

暂无评论

Nonlinear discrete Fourier transformer for the time series analysis and application to speech coding

Nonlinear discrete Fourier transformer for the time series a...

引用

IEEE Region 10 Annual Conference on Speech and Image Technologies for Computing and Telecommunications (IEEE TENCON 97)

作者： Dai, XH Shantou Univ China

ISBN: (纸本)0780343654

The primary motivation of the paper is to investigate waveform coding of speech signal. The paper presents a new signal analyzing tool - nonlinear discrete Fourier transform (NDFT) which has an improved signal analysis performance. By virtue of the NDFT, waveform coding of the speech signal with a long segment (for ex. a segment with 512 or 1024 samples) is studied. The new coding method provides an improved performance of the speech coding at as low as 4 kbit/s, the feature of reproduced signal is kept more significant than that of the linear predictor coding.

关键词： speech coding transform coding Fourier transforms time series nonlinear discrete Fourier transformer time series analysis speech coding waveform coding signal analyzing tool signal analysis performance NDFT speech signal 4 kbit/s

来源：评论

学校读者我要写书评

暂无评论

Hi-BIN: An alternative approach to wideband speech coding 25

Hi-BIN: An alternative approach to wideband speech coding

引用

IEEE International Conference on Acoustics, Speech, and Signal Processing

作者： Taori, R Sluijter, RJ Gerrits, AJ Philips Res Labs Digital Signal Proc Grp Eindhoven Netherlands

ISBN: (纸本)0780362934

In this paper, an encoding technique called Hi-BIN (High Band Injection), which can be combined with any narrowband coder to achieve good quality wideband speech, is described. The principle behind this technique is to model frequencies above 4 kHz by noise with an appropriate spectral shape. This simple way of injecting synthetic noise in the higher frequencies gives surprisingly good quality when compared to very widely used computationally intensive waveform coding techniques such as CELP. We will Show that Hi-BIN offers a low bit-rate representation of the higher band and is backwards compatible with existing narrowband speech coding systems.

关键词： speech coding high-frequency band broadband Narrowband waveform coding

来源：评论

学校读者我要写书评

暂无评论

ENTROPY-CONSTRAINED VECTOR QUANTIZATION

引用

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING 1989年第1期37卷 31-42页

作者： CHOU, PA LOOKABAUGH, T GRAY, RM COMPRESSION LABS INC SAN JOSE CA 95134 USA STANFORD UNIV DEPT ELECT ENGN INFORMAT SYST LAB STANFORD CA 94305 USA

An iterative descent algorithm based on a Lagrangian formulation for designing vector quantizers having minimum distortion subject to an entropy constraint is discussed. These entropy-constrained vector quantizers (ECVQs) can be used in tandem with variable-rate noiseless coding systems to provide locally optimal variable-rate block source coding with respect to a fidelity criterion. Experiments on sampled speech and on synthetic sources with memory indicate that for waveform coding at low rates (about 1 bit/sample) under the squared error distortion measure, about 1.6 dB improvement in the signal-to-noise ratio can be expected over the best scalar and lattice quantizers when block entropy-coded with block length 4. Even greater gains are made over other forms of entropy-coded vector quantizers. For pattern recognition, it is shown that the ECVQ algorithm is a generalization of the k-means and related algorithms for estimating cluster means, in that the ECVQ algorithm estimates the prior cluster probabilities as well. Experiments on multivariate Gaussian distributions show that for clustering problems involving classes with widely different priors, the ECVQ outperforms the k-means algorithm in both likelihood and probability of error.

关键词： waveform coding algorithms quantizer Gaussian distribution K-means algorithm Distortion measurement Entropy coding

来源：评论

学校读者我要写书评

暂无评论

Bearing fault diagnosis using a novel coding-statistic feature combined with NNC

引用

JOURNAL OF VIBROENGINEERING 2022年第5期24卷 848-861页

作者： Qiu, Mingquan Zhao, Zebo Guizhou Minzu Univ Sch Mechatron Engn Guiyang Peoples R China

The failures of rolling bearings usually cause the breakdown of rotating machinery. Therefore, bearing fault diagnosis is receiving more and more attentions. In this paper, a new coding-statistic feature is proposed for bearing fault diagnosis. Firstly, a waveform coding matrix (WCM) is drawn from each signal using a coding algorithm then a statistical feature is extracted from the WCM with a pre-defined dictionary. Secondly, all statistical features are processed using two-dimensional principal component analysis (2DPCA) to reduce redundant information and dimensionality. Finally, a nearest neighbor classifier (NNC) is employed to classify the bearing faults. Two bearing fault classification problems are utilized to demonstrate the effectiveness of the proposed scheme. Experimental results show that an excellent performance could be accomplished with the proposed scheme.

关键词： bearing fault diagnosis waveform coding coding-statistic feature

来源：评论

学校读者我要写书评

暂无评论

Regression to Classification: waveform Encoding for Neural Field-Based Audio Signal Representation 48

Regression to Classification: Waveform Encoding for Neural F...

引用

48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023

作者： Kim, TaeSoo Rho, Daniel Lee, Gahui Park, JaeHan Ko, Jong Hwan Kt Korea Republic of Sungkyunkwan University Department of Electrical and Computer Engineering Korea Republic of

ISBN: (纸本)9781728163277

Neural fields, also known as coordinate-based representations, are an emerging signal representation framework. This approach has also been used to represent audio signals, but the generated audio often contains noise. To reduce noise and improve representation quality, we propose using waveform encoding in the neural field. Instead of yielding real numbers for each temporal coordinate, this involves using discrete integers as outputs, with waveform-encoded integers as target classes, and treating the representation problem as a classification task rather than a regression problem. The experimental results show that waveform encoding can improve the audio quality of neural fields across a variety of audio datasets. © 2023 IEEE.

关键词： audio representations implicit neural representation neural fields waveform coding

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：