This paper presents a comparison of three techniques for dimensionality reduction in feature analysis for automatic speech recognition (ASR). All three approaches estimate a linear transformation that is applied to con...
This tutorial paper describes the methods for constructing fast algorithms for the computation of the discrete Fourier transform (DFT) of a real-valued series. The application of these ideas to all the major fast Fourier transform (FFT) algorithms is discussed, and the various algorithms are compared. We present a new implementation of the real-valued split-radix FFT, an algorithm that uses fewer operations than any other real-valued power-of-2-length FFT. We also compare the performance of inherently real-valued transform algorithms such as the fast Hartley transform (FHT) and the fast cosine transform (FCT) to real-valued FFT algorithms for the computation of power spectra and cyclic convolutions. Comparisons of these techniques reveal that the alternative techniques always require more additions than a method based on a real-valued FFT algorithm and result in computer code of equal or greater length and complexity.
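The conjugate-symmetry property that real-valued FFT algorithms exploit can be sketched with NumPy's `rfft`, used here as an illustrative stand-in for the split-radix implementations compared in the paper: for a real series of length N, only N/2 + 1 spectrum bins carry information, and the power spectrum follows directly from them.

```python
import numpy as np

# For a real-valued series x of length N, the DFT is conjugate-symmetric:
# X[N-k] = conj(X[k]), so only N//2 + 1 bins carry information.
# np.fft.rfft exploits this, roughly halving the work of a complex FFT.
rng = np.random.default_rng(0)
N = 1024                      # power-of-2 length, as in split-radix FFTs
x = rng.standard_normal(N)

X_half = np.fft.rfft(x)       # N//2 + 1 complex bins
X_full = np.fft.fft(x)        # N complex bins (redundant for real input)

# The half-spectrum matches the first half of the full spectrum ...
assert np.allclose(X_half, X_full[: N // 2 + 1])
# ... and the remainder is recoverable by conjugate symmetry.
assert np.allclose(X_full[N // 2 + 1:], np.conj(X_half[1: N // 2][::-1]))

# Power spectrum computed from the real-valued transform alone:
power = np.abs(X_half) ** 2
```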
Hybrid Hidden Markov Model (HMM) and Multi-Layer Perceptron (MLP) neural networks have been applied with great success to speech recognition problems. The hybrid system can be applied to sequence classification problems, where multiple looks at an object are used to determine class membership, and it provides a means to perform feature-level fusion in such problems. A new gradient descent algorithm is employed to find optimal parameters within the HMM/MLP model. The scheme has been applied to a data set of sonar backscattered signals from four underwater objects, which are classified as mine-like or non-mine-like.
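The core hybrid mechanism can be sketched as follows; the one-layer network, uniform state priors, and left-to-right transition matrix are illustrative assumptions, not the paper's actual model. An MLP emits state posteriors P(q | x_t); dividing by the priors P(q) yields scaled likelihoods that replace Gaussian emission densities in the HMM forward recursion.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

rng = np.random.default_rng(1)
T, D, S = 6, 4, 3             # frames, feature dim, HMM states
X = rng.standard_normal((T, D))

# Toy one-layer "MLP" standing in for a trained network.
W, b = rng.standard_normal((D, S)), np.zeros(S)
posteriors = softmax(X @ W + b)            # shape (T, S), rows sum to 1

priors = np.full(S, 1.0 / S)               # assumed uniform state priors
scaled_lik = posteriors / priors           # emission scores for the HMM

# Forward algorithm over a left-to-right HMM with these emissions.
A = np.array([[0.8, 0.2, 0.0],
              [0.0, 0.8, 0.2],
              [0.0, 0.0, 1.0]])
alpha = np.zeros((T, S))
alpha[0] = np.array([1.0, 0.0, 0.0]) * scaled_lik[0]
for t in range(1, T):
    alpha[t] = (alpha[t - 1] @ A) * scaled_lik[t]
seq_score = alpha[-1].sum()                # sequence-level class score
```

In a full system, one such scoring pass per class model would determine the sequence's class membership.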
A new method for the reduction of the number of colors in a digital image is proposed. The new method is based on the development of a new neural network classifier that combines the advantages of the Growing Neural Gas...
In this paper, the authors exploit a multispectral image representation to perform more accurate document image binarisation compared to previous color representations. In the first stage, image fusion is employed to ...
ISBN (print): 9780889868243
In order to successfully locate and retrieve document images such as technical articles and newspapers, a text localization technique must be employed. The proposed method detects and extracts homogeneous text areas in document images, irrespective of font type and size, by using connected component analysis to detect blocks of foreground objects. Next, a descriptor consisting of a set of structural features is extracted from the merged blocks and used as input to a trained Support Vector Machine (SVM). Finally, the SVM output classifies each block as text or non-text.
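The pipeline above can be sketched with `scipy.ndimage` and scikit-learn; the four structural features (height, width, aspect ratio, fill density) and the tiny training set are illustrative stand-ins for the paper's actual descriptor and training data.

```python
import numpy as np
from scipy.ndimage import label, find_objects
from sklearn.svm import SVC

def block_features(img, sl):
    """Height, width, aspect ratio, and fill density of one block."""
    h = sl[0].stop - sl[0].start
    w = sl[1].stop - sl[1].start
    density = img[sl].mean()
    return [h, w, w / h, density]

# Binary document image: 1 = foreground (ink), 0 = background.
page = np.zeros((20, 40), dtype=int)
page[2:5, 2:30] = 1           # a wide, dense "text line"
page[10:18, 5:12] = 1         # a tall block (e.g. a graphic)

# Connected component analysis to detect blocks of foreground objects.
labels, n = label(page)
feats = np.array([block_features(page, sl) for sl in find_objects(labels)])

# A trained SVM would come from labelled examples; here a tiny toy set.
X_train = np.array([[3, 28, 9.3, 0.90],   # text-like: wide, dense
                    [3, 25, 8.0, 0.85],
                    [8,  7, 0.9, 0.80],   # non-text: tall, blocky
                    [9,  6, 0.7, 0.90]])
y_train = np.array([1, 1, 0, 0])          # 1 = text, 0 = non-text
clf = SVC(kernel="linear").fit(X_train, y_train)
pred = clf.predict(feats)                 # one label per detected block
```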
ISBN (print): 9780889867192
Most existing document-binarization techniques involve many parameters whose values must be set a priori. Because the ground-truth images are unknown, the evaluation of document binarization techniques is subjective and relies on human observers to estimate appropriate parameter values. The selection of appropriate values for these parameters is crucial and influences the final binarization. However, there is no predetermined set of parameters that guarantees optimal binarization for all document images. This paper proposes a new technique for estimating proper parameter values for each document binarization technique. The proposed approach is based on a statistical performance analysis of a set of binarization results, obtained by applying various binarization techniques with different parameter values. The proposed statistical performance analysis can also identify the best document binarization result obtained by a set of document binarization techniques.
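The parameter-sweep idea can be sketched with a single-parameter binarizer, using a global threshold as a stand-in for the parameterized techniques; the consensus-agreement criterion here is one plausible statistic, not necessarily the paper's exact analysis. Each parameter setting produces a binarization, and results are ranked by agreement with the ensemble's majority vote.

```python
import numpy as np

# Synthetic grayscale "document": dark text strokes on a bright page.
img = np.full((30, 30), 200.0)
img[5:8, 3:27] = 40.0         # a clearly dark stroke
img[15:18, 3:27] = 110.0      # a fainter stroke, sensitive to threshold

# Sweep the binarizer's parameter over a grid of values.
thresholds = [80, 100, 120, 140, 160]
results = [(img < t).astype(int) for t in thresholds]   # 1 = foreground

# Majority-vote consensus across all parameter settings.
consensus = (np.mean(results, axis=0) > 0.5).astype(int)

# Rank each result by its pixel-wise agreement with the consensus.
agreement = [np.mean(r == consensus) for r in results]
best = thresholds[int(np.argmax(agreement))]
```

Here the fainter stroke is captured only by the higher thresholds, so the consensus includes it and the sweep selects the smallest threshold consistent with the majority.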
In this letter, we introduce a novel noise-robust modification method for Gaussian-based models to enhance the performance of radar high-resolution range profile (HRRP) recognition under the test condition of low sign...
This paper describes a unique cross-phoneme speaker identification experiment, using deliberately mismatched phoneme sets for training and testing. The underlying goal is to identify features that represent broad indi...
This article introduces a novel deep-learning based framework, Super-resolution/Denoising network (SDNet), for simultaneous denoising and super-resolution of swept-source optical coherence tomography (SS-OCT) images. The novelty of this work lies in the hybrid integration of data-driven deep-learning with a model-informed noise representation, specifically designed to address the very low signal-to-noise ratio (SNR) and low-resolution challenges in SS-OCT imaging. SDNet introduces a two-step training process, leveraging noise-free OCT references to simulate low-SNR conditions. In the first step, the network learns to enhance noisy images by combining denoising and super-resolution within the noise-corrupted reference domain. To refine its performance, the second step incorporates Principal Component Analysis (PCA) as a self-supervised denoising strategy, eliminating the need for ground-truth data for the noisy images. This unique approach enhances SDNet's adaptability and clinical relevance. A key advantage of SDNet is its ability to balance contrast and texture by adjusting the weights of the two training steps, offering clinicians flexibility for specific diagnostic needs. Experimental results across diverse datasets demonstrate that SDNet surpasses traditional model-based and data-driven methods in computational efficiency, noise reduction, and structural fidelity. The framework excels in improving both image quality and diagnostic accuracy. Additionally, SDNet shows promising adaptability for analyzing low-resolution, low-SNR OCT images, such as those from patients with diabetic macular edema (DME). This study establishes SDNet as a robust, efficient, and clinically adaptable solution for OCT image enhancement, addressing critical limitations in contemporary imaging workflows.