The discrete cosine transform is a valuable tool in analysis of data on undirected rectangular grids, like images. In this paper it is shown how one can define an analogue of the discrete cosine transform on triangles...
详细信息
ISBN:
(纸本)9781479981311
The discrete cosine transform is a valuable tool in analysis of data on undirected rectangular grids, like images. In this paper it is shown how one can define an analogue of the discrete cosine transform on triangles. This is done by combining algebraic signalprocessing theory with a specific kind of multivariate Chebyshev polynomials. Using a multivariate Christoffel-Darboux formula it is shown how to derive an orthogonal version of the transform.
The measure-transformed (MT) MUltiple signal Classification (MUSIC) algorithm is a robust MUSIC generalization that operates by applying a transform to the probability measure (distribution) of the data. In this paper...
详细信息
ISBN:
(纸本)9781479981311
The measure-transformed (MT) MUltiple signal Classification (MUSIC) algorithm is a robust MUSIC generalization that operates by applying a transform to the probability measure (distribution) of the data. In this paper, we first provide an asymptotic mean-squared-error (MSE) performance analysis of the MT-MUSIC algorithm. Under some mild assumptions, we show that the MT-MUSIC estimator is asymptotically normal and unbiased, and obtain an analytic expression for the asymptotic MSE matrix. We then proceed to develop a strongly consistent estimator for the asymptotic MSE matrix that is constructed from the same data samples being used for implementation of the MT-MUSIC. This paves the way for development of a data-driven procedure for optimal selection of the measure transformation parameters that minimizes an empirical estimate of the asymptotic average root MSE (RMSE). Simulation examples illustrate the performance advantage of the proposed MSE based optimization of the MT-MUSIC.
The subspace clustering problem arises in many applications that involve processing high-dimensional data, i.e. images and videos. In many of these applications, high dimensional data is often well approximated by uni...
详细信息
ISBN:
(纸本)9781479981311
The subspace clustering problem arises in many applications that involve processing high-dimensional data, i.e. images and videos. In many of these applications, high dimensional data is often well approximated by union of low-dimensional subspaces. This motivated the development of various algorithms to cluster high dimensional data based on the underlying intrinsic low-dimensional subspaces. However, the existing approaches are based on global representation of data whereas this representation can be easily affected by errors, occlusions and severe illumination conditions. Here, we propose a multi-scale approach based on extracting local patches from different scales and then merging the shared information using a weighted scheme based on Grassmann manifolds. This approach not only benefits from the discriminative information from global representation of data but also makes the clustering task more robust using the information from local representations. Numerical results show that the proposed approach significantly outperforms existing subspace clustering algorithms.
Missing data is an inevitable and ubiquitous problem in the data-driven Intelligent Transportation System (ITS), which seriously affects the accuracy of urban traffic planning and management. Most existing traffic dat...
详细信息
ISBN:
(纸本)9781728135557
Missing data is an inevitable and ubiquitous problem in the data-driven Intelligent Transportation System (ITS), which seriously affects the accuracy of urban traffic planning and management. Most existing traffic dataprocessing methods often only exploit the characteristics of single source data. In this paper, we present a novel coupled tensors model by using multi-source traffic data for missing data imputation, and propose a tensor completion algorithm based on a modified CMTF-WOPT(Coupled Matrix and Tensor Factorization-Weighted OP-Timization) algorithm to recover the missing traffic data. We also present extensive simulation results by using real world traffic datasets to evaluate the performance of the proposed algorithm. The simulation results show that the proposed coupled tensor completion algorithm makes a significant improvement on the recovery accuracy compared with existing tensor completion algorithms, especially under high missing rates.
In this paper, we present a method to handle data imbalance for classification with neural networks, and apply it to acoustic event detection (AED) problem. The common approach to tackle data imbalance is to use class...
详细信息
ISBN:
(纸本)9781479981311
In this paper, we present a method to handle data imbalance for classification with neural networks, and apply it to acoustic event detection (AED) problem. The common approach to tackle data imbalance is to use class-weights in the objective function while training. An existing more sophisticated approach is to map the input to clusters in an embedding space, so that learning is locally balanced by incorporating inter-cluster and inter-class margins. On these lines, we propose a method to learn the embedding using a novel objective function, called triple-header cross entropy. Our scheme integrates in a simple way with back-propagation based training, and is computationally more efficient than general hinge-loss based embedding learning schemes. The empirical evaluation results demonstrate the effectiveness of the proposed method for AED with imbalanced training data.
This paper presents a new course design to enhance students' interest in our graduate curriculum of morden digital signalprocessing (MDSP). The familiar radio frequency identification (RFID) technology is introdu...
详细信息
ISBN:
(纸本)9781479981311
This paper presents a new course design to enhance students' interest in our graduate curriculum of morden digital signalprocessing (MDSP). The familiar radio frequency identification (RFID) technology is introduced to reveal the fundamentals of the least-mean-square (LMS) algorithm. Students are encouraged to encode their customised data into their respective RFID tags, and to use the software-defined radio (SDR) platform to monitor the inventory process integrated within RFID communication systems. After acquiring the digital signal from the SDR hardware, they need to perform adaptive channel estimation and signal detection to identify the customised data. By applying appropriate mathematical tools in the Matlab programming environment to solve the signal detection problem in two different communication scenarios, students reported to have had a deeper understanding of the signalprocessing concepts of adaptive filtering techniques in a self-directed manner.
This paper describes the CMU Wilderness Multilingual Speech dataset. A dataset of over 700 different languages providing audio, aligned text and word pronunciations. On average each language provides around 20 hours o...
详细信息
ISBN:
(纸本)9781479981311
This paper describes the CMU Wilderness Multilingual Speech dataset. A dataset of over 700 different languages providing audio, aligned text and word pronunciations. On average each language provides around 20 hours of sentence-lengthed transcriptions. We describe our multi-pass alignment techniques and evaluate the results by building speech synthesizers on the aligned data. Most of the resulting synthesizers are good enough for deployment and use. The tools to do this work are released as open source, and instructions on how to apply such alignment for novel languages are given.
In this work, we investigate residual neural network representations for the identification and forecasting of dynamical systems. We propose a novel architecture that jointly learns the dynamical model and the associa...
详细信息
ISBN:
(纸本)9781479981311
In this work, we investigate residual neural network representations for the identification and forecasting of dynamical systems. We propose a novel architecture that jointly learns the dynamical model and the associated Runge-Kutta integration scheme. We demonstrate the relevance of the proposed architecture with respect to learning-based state-of-the-art approaches in the identification and forecasting of chaotic dynamics when provided with training data with low temporal sampling rates.
In order to analyze the mobile user capacity supportable by a cell in LTE-A systems, we propose a methodology of combining a mathematical model and field measurement data. One important phenomenon considered by the pr...
详细信息
ISBN:
(纸本)9781728135557
In order to analyze the mobile user capacity supportable by a cell in LTE-A systems, we propose a methodology of combining a mathematical model and field measurement data. One important phenomenon considered by the proposed model is that the informations of resource assignment for the both downlink and uplink are transmitted on the same Physical Downlink Control Channel (PDCCH), i.e., the both downlink and uplink controls share the same PDCCH resource which may induce congestions. We perform field measurement on LTE-A cells to collect measurement data to support the parameters utilized by the proposed model. Through comparing the numerical results and the data from field measurement, we confirm the accuracy of the proposed mathematical model.
We investigate supervised learning strategies that improve the training of neural network audio classifiers on small annotated collections. In particular, we study whether (i) a naive regularization of the solution sp...
详细信息
ISBN:
(纸本)9781479981311
We investigate supervised learning strategies that improve the training of neural network audio classifiers on small annotated collections. In particular, we study whether (i) a naive regularization of the solution space, (ii) prototypical networks, (iii) transfer learning, or (iv) their combination, can foster deep learning models to better leverage a small amount of training examples. To this end, we evaluate (i-iv) for the tasks of acoustic event recognition and acoustic scene classification, considering from 1 to 100 labeled examples per class. Results indicate that transfer learning is a powerful strategy in such scenarios, but prototypical networks show promising results when one does not count with external or validation data.
暂无评论