In this paper, we propose a novel approach for spectral unmixing by unfolding the iterative shrinkage-thresholding algorithm (ISTA) into a deep neural network architecture. Spectral unmixing aims at identifying the en...
详细信息
ISBN:
(纸本)9781538691540
In this paper, we propose a novel approach for spectral unmixing by unfolding the iterative shrinkage-thresholding algorithm (ISTA) into a deep neural network architecture. Spectral unmixing aims at identifying the endmembers and their fractional abundances in the mixed pixels. Once the endmembers are obtained as a dictionary, abundance estimation can be defined as a sparse coding problem with nonnegativity constraint. There are a number of iterative optimization algorithms for solving this problem, including ISTA, however, they always require hundreds and even thousands iterations, which is too slow for time-sensitive applications. In contrast, deep neural networks can approximate a finite closed-form expression to direct estimate abundances by learning from training samples, but they are closer to black-box mechanism rather than problem-level formulations. Deep unfolding constructs a deep neural network architecture inspired by the problem model and its corresponding optimization algorithm, which incorporates the prior knowledge of physical model and algorithm into network architecture. In this paper, the deep unfolded ISTA model is adopted for abundance estimation. It uses only a small training set to learn the model parameters, and then the abundance estimation become to be a feed-forward process in this model, which is very fast since no iteration is required.
Computer animation researchers have been extensively investigating 3D facial-expression synthesis for decades. However, flexible, robust production of realistic 3D facial expressions is still technically challenging. ...
详细信息
Computer animation researchers have been extensively investigating 3D facial-expression synthesis for decades. However, flexible, robust production of realistic 3D facial expressions is still technically challenging. A proposed modeling framework applies sparse coding to synthesize 3D expressive faces, using specified coefficients or expression examples. It also robustly recovers facial expressions from noisy and incomplete data. This approach can synthesize higher-quality expressions in less time than the state-of-the-art techniques.
This paper focuses mainly on adaptive dictionary updating and abnormality detection via weighted space coding in video surveillance. Generally, abnormality analysis conducted on a large amount of video data is very co...
详细信息
ISBN:
(纸本)9781479923427
This paper focuses mainly on adaptive dictionary updating and abnormality detection via weighted space coding in video surveillance. Generally, abnormality analysis conducted on a large amount of video data is very complicated, time-consuming and time-variant. However, our dictionary is very efficient at following up on shifted contents in video and abandoning old inactive information in time. The adaptability characteristic also helps reduce the dictionary's size to a small scale, since it only needs to keep recent or active information. We also introduce a simple, but effective, judgement criterion for abnormal detection based on sparse coding over weighted bases. Because of the condensed dictionary and the simplified judgment criterion, our algorithm performs online learning and online detection with a high speed and a high accuracy in various scenes.
This work proposes a time warp invariant sparse coding and dictionary learning framework for time series clustering, where both input samples and atoms define time series of different lengths that involve variable del...
详细信息
ISBN:
(纸本)9783030109257;9783030109240
This work proposes a time warp invariant sparse coding and dictionary learning framework for time series clustering, where both input samples and atoms define time series of different lengths that involve variable delays. For that, first an l(0) sparse coding problem is formalised and a time warp invariant orthogonal matching pursuit based on a new cosine maximisation time warp operator is proposed. A dictionary learning under time warp is then formalised and a gradient descent solution is developed. Lastly, a time series clustering based on the time warp sparse coding and dictionary learning is presented. The proposed approach is evaluated and compared to major alternative methods on several public datasets, with an application to deezer music data stream clustering. Data related to this paper are available at: The link to the data and the evaluating algorithms are provided in the paper. Code related to this paper is available at: The link will be provided at the first author personal website (http://***/similar to varasteh/).
sparse representation is a signal model to represent signals with a linear combination of a small number of prototype signals called atoms, and a set of atoms is called a dictionary. The design of the dictionary is a ...
详细信息
ISBN:
(纸本)9783030375997;9783030375980
sparse representation is a signal model to represent signals with a linear combination of a small number of prototype signals called atoms, and a set of atoms is called a dictionary. The design of the dictionary is a fundamental problem for sparse representation. However, when there are scaled or translated features in the signals, unstructured dictionary models cannot extract such features. In this paper, we propose a structured dictionary model which is scale and shift-invariant to extract features which commonly appear in several scales and locations. To achieve both scale and shift invariance, we assume that atoms of a dictionary are generated from vectors called ancestral atoms by scaling and shift operations, and an algorithm to learn these ancestral atoms is proposed.
Calcium imaging has become a fundamental neural imaging technique, aiming to recover the individual activity of hundreds of neurons in a cortical region. Current methods ( mostly matrix factorization) are aimed at det...
详细信息
ISBN:
(纸本)9781479981311
Calcium imaging has become a fundamental neural imaging technique, aiming to recover the individual activity of hundreds of neurons in a cortical region. Current methods ( mostly matrix factorization) are aimed at detecting neurons in the field-of-view and then inferring the corresponding time-traces. In this paper, we reverse the modeling and instead aim to minimize the spatial inference, while focusing on finding the set of temporal traces present in the data. We reframe the problem in a dictionary learning setting, where the dictionary contains the time-traces and the sparse coefficient are spatial maps. We adapt dictionary learning to calcium imaging by introducing constraints on the norms and correlations of the time-traces, and incorporating a hierarchical spatial filtering model that correlates the time-trace usage over the field-of-view. We demonstrate on synthetic and real data that our solution has advantages regarding initialization, implicitly inferring number of neurons and simultaneously detecting different neuronal types.
This paper presents a method for reducing the Diffusion Weighted Magnetic Resonance Imaging (DW-MRI) examination time based on the mathematical framework of sparse representations. The aim is to undersample the b-valu...
详细信息
ISBN:
(纸本)9781728146171
This paper presents a method for reducing the Diffusion Weighted Magnetic Resonance Imaging (DW-MRI) examination time based on the mathematical framework of sparse representations. The aim is to undersample the b-values used for DW-MRI image acquisition which reflect the strength and timing of the gradients used to generate the DW-MRI images since their number defines the examination time. To test our method we investigate whether the undersampled DW-MRI data preserve the same accuracy in terms of extracted imaging biomarkers. The main procedure is based on the use of the k-Singular Value Decomposition (k-SVD) and the Orthogonal Matching Pursuit (OMP) algorithms, which are appropriate for the sparse representations computation. The presented results confirm the hypothesis of our study as the imaging biomarkers extracted from the sparsely reconstructed data have statistically close values to those extracted from the original data. Moreover, our method achieves a low reconstruction error and an image quality close to the original.
In this paper, we propose a method to estimate secure sparse representations in L0 norm minimization and its application to Encryption-then-Compression (EtC) systems. The proposed scheme provides a practical Orthogona...
详细信息
ISBN:
(纸本)9781728121949
In this paper, we propose a method to estimate secure sparse representations in L0 norm minimization and its application to Encryption-then-Compression (EtC) systems. The proposed scheme provides a practical Orthogonal Matching Pursuit (OMP) algorithm that allows computation in the encrypted domain. We prove, theoretically, that the proposal has exactly the same estimation performance as the unencrypted variant of the OMP algorithm. We demonstrate the security strength of the proposed secure sparse representations. Even if the dictionary information is leaked, the proposed scheme protects the privacy information of the observed signals.
Similar to the deep architectures, a novel multi-layer architecture is used to extend the linear blind source separation (BSS) method to the nonlinear case in this paper. The approach approximates the nonlinearities b...
详细信息
ISBN:
(纸本)9781479981311
Similar to the deep architectures, a novel multi-layer architecture is used to extend the linear blind source separation (BSS) method to the nonlinear case in this paper. The approach approximates the nonlinearities based on a polynomial network, where the layer of our network begins with the polynomial of degree 1, up to build an output layer that can represent data with a small bias by a good approximate basis. Relying on several transformations of the input data, with higher-level representation from lower-level ones, the networks are to fulfill a mapping implicitly to the high-dimensional space. Once the polynomial networks are built, the coefficient matrix can be estimated by solving an l(1)-regularization on the coding coefficient vector. The experiment shows that the proposed approach exhibits a higher separation accuracy than the comparison algorithms.
A new multi-stage approach based on component extraction is proposed to more efficiently address the sparse representation problem. In each stage a pre-set number of coefficients are chosen for reconstructing each sig...
详细信息
ISBN:
(纸本)9781479903573
A new multi-stage approach based on component extraction is proposed to more efficiently address the sparse representation problem. In each stage a pre-set number of coefficients are chosen for reconstructing each signal component. A global search is performed to extract a lower dimensional sub-dictionary consisting of a sorted set of candidate atoms to represent the signal component, corresponding to the stage. The best representing atoms are then selected from the sub dictionary using the Matching Pursuit (MP) method. Afterwards, the sparse coefficients are updated in the same manner in which the Orthogonal Matching Pursuit (OMP) operates. The proposed method is more efficient that the conventional OMP methods. To evaluate the performance of the proposed method, it is compared to OMP and Stagewise OMP (StOMP), which are conceptually the most similar to the proposed approach. The results illustrate the proposed method is more time efficient than OMP and more robust and sparser than the StOMP.
暂无评论