Online averaged stochastic gradient algorithms are more and more studied since (i) they can deal quickly with large sample taking values in high-dimensional spaces, (ii) they enable to treat data sequentially, (iii) t...
详细信息
Online averaged stochastic gradient algorithms are more and more studied since (i) they can deal quickly with large sample taking values in high-dimensional spaces, (ii) they enable to treat data sequentially, (iii) they are known to be asymptotically efficient. In this paper, we focus on giving explicit bounds of the quadratic mean error of the estimates, and this, without supposing that the function we would like to minimize is strongly convex or admits a bounded gradient.
Traditional filtering theory is always based on optimization of the expected value of a suitably chosen function of error, Such as the minimum mean-square error (MMSE) criterion, the minimum error entropy (MEE) criter...
详细信息
Traditional filtering theory is always based on optimization of the expected value of a suitably chosen function of error, Such as the minimum mean-square error (MMSE) criterion, the minimum error entropy (MEE) criterion, and so oil. None of those criteria could capture all the probabilistic information about the error distribution. In this work, we propose a novel approach to shape the probability density function (PDF) of the errors in adaptive filtering. As the PDF contains all the probabilistic information, the proposed approach can be used to obtain the desired variance or entropy, and is expected to be useful in the complex signal processing and learning systems. In our method, the information divergence between the actual errors and the desired errors is chosen as the cost function, which is estimated by kernel approach. Some important properties of the estimated divergence are presented. Also, for the finite impulse response (FIR) Filter, a stochastic gradient algorithm is derived. Finally, simulation examples illustrate the effectiveness of this algorithm in adaptive system training.
In this paper, by introducing the statistics of training data into support vector regression (SVR), we propose a minimum deviation distribution regression (MDR). Rather than just minimizing the structural risk, MDR al...
详细信息
In this paper, by introducing the statistics of training data into support vector regression (SVR), we propose a minimum deviation distribution regression (MDR). Rather than just minimizing the structural risk, MDR also minimizes both the regression deviation mean and the regression deviation variance, which is able to deal with the different distribution of boundary data and noises. The formulation of minimizing the first and second order statistics in MDR leads to a strongly convex quadratic programming problem (QPP). An efficient dual coordinate descend algorithm is adopted for small sample problem, and an average stochastic gradient algorithm for large scale one. Both theoretical analysis and experimental results illustrate the efficiency and effectiveness of the proposed method. (C) 2018 Elsevier B.V. All rights reserved.
An adaptive error-constrained least mean square (AECLMS) algorithm is derived and proposed using adaptive error-constrained optimization techniques. This is accomplished by modifying the cost function of the LMS algor...
详细信息
An adaptive error-constrained least mean square (AECLMS) algorithm is derived and proposed using adaptive error-constrained optimization techniques. This is accomplished by modifying the cost function of the LMS algorithm using augmented Lagrangian multipliers. Theoretical analyses of the proposed method are presented in detail. The method shows improved performance in terms of convergence speed and misadjustment. This proposed adaptive errorconstrained method can easily be applied to and combined with other LMS-type stochasticalgorithms. Therefore, we also apply the method to constant modulus criterion for blind method and backpropagation algorithm for multilayer perceptrons. Simulation results show that the proposed method can accelerate the convergence speed by 2 to 20 times depending on the complexity of the problem. (c) 2005 Elsevier B.V. All rights reserved.
By combining the coupling identification concept with the gradient search, this study develops a partially coupled generalised extended projection algorithm and a partially coupled generalised extended stochastic grad...
详细信息
By combining the coupling identification concept with the gradient search, this study develops a partially coupled generalised extended projection algorithm and a partially coupled generalised extended stochastic gradient algorithm to estimate the parameters of a multivariable output-error-like system with autoregressive moving average noise from input-output data. The key is to divide the identification model into several submodels based on the hierarchical identification principle and to establish the parameter estimation algorithm by using the coupled relationship between these submodels. The simulation test results indicate that the proposed algorithms are effective.
This paper proposes a novel solution to the problem of beamforming and power control in the downlink of a multiple-input multiple-output ( MIMO) orthogonal frequency-division multiplexing (OFDM) system. This solution ...
详细信息
This paper proposes a novel solution to the problem of beamforming and power control in the downlink of a multiple-input multiple-output ( MIMO) orthogonal frequency-division multiplexing (OFDM) system. This solution is developed in two steps. First, we describe an adaptive beamforming technique that, using a stochasticgradient method, maximizes the power delivered to a mobile terminal. In the proposed solution, perturbed precoding matrices are time multiplexed in the information signal transmitted to a mobile terminal;then, the mobile terminal informs the transmitter, via a single feedback bit, about the perturbation delivering the larger power. This approach does not need pilot symbols and uses quasi-Monte Carlo methods to generate the required perturbations with the relevant advantages of improving the downlink spectral efficiency and reducing the system complexity with respect to other competing solutions. Then, we propose a novel power-control algorithm that, selecting a proper transmission energy level from a set of possible values, aims to minimize the average bit error rate. This set of levels is generated on the basis of the channel statistics and a long-term constraint on the average transmission power. Numerical results evidence the robustness of the proposed algorithms in a dynamic fading environment.
We consider an inverse reinforcement learning problem involving "us" versus an "enemy" radar equipped with a Bayesian tracker. By observing the emissions of the enemy radar, how can we identify if ...
详细信息
We consider an inverse reinforcement learning problem involving "us" versus an "enemy" radar equipped with a Bayesian tracker. By observing the emissions of the enemy radar, how can we identify if the radar is cognitive (constrained utility maximizer)? Given the observed sequence of actions taken by the enemy's radar, we consider three problems: (i) Are the enemy radar's actions (waveform choice, beam scheduling) consistent with constrained utility maximization? If so how can we estimate the cognitive radar's utility function that is consistent with its actions. We formulate, and solve the problem in terms of the spectra (eigenvalues) of the state, and observation noise covariance matrices, and the algebraic Riccati equation. (ii) How to construct a statistical test for detecting a cognitive radar (constrained utility maximization) when we observe the radar's actions in noise or the radar observes our probe signal in noise? We propose a statistical detector with a tight Type-II error bound. (iii) How can we optimally probe (interrogate) the enemy's radar by choosing our state to minimize the Type-II error of detecting if the radar is deploying an economic rational strategy, subject to a constraint on the Type-I detection error? We present a stochastic optimization algorithm to optimize our probe signal. The main analysis framework used in this paper is that of revealed preferences from microeconomics.
An alternative blind adaptive multiuser detection is investigated based on modified constrained constant modulus (CM) criterion. It has been shown that the performance of a CM-based receiver is limited by the received...
详细信息
An alternative blind adaptive multiuser detection is investigated based on modified constrained constant modulus (CM) criterion. It has been shown that the performance of a CM-based receiver is limited by the received power of the desired user. In this paper, we show that the limitation can be avoided using noncanonical constraint CIM criterion and that in the presence of channel noise the modified CM criterion function is strictly convex by properly selecting some constant. With analyzing the extrema of the cost function, we point out how to select the constant. Moreover, a simple stochastic gradient algorithm for implementing our scheme is presented, and the convergence properties of the algorithm are analyzed. Simulation examples are given to demonstrate the performance of the proposed scheme.
Convergence of a projected stochastic gradient algorithm is demonstrated for convex objective functionals with convex constraint sets in Hilbert spaces. In the convex case, the sequence of iterates u(n) converges weak...
详细信息
Convergence of a projected stochastic gradient algorithm is demonstrated for convex objective functionals with convex constraint sets in Hilbert spaces. In the convex case, the sequence of iterates u(n) converges weakly to a point in the set of minimizers with probability one. In the strongly convex case, the sequence converges strongly to the unique optimum with probability one. An application to a class of PDE constrained problems with a convex objective, convex constraint, and random elliptic PDE constraints is shown. Theoretical results are demonstrated numerically.
The stochasticgradient descent algorithm (SGD) is the main optimization solution in deep learning. The performance of SGD depends critically on how learning rates are tuned over time. In this paper, we propose a nove...
详细信息
The stochasticgradient descent algorithm (SGD) is the main optimization solution in deep learning. The performance of SGD depends critically on how learning rates are tuned over time. In this paper, we propose a novel energy index based optimization method (EIOM) to automatically adjust the learning rate in the backpropagation. Since a frequently occurring feature is more important than a rarely occurring feature, we update the features to different extents according to their frequencies. We first define an energy neuron model and then design an energy index to describe the frequency of a feature. The learning rate is taken as a hyperparameter function according to the energy index. To empirically evaluate the EIOM, we investigate different optimizers with three popular machine learning models: logistic regression, multilayer perceptron, and convolutional neural network. The experiments demonstrate the promising performance of the proposed EIOM compared with that of other optimization algorithms. (C) 2018 Elsevier Ltd. All rights reserved.
暂无评论