Deep Neural Networks (DNNs) are increasingly being used in a variety of applications. However, DNNs have huge computational and memory requirements. One way to reduce these requirements is to sparsify DNNs by using sm...
详细信息
Deep Neural Networks (DNNs) are increasingly being used in a variety of applications. However, DNNs have huge computational and memory requirements. One way to reduce these requirements is to sparsify DNNs by using smoothed LASSO (Least Absolute Shrinkage and Selection Operator) functions. In this paper, we show that irrespective of error profile, the sparsity values obtained using various smoothed LASSO functions are similar, provided the maximum error of these functions with respect to the LASSO function is the same. We also propose a layer-wise DNN pruning algorithm, where the layers are pruned based on their individual allocated accuracy loss budget, determined by estimates of the reduction in number of multiply-accumulate operations (in convolutional layers) and weights (in fully connected layers). Further, the structured LASSO variants in both convolutional and fully connected layers are explored within the smoothed LASSO framework and the tradeoffs involved are discussed. The efficacy of proposed algorithm in enhancing the sparsity within the allowed degradation in DNN accuracy and results obtained on structured LASSO variants are shown on MNIST, SVHN, CIFAR-10, and Imagenette datasets and on larger networks such as ResNet-50 and Mobilenet.
This paper studies the Diminishing Return Submodular Cover (DRSC) problem as follows: given a diminishing return submodular function f :Z(+)(E)-> R+ over the ground set E of size n, a positive integer B as the maxi...
详细信息
This paper studies the Diminishing Return Submodular Cover (DRSC) problem as follows: given a diminishing return submodular function f :Z(+)(E)-> R+ over the ground set E of size n, a positive integer B as the maximum value of a coordinate of a vector in Z(+)(E) and a threshold alpha > 0, the problem asks to find the vector with the smallest size x so that f(x) >= alpha. This problem has been attracted by much research recently due to its importance in machine learning and combinatorial optimization. However, because of high query and adaptive complexities, the existing algorithms might still not be efficient enough when working with large input data. This paper proposes a randomized and bicriteria approximation algorithm that has O (log n) adaptive complexity and O ((n + log n log B) log(nB)) query complexity. Our algorithm significantly reduces both adaptive and query complexities compared to some state-of-the-art algorithms by factors of Omega(log(m) log(2)(mn log B)(1 + log(log B)/ log(n))), Omega(min{n/ log(n),log(B)}) respectively, where m = f (B center dot 1). (c) 2023 Elsevier B.V. All rights reserved.
Obtaining strong linear relaxations for capacitated covering problems constitutes a significant technical challenge. For one of the most basic cases, the relaxation based on knapsack-cover inequalities has an integral...
详细信息
The purpose of the research under consideration is to develop a mathematical model to calculate the trajectories of the ferromagnetic operating elements (millstones) of an electromagnetic mill, moving in a rotating ma...
详细信息
Now-a-days vehicles have been used at a large scale in the modern society. But in many countries the current traffic-safety statistics are very terrifying. This article focuses on the communication security issues in ...
详细信息
We present a framework for generalizing the primal-dual gradient method, also known as the gradient descent ascent method, for solving convex-concave minimax problems. The framework is based on the observation that th...
详细信息
We present a framework for generalizing the primal-dual gradient method, also known as the gradient descent ascent method, for solving convex-concave minimax problems. The framework is based on the observation that the primal-dual gradient method can be viewed as an inexact gradient method applied to the primal problem. Unlike the setting of traditional inexact gradient methods, the inexact gradient is computed by a dynamic inexact oracle, which is a discrete-time dynamical system whose output asymptotically approaches the exact gradient. For minimax problems, dynamic inexact oracles are capable of modeling a range of first-order methods for computing the gradient of the primal objective, which relies on solving the inner maximization problem. We provide a unified convergence analysis of gradient methods with dynamic inexact oracles and demonstrate its use in creating new accelerated primal-dual algorithms.
In this paper, we propose a parallel and scalable approach for geodesic distance computation on triangle meshes. Our key observation is that the recovery of geodesic distance with the heat method [1] can be reformulat...
详细信息
In this paper, we propose a parallel and scalable approach for geodesic distance computation on triangle meshes. Our key observation is that the recovery of geodesic distance with the heat method [1] can be reformulated as optimization of its gradients subject to integrability, which can be solved using an efficient first-order method that requires no linear system solving and converges quickly. Afterward, the geodesic distance is efficiently recovered by parallel integration of the optimized gradients in breadth-first order. Moreover, we employ a similar breadth-first strategy to derive a parallel Gauss-Seidel solver for the diffusion step in the heat method. To further lower the memory consumption from gradient optimization on faces, we also propose a formulation that optimizes the projected gradients on edges, which reduces the memory footprint by about 50 percent. Our approach is trivially parallelizable, with a low memory footprint that grows linearly with respect to the model size. This makes it particularly suitable for handling large models. Experimental results show that it can efficiently compute geodesic distance on meshes with more than 200 million vertices on a desktop PC with 128 GB RAM, outperforming the original heat method and other state-of-the-art geodesic distance solvers.
Given a set of n points in the plane, and a parameter k, we consider the problem of computing the minimum (perimeter or area) axis-aligned rectangle enclosing k points. We present the first near quadratic time algorit...
详细信息
Given a set of n points in the plane, and a parameter k, we consider the problem of computing the minimum (perimeter or area) axis-aligned rectangle enclosing k points. We present the first near quadratic time algorithm for this problem, improving over the previous near- O(n(5/2))-time algorithm by Kaplan et al. (25th European Symposium on algorithms. Leibniz Int Proc Inform, vol. 87, # 52. Leibniz-Zent Inform, Wadern, 2017). We provide an almost matching conditional lower bound, under the assumption that (min,+)-convolution cannot be solved in truly subquadratic time. Furthermore, we present a new reduction (for both perimeter and area) that can make the time bound sensitive to k, giving near O(nk) time. We also present a near linear time (1 + epsilon)-approximation algorithm to the minimum area of the optimal rectangle containing k points. In addition, we study related problems including the 3-sided, arbitrarily oriented, weighted, and subset sum versions of the problem.
Two-dimensional phase unwrapping (PU) is an essential step in interferometric synthetic aperture radar (InSAR) analysis. Although the L-0-norm PU method is desired, it is nondeterministic polynomial (NP)-hard. Thus, m...
详细信息
Two-dimensional phase unwrapping (PU) is an essential step in interferometric synthetic aperture radar (InSAR) analysis. Although the L-0-norm PU method is desired, it is nondeterministic polynomial (NP)-hard. Thus, many PU meth-ods have been proposed to find near-optimal solutions of the L-0-norm. As PU is an ill-posed problem, it is difficult to choose the method with the most accurate solution unless reference data are available. In this letter, we prove that the NP-hardness of the L-0-norm is weak, suggesting that the alpha-approximation algorithm of the L-0-norm can be devised, that is, the obtained near-optimal solutions can be within a factor of alpha of the L-0-norm optimal value. The primary contribution of the proof is that the alpha value of each obtained PU solution can be considered as a new index to assess the PU performance without using reference data. The validity and effectiveness of the alpha-based index have been verified using simulated and acquired interferometric datasets and three PU methods.
The noise distribution in the mechanical system is tightly connected to the structure design and particular operating conditions. The noise source localization can effectively assist with the operating condition monit...
详细信息
The noise distribution in the mechanical system is tightly connected to the structure design and particular operating conditions. The noise source localization can effectively assist with the operating condition monitoring and noise reduction design of the mechanical device. The performance limitation of the array aperture for lower frequency acoustic localization is broken up by the non-synchronous measurement at coprime positions (CP-NSM), while this method has high uncertainty and requires an efficient adaptive regularization method to solve its corresponding acoustic inverse problem. The algorithms under the Bayesian framework with Student-t priors (variational Bayesian approximation and subspace variational Bayesian) are derived and deployed to solve the acoustic inverse problem in the CP-NSM method, and the results are compared with those obtained by the interior-point method and the alternating direction method of the multipliers algorithm. The proposed Bayesian methods have the advantages of adaptive regularization parameter estimation, which can reduce the influence of various interferences in CP-NSM. At the same time, in addition to using simulations and experiments in the anechoic chambers, the proposed Bayesian algorithms are validated further in real industrial applications. The proposed method is applied to improve the noise reduction design of the centrifugal fan, a mechanical device containing more information from noise sources.
暂无评论