Eigenvector computation such as Singular Value Decomposition (SVD) is one of the most fundamental problems in machine learning, optimization and numerical linear algebra. In recent years, many stochastic variance redu...
详细信息
Eigenvector computation such as Singular Value Decomposition (SVD) is one of the most fundamental problems in machine learning, optimization and numerical linear algebra. In recent years, many stochastic variance reduction algorithms and randomized coordinate descent algorithms have been developed to efficiently solve the leading eigenvalue problem. By taking full advantage of both variance reduction and randomized coordinate descent techniques, this paper proposes a novel Semi-stochastic Block Coordinate Descent algorithm (SBCD-SVD), which is more suitable than existing algorithms for large-scale leading eigenvalue problems of SVD, and can obtain linear convergence. Unlike existing stochastic variance reduction and randomized coordinate descent methods, our algorithm inherits their advantages. Moreover, we propose a new Asynchronous parallel Semi-stochastic Block Coordinate Descent algorithm (ASBCD-SVD) and one new Asynchronous parallel Sparse approximated Variance Reduction algorithm (ASVR-SVD) for large-scale dense and sparse datasets, respectively. Finally, we prove that both dense and sparse asynchronous parallel variants can converge linearly. Extensive experimental results show that our algorithms attain high parallel speedup and achieve almost the same performance with significantly shorter time, and thus they can be widely used in various practice applications.
This paper introduces the d-distance matching problem, in which we are given a bipartite graph G = (S, T;E) with S = {s(1),..., s(n)}, a weight function on the edges and an integer d epsilon Z(+). The goal is to find ...
详细信息
This paper introduces the d-distance matching problem, in which we are given a bipartite graph G = (S, T;E) with S = {s(1),..., s(n)}, a weight function on the edges and an integer d epsilon Z(+). The goal is to find a maximum-weight subset M subset of E of the edges satisfying the following two conditions: (i) the degree of every node of S is at most one in M, (ii) if s(i) t, s (j) t. M, then vertical bar j - i vertical bar= d. This question arises naturally, for example, in various scheduling problems. We show that the problem is NP-complete in general and admits a simple 3-approximation. We give an FPT algorithm parameterized by d and also showthat the casewhen the size of T is constant can be solved in polynomial time. From an approximability point of view, we show that the integrality gap of the natural integer programming model is at most 2- 1/2d-1, and give an LP-based approximation algorithm for the weighted case with the same guarantee. A combinatorial (2- 1/d)-approximation algorithm is also presented. Several greedy approaches are considered, and a local search algorithm is described that achieves an approximation ratio of 3/2 + epsilon for any constant epsilon > 0 in the unweighted case. The novel approaches used in the analysis of the integrality gap and the approximation ratio of locally optimal solutions might be of independent combinatorial interest.
The TICER (TIme-Constant Equilibration Reduction) algorithm is a well-known resistor-capacitor (RC) network reduction algorithm. It finds wide applications in integrated circuit post-layout simulation tools. However, ...
详细信息
The TICER (TIme-Constant Equilibration Reduction) algorithm is a well-known resistor-capacitor (RC) network reduction algorithm. It finds wide applications in integrated circuit post-layout simulation tools. However, the original algorithm is one-dimensional in that each step eliminates one circuit node to obtain an approximately equivalent circuit by connecting additional elements to the neighboring nodes after each elimination. This work extends the TICER algorithm to its high-dimensional version in the sense that each step eliminates a subcircuit as a whole to obtain an approximately equivalent circuit, again by connecting extra elements to the neighboring nodes after each elimination. In practice the high-dimensional TICER (HD-TICER) algorithm finds many advantages over the classical one-dimensional TICER (1D-TICER) algorithm, which is a special case of the HD-TICER algorithm. An elegant mathematical derivation of the HD-TICER algorithm is provided by applying the notion of driving point impedance (DPI). The advantages of the HD-TICER algorithm are demonstrated by application to reductions of some purely resistive networks and RC networks. An approximate time constant estimation method is also provided for selection of a subblock circuit to eliminate.
Faber polynomial based approximations of time dependent exponential integrators for the numerical solution of Maxwell's equations based on explicit algorithms are extended for the inclusion of nonlinear media so t...
详细信息
Faber polynomial based approximations of time dependent exponential integrators for the numerical solution of Maxwell's equations based on explicit algorithms are extended for the inclusion of nonlinear media so that multiphysical problems can be considered. For this purpose, appropriate exponential integrators, such as Lawson type and Rosenbrock type integrators are reviewed and investigated. The formulation with regard to Rosenbrock type integrators is extended to develop exponential integrators allowing the straightforward implementation of the resulting explicit algorithm and the application of large time step sizes opening up the potential for a parallel implementation. The Faber polynomial based approach for the approximation of the exponential integrator is utilized and as in the case of linear material models has an order of magnitude higher efficiency than comparable conventional methods, especially, when a high accuracy is required.
The minimum dominating set is an important NP-hard problem in graph theory, widely used in various fields such as computer network layout and social networks. Although linear programming algorithms can obtain a global...
详细信息
The minimum dominating set is an important NP-hard problem in graph theory, widely used in various fields such as computer network layout and social networks. Although linear programming algorithms can obtain a global optimal solution on small graphs, they are not suitable for large graphs. To find a local optimal solution, many heuristic algorithms have been proposed. These algorithms require strong techniques and programming, and most of them still need improvement in performance on large graphs. This paper presents a cooperative- competitive model that transforms the discrete problem into a continuously differentiable problem. By using the gradient descent method, this approach can handle all vertices simultaneously, and its time complexity is linear. Compared to existing search algorithms, this method is based on a straightforward principle and is fast. For small graphs, it can achieve performance almost comparable to linear programming methods. For large graphs, this method can explore more diverse local optimal solutions in the same amount of time, making it easier to obtain better performance. In this paper, a comparison and analysis of this gradient-based algorithm and graph theory-based search algorithms are conducted on the minimum dominating set, allowing researchers to have a clearer understanding of the advantages and disadvantages of search algorithms and gradient methods in complex problems. The referenced search algorithms may potentially improve the gradient descent algorithm, leading to better solutions in other fields. At the same time, gradient-based methods can be used to solve many similar NP-hard problems in graph theory.
This article gives a short overview of my dissertation, where new algorithms are given for two fundamental graph problems. We develop novel ways of using linear programming formulations, even exponential-sized ones, t...
详细信息
This article gives a short overview of my dissertation, where new algorithms are given for two fundamental graph problems. We develop novel ways of using linear programming formulations, even exponential-sized ones, to extract structure from problem instances and to guide algorithms in making progress. The first part of the dissertation addresses a benchmark problem in combinatorial optimization: the asymmetric traveling salesman problem (ATSP). It consists in finding the shortest tour that visits all vertices of a given edge-weighted directed graph. A.-approximation algorithm for ATSP is one that runs in polynomial time and always produces a tour at most. times longer than the shortest tour. Finding such an algorithm with constant rho had been a long-standing open problem. Here we give such an algorithm. The second part of the dissertation addresses the perfect matching problem. We have known since the 1980s that it has efficient parallel algorithms if the use of randomness is allowed. However, we do not know if randomness is necessary - that is, whether the matching problem is in the class NC. We show that it is in the class quasi-NC. That is, we give a deterministic parallel algorithm that runs in poly-logarithmic time on quasi-polynomially many processors.
This article considers message and energy-efficient distributed algorithms for the SETCOVER Problem. Given a ground set U of n elements and a set S of m subsets of U, we aim to find the minimal number of these subsets...
详细信息
This article considers message and energy-efficient distributed algorithms for the SETCOVER Problem. Given a ground set U of n elements and a set S of m subsets of U, we aim to find the minimal number of these subsets that contain all elements. In the default distributed setup of this problem, each set has a bidirected communication link with each element it contains. This results in a communication graph with n + m nodes and degree A. The value A denotes the maximal degree of the communication graph, i.e., the maximum of all subsets' sizes and the maximum number of sets an element is contained *** present SETCOVER algorithm in the BEEPING model that only relies on carrier-sensing. In each synchronous time step, a node can either listen to the channel or beep. A listening node learns if one or more of its neighbors beeped or if none of its neighbors beeped. In particular, it neither learns which neighbors beeped nor how many neighbors beeped exactly. Given this model, we present an algorithm that runs in O(k3) time and has an expected approximation ratio of O (A3/k log2 A). The value k is an element of [3, log A] is a parameter that lets us trade runtime for approximation ratio similar to the celebrated algorithm by Kuhn and Wattenhofer [14]. Our next result is a O (k2) -time and O tilde (A12+1k (n +m)) -message algorithm (where O tilde (middot) hides polylogarithmic factors) with expected approximation ratio of O (A k log A) in the K T0-CONGEST model. In this variant of the well-known CONGEST model, 1 time proceeds in synchronous rounds, and each node can send a distinct message of size O (log (n + m)) to each of its neighbors. Further, each node has a unique identifier of size O (log (n + m)). However, the crucial aspect of K T0-CONGEST is that the nodes do not know their neighbors' identifiers. Our algorithm is almost optimal concerning time and message complexity as we can show that there are hard instances that require S2(A12 -Em) messages for a constant appr
We employ Difference of Log-Sum-Exp neural networks to generate a data-driven feedback controller based on Model Predictive Control (MPC) to track a given reference trajectory. By using this class of networks to appro...
详细信息
We employ Difference of Log-Sum-Exp neural networks to generate a data-driven feedback controller based on Model Predictive Control (MPC) to track a given reference trajectory. By using this class of networks to approximate the MPC-related cost function subject to the given system dynamics and input constraint, we avoid two of the main bottlenecks of classical MPC: the availability of an accurate model for the system being controlled, and the computational cost of solving the MPC-induced optimization problem. The former is tackled by exploiting the universal approximation capabilities of this class of networks. The latter is alleviated by making use of the difference-of-convex-functions structure of these networks. Furthermore, we show that the system driven by the MPC-neural structure is practically stable.
The degree of dissimilarity between genome sequences of homologous species is a measure of the evolutionary distance between them. It serves as a metric in the construction of phylogenetic trees, which depict the evol...
详细信息
The degree of dissimilarity between genome sequences of homologous species is a measure of the evolutionary distance between them. It serves as a metric in the construction of phylogenetic trees, which depict the evolutionary relationships and common ancestry among different species. Given two genome sequences, evolutionary distance is determined by estimating the number of global mutations that transform one sequence to the other. The computation of the evolutionary distance is done by modelling a genome with the corresponding permutation. Global rearrangement operations such as transposition that model a particular genomic mutation are studied by employing a combinatorial structure known as a cycle graph of the corresponding permutation. A cycle in a cycle graph that has odd length is called an odd cycle. In the context of the problem of sorting by transpositions (SBT), a valid 2-move is a transposition that increases the number of odd cycles in the cycle graph by two. A super oriented cycle (SOC) is an odd cycle C where C and one of the resultant cycles admit valid 2-moves. The minimum number of mutations required to transform a species S into a related species T is the distance from S to T under that mutation. Christie opined that characterizing SOCs will improve the lower bound of the transposition distance. We characterize super oriented cycles. Equivalent transformations on permutations like reduction and (g,b)-split preserve the transposition distance of a given permutation and map SBT to the corresponding SBT on a transformed simpler permutation. We introduce merge, a novel equivalent transformation. These results have applications in computing transposition and other distances between related species.
Expectation propagation (EP) attains near-optimal performance for massive multiple-input multiple-output (MIMO) detection. However, the inevitable matrix inversions and exponentiations at each EP iteration bring great...
详细信息
Expectation propagation (EP) attains near-optimal performance for massive multiple-input multiple-output (MIMO) detection. However, the inevitable matrix inversions and exponentiations at each EP iteration bring great challenges to realistic hardware implementation. To address these issues, a low-complexity EP with iterative Neumann-series approximation (EP-INSA) detector is proposed by employing INSA to estimate the inverse matrices. Further approximations are applied to avoid the exponentiations, which makes EP-INSA an efficient, feasible, and hardware-friendly detector for massive MIMO with various modulations. Simulation results show that EP-INSA attains similar performance as exact EP with only a few INSA terms, which assures enhanced performance and complexity trade-off. The associated hardware architectures of EP-INSA detector are also presented. The implementation results on 65 nm CMOS technology show that our design yields a throughput of 0.62 Gb/s with 2.63x area efficiency of existing EP detector.
暂无评论