High triangle density-the graph property stating that a constant fraction of two-hop paths belongs to a triangle-is a common signature of social networks. This paper studies triangle-dense graphs from a structural per...
详细信息
High triangle density-the graph property stating that a constant fraction of two-hop paths belongs to a triangle-is a common signature of social networks. This paper studies triangle-dense graphs from a structural perspective. We prove constructively that significant portions of a triangle-dense graph are contained in a disjoint union of dense, radius 2 subgraphs. This result quantifies the extent to which triangle-dense graphs resemble unions of cliques. We also show that our algorithm recovers planted clusterings in approximation-stable k-median instances.
The current landscape of balanced graph partitioning is divided into high-quality but expensive multilevel algorithms and cheaper approaches with linear running time, such as single-level algorithms and streaming algo...
详细信息
We present an improved algorithm for maintaining all-pairs (1 + epsilon) approximate shortest paths under deletions and weight-increases. The previous state of the art for this problem is total update time (O) over ti...
详细信息
We present an improved algorithm for maintaining all-pairs (1 + epsilon) approximate shortest paths under deletions and weight-increases. The previous state of the art for this problem is total update time (O) over tilde (n(2)root m/epsilon) over all updates for directed unweighted graphs [ S. Baswana, R. Hariharan, and S. Sen, J. algorithms, 62 (2007), pp. 74-92], and (O) over tilde (mn/epsilon) for undirected unweighted graphs [ L. Roditty and U. Zwick, in Proceedings of the 45th FOCS, Rome, Italy, 2004, pp. 499-508]. Both algorithms are randomized and have constant query time. Very recently, Henzinger, Krinninger, and Nanongkai presented a deterministic version of the latter algorithm [ M. Henzinger, S. Krinninger, and D. Nanongkai, in IEEE FOCS, 2013, pp. 538-547]. Note that (O) over tilde (mn) is a natural barrier because even with a (1 + epsilon) approximation, there is no o(mn) combinatorial algorithm for the static all-pairs shortest path problem. Our algorithm works on directed weighted graphs and has total (randomized) update time (O) over tilde (mn log R/epsilon) where R is the ratio of the largest edge weight to appear at any point in the update sequence to the smallest such weight. (As with previous algorithms, our query time is constant.) Technically, the running time is (O) over tilde (mn log R/epsilon) + O(Delta), where Delta is the total number of updates;the same O(Delta) term is also implicitly present in all other algorithms for the problem, since a constant time per update is clearly unavoidable. Note that log R = O(log(n)) as long as weights are polynomial in n;thus, we effectively expand the (O) over tilde (mn/epsilon) total update time bound from undirected unweighted graphs to directed graphs with polynomial weights. This is in fact the first nontrivial algorithm for decremental all-pairs shortest paths that works on weighted graphs (previous algorithms could only handle small integer weights). By a well-known reduction from decremental algo
algorithms in the data stream model use $O(polylog(N))$ space to compute someproperty of an input of size $N$, and many of these algorithms are implementedand used in practice. However, sketching algorithms in the gra...
详细信息
A graph G = (V, E) is said to be monopolar if its vertex set admits a partition V = (C (Equation presented) I) where G[C] is a cluster graph and I is an independent set in G. Monopolar graphs generalize both bipartite...
详细信息
A graph G = (V, E) is said to be monopolar if its vertex set admits a partition V = (C (Equation presented) I) where G[C] is a cluster graph and I is an independent set in G. Monopolar graphs generalize both bipartite graphs and split graphs, and they have been extensively studied from both graph-theoretic and algorithmic points of view. In this work we focus on the problem Monopolar Recognition of deciding whether a given graph is monopolar. Monopolar Recognition is known to be solvable in polynomial time in certain classes of graphs such as cographs and claw-free graphs, and to be NP-hard in various restricted classes such as subcubic planar graphs. We initiate the study of exact exponential-time algorithms for Monopolar Recognition and allied problems. We design an algorithm that solves Monopolar Recognition in O★(1.3734n) time on input graphs with n vertices. In fact we solve the more general problems Monopolar Extension and List Monopolar Partition, which were introduced in the literature as part of the study of graph monopolarity, in O★(1.3734n) time. We also design fast parameterized algorithms for Monopolar Recognition using two notions of distance from triviality as the parameters. Our FPT algorithms solve Monopolar Recognition in O★(3.076kv) and O★(2.253ke) time, where kv and ke are, respectively, the sizes of the smallest claw-free vertex and edge deletion sets of the input graph. These results are a significant addition to the small number of FPT algorithms currently known for Monopolar Recognition. Le and Nevries have shown that if a graph G is chair-free, then an instance (G, C′) of Monopolar Extension can be solved in polynomial time for any subset C′ of its vertices. We significantly generalize this result;we show that we can solve instances (G, C′) of Monopolar Extension in polynomial time for arbitrary graphs G and any chair-free vertex deletion set C′ of G. This result is the starting point of all our fast algorithms for Monopolar Extension, and w
Many distributed computing systems have been developed recently for implementing graph based algorithms such as PageRank over large-scale graph-structured datasets such as social networks. Performance of these systems...
详细信息
Many distributed computing systems have been developed recently for implementing graph based algorithms such as PageRank over large-scale graph-structured datasets such as social networks. Performance of these systems significantly suffers from communication bottleneck as a large number of messages are exchanged among servers at each step of the computation. Motivated by graph based MapReduce, we propose a coded computing framework that leverages computation redundancy to alleviate the communication bottleneck in distributed graph processing. As a key contribution of this work, we develop a novel coding scheme that systematically injects structured redundancy in the computation phase to enable coded multicasting opportunities during message exchange between servers, reducing the communication load substantially in large-scale graph processing. For theoretical analysis, we consider random graph models, and focus on schemes in which subgraph allocation and Reduce allocation are only dependent on vertex ID while the Shuffle design varies with graph connectivity. Specifically, we prove that our proposed scheme enables an (asymptotically) inverse-linear trade-off between computation load and average communication load for two popular random graph models - Erdos-Renyi model, and power law model. Particularly, for a given computation load r, (i.e. when each graph vertex is carefully stored at r servers), the proposed scheme slashes the average communication load by (nearly) a multiplicative factor of r. Furthermore, for the Erdos-Renyi model, we prove that our proposed scheme is optimal asymptotically as the graph size increases by providing an information-theoretic converse. To illustrate the benefits of our scheme in practice, we implement PageRank over Amazon EC2, using artificial as well as real-world datasets, demonstrating gains of up to 50.8% in comparison to the conventional PageRank implementation. Additionally, we specialize our coded scheme and extend our theore
We provide evidence that nonlinear dimensionality reduction, clustering, and data set parameterization can be solved within one and the same framework. The main idea is to define a system of coordinates with an explic...
详细信息
We provide evidence that nonlinear dimensionality reduction, clustering, and data set parameterization can be solved within one and the same framework. The main idea is to define a system of coordinates with an explicit metric that reflects the connectivity of a given data set and that is robust to noise. Our construction, which is based on a Markov random walk on the data, offers a general scheme of simultaneously reorganizing and subsampling graphs and arbitrarily shaped data sets in high dimensions using intrinsic geometry. We show that clustering in embedding spaces is equivalent to compressing operators. The objective of data partitioning and clustering is to coarse-grain the random walk on the data while at the same time preserving a diffusion operator for the intrinsic geometry or connectivity of the data set up to some accuracy. We show that the quantization distortion in diffusion space bounds the error of compression of the operator, thus giving a rigorous justification fork-means clustering in diffusion space and a precise measure of the performance of general clustering algorithms.
In this paper, we present an experimental comparison of various graph-based approximate nearest neighbor (ANN) search algorithms deployed on edge devices for real-time nearest neighbor search applications, such as sma...
详细信息
A potential maximal clique of a graph is a vertex set that induces a maximal clique in some minimal triangulation of that graph. It is known that if these objects can be listed in polynomial time for a class of graphs...
详细信息
A potential maximal clique of a graph is a vertex set that induces a maximal clique in some minimal triangulation of that graph. It is known that if these objects can be listed in polynomial time for a class of graphs, the treewidth and the minimum fill-in are polynomially tractable for these graphs. We show here that the potential maximal cliques of a graph can be generated in polynomial time in the number of minimal separators of the graph. Thus, the treewidth and the minimum fill-in are polynomially tractable for all classes of graphs with a polynomial number of minimal separators. (C) 2002 Elsevier Science B.V. All rights reserved.
We introduce a concept of similarity between vertices of directed graphs. Let CA and G(B) be two directed graphs with, respectively, n(A) and n(B) vertices. We define an n(B) x n(A) similarity matrix S whose real entr...
详细信息
We introduce a concept of similarity between vertices of directed graphs. Let CA and G(B) be two directed graphs with, respectively, n(A) and n(B) vertices. We define an n(B) x n(A) similarity matrix S whose real entry s(ij) expresses how similar vertex j (in G(A)) is to vertex i (in G(B)): we say that s(ij) is their similarity score. The similarity matrix can be obtained as the limit of the normalized even iterates of Sk+1 = BS(k)A(T) + B(T)S(k)A, where A and B are adjacency matrices of the graphs and So is a matrix whose entries are all equal to 1. In the special case where G(A) = G(B) = G, the matrix S is square and the score s(ij) is the similarity score between the vertices i and j of G. We point out that Klemberg's "hub and authority" method to identify web-pages relevant to a given query can be viewed as a special case of our definition in the case where one of the graphs has two vertices and a unique directed edge between them. In analogy to Kleinberg, we show that our similarity scores are given by the components of a dominant eigenvector of a nonnegative matrix. Potential applications of our similarity concept are numerous. We illustrate an application for the automatic extraction of synonyms in a monolingual dictionary.
暂无评论