Many efficient string matching algorithms make use of q-grams and process the text in windows which are read backward. In this paper we provide a framework for analyzing the average case complexity of these algorithms...
详细信息
Many efficient string matching algorithms make use of q-grams and process the text in windows which are read backward. In this paper we provide a framework for analyzing the average case complexity of these algorithms taking into account the statistical dependencies between overlapping q-grams. We apply this to the q-gram Boyer-Moore-Horspool algorithm adapted to various string matching problems and show that the algorithm is optimal on average. (C) 2012 Elsevier B.V. All rights reserved.
It is well known that the averagecase deterministic communication complexity is bounded below by an entropic quantity, which one would now call deterministic information complexity. In this paper we show a correspond...
详细信息
It is well known that the averagecase deterministic communication complexity is bounded below by an entropic quantity, which one would now call deterministic information complexity. In this paper we show a corresponding upper bound. We also improve known lower bounds for the public coin Las Vegas communication complexity by a constant factor. (c) 2006 Elsevier B.V. All rights reserved.
The theory of average case complexity studies the expected complexity of computational tasks under various specific distributions on the instances, rather than their worst casecomplexity. Thus, this theory deals with...
详细信息
The theory of average case complexity studies the expected complexity of computational tasks under various specific distributions on the instances, rather than their worst casecomplexity. Thus, this theory deals with distributional problems, defined as pairs each consisting of a decision problem and a probability distribution over the instances. While for applications utilizing hardness, such as cryptography, one seeks an efficient algorithm that outputs random instances of some problem that are hard for any algorithm with high probability, the resulting hard distributions in these cases are typically highly artificial, and do not establish the hardness of the problem under "interesting" or "natural" distributions. This paper studies the possibility of proving generic hardness results (i.e., for a wide class of NP-complete problems), under "natural" distributions. Since it is not clear how to define a class of "natural" distributions for general NP-complete problems, one possibility is to impose some strong computational constraint on the distributions, with the intention of this constraint being to force the distributions to "look natural". Levin, in his seminal paper on average case complexity from 1984, defined such a class of distributions, which he called P-computable distributions. He then showed that the NP-complete Tiling problem, under some P-computable distribution, is hard for the complexity class of distributional NP problems (i.e. NP with P-computable distributions). However, since then very few NP-complete problems (coupled with P-computable distributions), and in particular " natural" problems, were shown to be hard in this sense. In this paper we show that all natural NP-complete problems can be coupled with P-computable distributions such that the resulting distributional problem is hard for distributional NP.
Alongside the effort underway to build quantum computers, it is important to better understand which classes of problems they will find easy and which others even they will find intractable. We study random ensembles ...
详细信息
Alongside the effort underway to build quantum computers, it is important to better understand which classes of problems they will find easy and which others even they will find intractable. We study random ensembles of the QMA(1)-complete quantum satisfiability (QSAT) problem introduced by Bravyi [1]. QSAT appropriately generalizes the NP-complete classical satisfiability (SAT) problem. We show that, as the density of clauses/projectors is varied, the ensembles exhibit quantum phase transitions between phases that are satisfiable and unsatisfiable. Remarkably, almost all instances of QSAT for any hypergraph exhibit the same dimension of the satisfying manifold. This establishes the QSAT decision problem as equivalent to a, potentially new, graph theoretic problem and that the hardest typical instances are likely to be localized in a bounded range of clause density.
This is an errata for our STOC'06 paper, "On Basing One-Way Functions on NP-Hardness".There is a gap in the proof of our results regarding adaptive reductions, and we currently do not know whether Theore...
详细信息
ISBN:
(纸本)9781450300506
This is an errata for our STOC'06 paper, "On Basing One-Way Functions on NP-Hardness".There is a gap in the proof of our results regarding adaptive reductions, and we currently do not know whether Theorem 3 (as stated in Section 2) holds.
In the present paper, the authors reject one of Knuth's well known results on average case complexity in replacement (i.e. selection) sort and hence challenge the robustness of averagecomplexity measures where th...
详细信息
In the present paper, the authors reject one of Knuth's well known results on average case complexity in replacement (i.e. selection) sort and hence challenge the robustness of averagecomplexity measures where the response variable is sensitive to ties. (c) 2007 Elsevier Inc. All rights reserved.
The present paper argues that it suffices for an algorithmic time complexity measure to be system invariant rather than system independent (which means predicting from the desk). (C) 2007 Elsevier Inc. All rights rese...
详细信息
The present paper argues that it suffices for an algorithmic time complexity measure to be system invariant rather than system independent (which means predicting from the desk). (C) 2007 Elsevier Inc. All rights reserved.
We investigate the average-case state and transition complexity of deterministic and nondeterministic finite automata, when choosing a finite language of a certain "size" n uniformly at random from all finit...
详细信息
We investigate the average-case state and transition complexity of deterministic and nondeterministic finite automata, when choosing a finite language of a certain "size" n uniformly at random from all finite languages of that particular size. Here size means that all words of the language are either of length n, or of length at most n. It is shown that almost all deterministic finite automata accepting finite languages over a binary input alphabet have state complexity theta(2(n)/n) while nondeterministic finite automata are shown to perform better, namely the nondeterministic state complexity is in theta(root 2(n)). Interestingly, in both cases the aforementioned bounds are asymptotically like in the worst case. However, the nondeterministic transition complexity is shown to be again theta(2(n)/n). The case of unary finite languages is also considered. Moreover, we develop a framework that allows us to investigate the average-casecomplexity of operations like, e.g., union, intersection, complementation, and reversal, on finite languages in this setup. (C) 2007 Published by Elsevier B.V.
mRNA molecules are folded in the cells and therefore many of their substrings may actually be inaccessible to protein and microRNA binding. The need to apply an accessibility criterion to the task of genome-wide mRNA ...
详细信息
ISBN:
(纸本)3540332952
mRNA molecules are folded in the cells and therefore many of their substrings may actually be inaccessible to protein and microRNA binding. The need to apply an accessibility criterion to the task of genome-wide mRNA motif discovery raises the challenge of overcoming the core O(n(3)) factor imposed by the time complexity of the currently best known algorithms for RNA secondary structure prediction. We speed up the dynamic programming algorithms that are standard for RNA folding prediction. Our new approach significantly reduces the computations without sacrificing the optimality of the results, yielding an expected time complexity of O(n(2)psi(n)), where psi(n) is shown to be constant on average under standard polymer folding models. A benchmark analysis confirms that in practice the runtime ratio between the previous approach and the new algorithm indeed grows linearly with increasing sequence size. The fast new RNA folding algorithm is utilized for genome-wide discovery of accessible cis-regulatory motifs in data sets of ribosomal densities and decay rates of S. cerevisiae genes and to the mining of exposed binding sites of tissue-specific microRNAs in A. thaliana.
Using the cavity equations of Mezard, Parisi, and Zecchina [Science 297 (2002), 812;Mezard and Zecchina, Phys Rev E 66 (2002), 056126] we derive the various threshold values for the number of clauses per variable of t...
详细信息
Using the cavity equations of Mezard, Parisi, and Zecchina [Science 297 (2002), 812;Mezard and Zecchina, Phys Rev E 66 (2002), 056126] we derive the various threshold values for the number of clauses per variable of the random K-satisfiability problem, generalizing the previous results to K >= 4. We also give an analytic solution of the equations, and some closed expressions for these thresholds, in an expansion around large K. The stability of the solution is also computed. For any K, the satisfiability threshold is found to be in the stable region of the solution, which adds further credit to the conjecture that this computation gives the exact satisfiability threshold. (c) 2005 Wiley Periodicals, Inc.
暂无评论