检索结果-内蒙古大学图书馆

Approximate membership for regular languages modulo the edit distance

THEORETICAL COMPUTER SCIENCE 2013年 487卷 37-49页

作者： Ndione, Antoine Lemay, Aurelien Niehren, Joachim Lifl Lille France

We present an efficient probabilistic algorithm for testing approximate membership of words to regular languages modulo the edit distance. Our algorithm runs in polynomial time in the size of the input automaton and the inverse error precision in contrast to all previous approaches, and independently of the size of the input word. We also improve a previous approximate membership tester modulo the Hamming distance such that it runs in polynomial complexity time, but with larger polynomials than for the edit distance. (C) 2013 Elsevier B.V. All rights reserved.

关键词： Automata probabilistic algorithms Property testing Regular word languages

来源：评论

学校读者我要写书评

暂无评论

DESIGNING PROGRAMS THAT CHECK THEIR WORK

引用

JOURNAL OF THE ACM 1995年第1期42卷 269-291页

作者： BLUM, M KANNAN, S UNIV PENN DEPT COMP & INFORMAT SCI PHILADELPHIA PA 19104 USA

A program correctness checker is an algorithm for checking the output of a computation. That is, given a program and an instance on which the program is run, the checker certifies whether the output of the program on that instance is correct. This paper defines the concept of a program checker. It designs program checkers for a few specific and carefully chosen problems in the class FP of functions computable in polynomial time. Problems in FP for which checkers are presented in this paper include Sorting, Matrix Rank and GCD. It also applies methods of modern cryptography, especially the idea of a probabilistic interactive proof, to the design of program checkers for group theoretic computations. Two structural theorems are proven here. One is a characterization of problems that can be checked. The other theorem establishes equivalence classes of problems such that whenever one problem in a class is checkable, all problems in the class are checkable.

关键词： algorithms DESIGN RELIABILITY THEORY VERIFICATION INTERACTIVE PROOFS probabilistic algorithms PROGRAM CHECKING PROGRAM VERIFICATION TESTING

来源：评论

学校读者我要写书评

暂无评论

Practical algorithms and lower bounds for similarity search in massive graphs

引用

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING 2007年第5期19卷 585-598页

作者： Fogaras, Daniel Racz, Balazs Google Inc Mountain View CA 94043 USA Budapest Univ Technol & Econ H-1518 Budapest Hungary Hungarian Acad Sci Automat Res Inst H-1518 Budapest Hungary

To exploit the similarity information hidden in the hyperlink structure of the Web, this paper introduces algorithms scalable to graphs with billions of vertices on a distributed architecture. The similarity of multistep neighborhoods of vertices are numerically evaluated by similarity functions including SimRank [1], a recursive refinement of cocitation, and PSimRank, a novel variant with better theoretical characteristics. Our methods are presented in a general framework of Monte Carlo similarity search algorithms that precompute an index database of random fingerprints, and at query time, similarities are estimated from the fingerprints. We justify our approximation method by asymptotic worst-case lower bounds: We show that there is a significant gap between exact and approximate approaches, and suggest that the exact computation, in general, is infeasible for large-scale inputs. We were the first to evaluate SimRank on real Web data. On the Stanford WebBase [2] graph of 80M pages the quality of the methods increased significantly in each refinement step until step four.

关键词： Web search similarity measures graph algorithms probabilistic algorithms

来源：评论

学校读者我要写书评

暂无评论

Nonparametric supervised learning by linear interpolation with maximum entropy

引用

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2006年第5期28卷 766-781页

作者： Gupta, MR Gray, RM Olshen, RA Univ Washington Dept Elect Engn Seattle WA 98195 USA Stanford Univ Informat Syst Lab Dept Elect Engn Stanford CA 94305 USA Stanford Univ Dept Hlth Res & Policy Stanford CA 94305 USA Stanford Univ Dept Stat Stanford CA 94305 USA

Nonparametric neighborhood methods for learning entail estimation of class conditional probabilities based on relative frequencies of samples that are "near-neighbors" of a test point. We propose and explore the behavior of a learning algorithm that uses linear interpolation and the principle of maximum entropy (LIME). We consider some theoretical properties of the LIME algorithm: LIME weights have exponential form;the estimates are consistent;and the estimates are robust to additive noise. In relation to bias reduction, we show that near-neighbors contain a test point in their convex hull asymptotically. The common linear interpolation solution used for regression on grids or look-up-tables is shown to solve a related maximum entropy problem. LIME simulation results support use of the method, and performance on a pipeline integrity classification problem demonstrates that the proposed algorithm has practical value.

关键词： nonparametric statistics probabilistic algorithms pattern recognition maximum entropy linear interpolation

来源：评论

学校读者我要写书评

暂无评论

A mathematical assessment of the isolation random forest method for anomaly detection in big data

引用

MATHEMATICAL METHODS IN THE APPLIED SCIENCES 2023年第1期46卷 1156-1177页

作者： Morales, Fernando A. Ramirez, Jorge M. Ramos, Edgar A. Univ Nacl Colombia Escuela Matemat Antioquia Colombia Oak Ridge Natl Lab Comp Sci & Math Oak Ridge TN USA Carrera 65 59A 110 43-106 Medellin Colombia

We present the mathematical analysis of the Isolation Random Forest Method (IRF Method) for anomaly detection, proposed by Liu F.T., Ting K.M. and Zhou Z. H. in their seminal work as a heuristic method for anomaly detection in Big Data. We prove that the IRF space can be endowed with a probability induced by the Isolation Tree algorithm (iTree). In this setting, the convergence of the IRF method is proved, using the Law of Large Numbers. A couple of counterexamples are presented to show that the method is inconclusive and no certificate of quality can be given, when using it as a means to detect anomalies. Hence, an alternative version of the method is proposed whose mathematical foundation is fully justified. Furthermore, a criterion for choosing the number of sampled trees needed to guarantee confidence intervals of the numerical results is presented. Finally, numerical experiments are presented to compare the performance of the classic method with the proposed one.

关键词： anomaly detection isolation random forest monte carlo methods probabilistic algorithms

来源：评论

学校读者我要写书评

暂无评论

On the complexity of the resolvent representation of some prime differential ideals

引用

JOURNAL OF COMPLEXITY 2006年第3期22卷 396-430页

作者： D'Alfonso, Lisi Jeronimo, Gabriela Solerno, Pablo Univ Buenos Aires Fac Ciencias Exactas & Nat Dept Matemat RA-1428 Buenos Aires DF Argentina

We prove upper bounds on the order and degree of the polynomials involved in a resolvent representation of the prime differential ideal associated with a polynomial differential system for a particular class of ordinary first order algebraic-differential equations arising in control theory. We also exhibit a probabilistic algorithm which computes this resolvent representation within time polynomial in the natural syntactic parameters and the degree of a certain algebraic variety related to the input system. In addition, we give a probabilistic polynomial-time algorithm for the computation of the differential Hilbert function of the ideal. (C) 2005 Elsevier Inc. All rights reserved.

关键词： differential algebra resolvent representation elimination theory probabilistic algorithms straight-line programs differential Hilbert function

来源：评论

学校读者我要写书评

暂无评论

THE PROCESSOR IDENTITY PROBLEM

引用

INFORMATION PROCESSING LETTERS 1990年第2期36卷 91-94页

作者： LIPTON, RJ PARK, A UNIV CALIF DAVIS CALIF PRIMATE RES CTRDIV COMP SCIDAVISCA 95616

This paper investigates the problem of assigning unique identifiers to processors that communicate through shared memory. Solutions to fundamental multiprocessor coordination problems such as the Mutual Exclusion Problem and the Choice Coordination Problem often rely on unique identifiers. We present a probabilistic protocol that solves this Processor Identity Problem for asynchronous processors that communicate through a common shared memory. This protocol requires no central arbiter, and all processors start in exactly the same state. The use of our protocol simplifies shared memory processor design by eliminating the need to encode processor identifiers in system hardware or software structures.

关键词： Computer system initialization parallel algorithms probabilistic algorithms synchronization

来源：评论

学校读者我要写书评

暂无评论

Application of Dempster-Schafer Method in Family-Based Association Studies

引用

IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2013年第4期10卷 1071-1075页

作者： Rajabli, Farid Goktas, Unal Inan, Gul Turgut Ozal Univ Dept Elect & Elect Engn TR-06010 Ankara Turkey Turgut Ozal Univ Dept Comp Engn TR-06010 Ankara Turkey Middle E Tech Univ Dept Stat TR-06800 Ankara Turkey

In experiments designed for family-based association studies, methods such as transmission disequilibrium test require large number of trios to identify single-nucleotide polymorphisms associated with the disease. However, unavailability of a large number of trios is the Achilles' heel of many complex diseases, especially for late-onset diseases. In this paper, we propose a novel approach to this problem by means of the Dempster-Shafer method. The simulation studies show that the Dempster-Shafer method has a promising overall performance, in identifying single-nucleotide polymorphisms in the correct association class, as it has 90 percent accuracy even with 60 trios.

关键词： Algorithm design and analysis biology and genetics knowledge and data engineering tools and techniques probabilistic algorithms

来源：评论

学校读者我要写书评

暂无评论

Communication complexity in a 3-computer model

引用

ALGORITHMICA 1996年第3期16卷 298-301页

作者： Ambainis, A Institute of Mathematics and Computer Science University of Latvia Riga Latvia

It is proved that the probabilistic communication complexity of the identity function in a 3-computer model is O(root n).

关键词： communication complexity probabilistic algorithms error-correcting codes

来源：评论

学校读者我要写书评

暂无评论

BOUNDS ON SAMPLE SPACE SIZE FOR MATRIX PRODUCT VERIFICATION

引用

INFORMATION PROCESSING LETTERS 1993年第2期48卷 87-91页

作者： CHINN, DD SINHA, RK Department of Computer Science and Engineering FR-35 University of Washington Seattle WA 98195 USA

We show that the size of any sample space that could be used in Freivalds' probabilistic matrix product verification algorithm for n x n matrices is at least (n - 1)/epsilon if the probability of error is at most epsilon, matching the upper bound of Kimbrel and Sinha. We also provide a characterization of any sample space for which Freivalds' algorithm has probability of error at most epsilon. We then provide a generalization of Freivalds' algorithm and give a tight lower bound on the time-randomness tradeoff for this class of algorithms.

关键词： ANALYSIS OF algorithms probabilistic algorithms MATRIX PRODUCT

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：