检索结果-内蒙古大学图书馆

Prefix-free regular languages and pattern matching

THEORETICAL COMPUTER SCIENCE 2007年第1-2期389卷 307-317页

作者： Han, Yo-Sub Wang, Yajun Wood, Derick Korea Inst Sci & Technol Intelligence & Interact Res Ctr Seoul 130650 South Korea Hong Kong Univ Sci & Technol Dept Comp Sci Kowloon Hong Kong Peoples R China

We explore the, regular-expression matching problem with respect to prefix-freeness of the pattern. We prove that a prefix-free regular expression gives only a linear number of matching substrings in the size of a given text. Based on this observation, we propose an efficient algorithm for the prefix-free regular-expression matching problem. Furthermore, we suggest an algorithm to determine whether or not a given regular language is prefix-free. (c) 2007 Elsevier B.V. All rights reserved.

关键词： string pattern matching regular-expression matching prefix-free regular languages pruned prefix-free languages

来源：评论

学校读者我要写书评

暂无评论

A Boyer-Moore-style algorithm for regular expression pattern matching

引用

SCIENCE OF COMPUTER PROGRAMMING 2003年第2-3期48卷 99-117页

作者： Watson, BW Watson, RE Univ Pretoria Dept Comp Sci ZA-0002 Pretoria South Africa Eindhoven Univ Technol Dept Comp Sci NL-5600 MB Eindhoven Netherlands

This paper presents a Boyer-Moore-type algorithm for regular expression pattern matching, answering an open problem posed by Aho in 1980 (pattern matching in strings, Academic Press, New York, 1980, p. 342). The new algorithm handles patterns specified by regular expressions-a generalization of the Boyer-Moore and Commentz-Walter algorithms. Like the Boyer-Moore and Commentz-Walter algorithms, the new algorithm makes use of shift functions which can be precomputed and tabulated. The precomputation algorithms are derived, and it is shown that the required shift functions can be precomputed from Commentz-Walter's d(1) and d(2) shift functions. In certain cases, the Boyer-Moore (respectively Commentz-Walter) algorithm has greatly outperformed the Knuth-Morris-Pratt (respectively Aho-Corasick) algorithm (as discussed by Watson in his Ph.D. Thesis, Eindhoven University of Technology, September 1995, and in: N. Ziviani, R. Baeza-Yates, K. Guimaracs (Eds.), Proc. Third South American Workshop on string Processing, International Informatics Series, vol. 4, Carleton University Press, Recife, Brazil, 1996, pp. 280-294). In testing, the algorithm presented in this paper also frequently outperforms the regular expression generalization of the Aho-Corasick algorithm. (C) 2003 Elsevier B.V. All rights reserved.

关键词： string pattern matching regular expressions Boyer-Moore algorithms Commentz-Walter algorithms algorithm generalizations

来源：评论

学校读者我要写书评

暂无评论

Efficient multi-attribute pattern matching

引用

INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS 1998年第1-2期66卷 21-38页

作者： Ando, K Mizobuchi, S Shishibori, M Aoe, J Univ Tokushima Dept Informat Sci & Intelligent Syst Tokushima 770 Japan

This paper describes an efficient multi-attribute pattern matching machine to locate all occurrences of any of a finite number of a sequence of rule structures in a series of input structures. The matching operation of the proposed machine is similar to the method of Aho-Corasick or the method of retrieval using a trie, however, the proposed machine has the following distinctive features: (1) The proposed machine enables us to match set representations containing multiple attributes;(2) It enables us to match separate components;(3) It enables us to match a rule consisting of an exclusive set. In this paper, their features are described in detail. Moreover, the pattern matching algorithm is evaluated by the theoretical observations and the experimental observations that are supported by the simulation results for a variety of rules for document processing as text proofreading, text reduction, and examining a relation between sentences.

关键词： information retrieval string pattern matching multi-attribute pattern matching met representation separate components exclusive set

来源：评论

学校读者我要写书评

暂无评论

An improvement of the Aho-Corasick machine

引用

INFORMATION SCIENCES 1998年第1-4期111卷 139-151页

作者： Ando, K Kinoshita, T Shishibori, M Aoe, J Univ Tokushima Dept Informat Sci & Intelligent Syst Tokushima 770 Japan

Aho and Corasick presented a string pattern matching machine to locate multiple keywords. However, the AC machine could not match multi-attribute information. This paper describes an efficient multi-attribute pattern matching machine to locate all occurrences of any of a finite number of the sequence of rule structures (called matching rules) in a sequence of input structures. The proposed algorithm enables us to match set representations containing multiple attributes. Therefore, confirming transition is decided by the relationship, whether the input structure includes the rule structure or not. Finally, the pattern matching algorithm is evaluated by theoretical analysis and the evaluation is supported by the simulation results with rules for the extraction of keywords. (C) 1998 Elsevier Science Inc. All rights reserved.

关键词： string pattern matching multi-attribute pattern matching set representation finite state pattern matching machine matching algorithm constructing algorithm

来源：评论

学校读者我要写书评

暂无评论

An incremental algorithm for string pattern matching machines

引用

INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS 1995年第1-2期58卷 33-42页

作者： Tsuda, K Fuketa, M Aoe, JI [a] Department of Information Science & Intelligent Systems The University of Tokushima Minami-Josanjima-Cho Tokushima-Shi Japan

Aho and Corasick presented a string pattern matching machine (hereafter called machine AC) to locate multiple keywords. However, the machine AC must be reconstructed all over again when a keyword is appended. This paper proposes an efficient algorithm to append a keyword for the machine AC. This paper presents the time efficiency comparison with the original algorithm using the actual simulation results. The simulation results show the speed up factor, by the algorithm proposed, to be between 25 and 270 fold when compared with the original algorithm by Aho and Corasick which requires the reconstruction of the entire machine AC.

关键词： string pattern matching append a keyword bibliographic search text-editing information retrieval

来源：评论

学校读者我要写书评

暂无评论

ON THE SUBTREE ISOMORPHISM-PROBLEM FOR ORDERED TREES

引用

INFORMATION PROCESSING LETTERS 1989年第5期32卷 271-273页

作者： MAKINEN, E University of Tampere Department of Computer Science P.O. Box 607 SF-33101 Tampere Finland

In the subtree isomorphism problem, given 2 rooted trees T subscript 1 and T subscript 2, a determination is made as to whether T subscript 1 is isomorphic to any subtree of T subscript 2. A tree is considered ordered if the relative order of its subtrees in each node is fixed. It is shown that a O(m+n) time algorithm can be obtained when dealing only with ordered trees. The algorithm is based on tree encoding and on string pattern matching. The subtree isomorphism problem has connections with the tree pattern matching problem, in which nodes are labeled and the trees may contain special symbols that stand for arbitrary trees. While several algorithms already exist to reduce the tree pattern matching problem to string matching, algorithms with a time complexity of less than O(mn) are obtained only in some special cases.

关键词： Ordered tree tree isomorphism Zasks sequence string pattern matching

来源：评论

学校读者我要写书评

暂无评论

ON SYMMETRY DETECTION

引用

IEEE TRANSACTIONS ON COMPUTERS 1985年第7期34卷 663-666页

作者： ATALLAH, MJ Department of Computer Sciences Purdue University Abstract Authors References Cited By Keywords Metrics Similar Download Citation Email Print Request Permissions

A straight line is an axis ofsymmetry of a planar figure if the figure is invariant to reflection with respect to that line. The purpose of this correspondence is to describe an O( n log n) time algorithm for enumerating all the axes of symmetry of a planar figure which is made up of (possibly intersecting) segments, circles, points, etc. The solution involves a reduction of the problem to a combinatorial question on words. Our algorithm is optimal since we can establish an Ω(n log n) time lower bound for this problem.

关键词： Analysis of algorithms axis of symmetry centroid computational geometry string pattern matching

来源：评论

学校读者我要写书评

暂无评论

引用

INTERNATIONAL JOURNAL OF COMPUTER & INFORMATION SCIENCES 1984年第4期13卷 279-290页

作者： ATALLAH, MJ 1. Department of Computer Sciences Purdue University 47907 West Lafayette Indiana

Two planar figures aresimilar if a scaled version of one of them can be moved so that it coincides with the second figure. The problem of checking whether two planar figures are similar is relevant to both computational geometry and pattern recognition. An efficient algorithm is known for checking whether two polygonsP andQ are similar⁽¹⁾ The purpose of this note is to give an efficient algorithm for checking whether two planar figuresP andQ are similar when the figures are no longer constrained to be polygons. We give anO(n logn) time algorithm for solving this problem when each figure consists of a collection of (possibly intersecting) straight line segments, circles, and ellipses. Our algorithm can easily be modified for figures which include other geometric patterns as well. We also prove that our algorithm is optimal.

关键词： Analysis of algorithms computational geometry string pattern matching

来源：评论

学校读者我要写书评

暂无评论

EFFICIENT string matching - AID TO BIBLIOGRAPHIC SEARCH

引用

COMMUNICATIONS OF THE ACM 1975年第6期18卷 333-340页

作者： AHO, AV CORASICK, MJ BELL TEL LABS INC MURRAY HILL NJ 07974 USA

This paper describes a simple, efficient algorithm to locate all occurrences of any of a finite number of keywords in a string of text. The algorithm consists of constructing a finite state pattern matching machine from the keywords and then using the pattern matching machine to process the text string in a single pass. Construction of the pattern matching machine takes time proportional to the sum of the lengths of the keywords. The number of state transitions made by the pattern matching machine in processing the text string is independent of the number of keywords. The algorithm has been used to Improve the speed of a library bibliographic search program by a factor of 5 to 10. [ABSTRACT FROM AUTHOR]

关键词： bibliographic search computational complexity finite state machines information retrieval keywords and phrases string pattern matching text-editing

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：