检索结果-内蒙古大学图书馆

International Conference on Innovations in Information Technology

作者： Klaib, Ahmad Fadel Osborne, Hugh Univ Huddersfield Dept Informat Huddersfield HD1 3DH W Yorkshire England

ISBN: (纸本)9781424456987

Huge amounts of biological data are stored in linear files. Biological proteins are sequences of amino acids. The quantities of data in these fields tend to increase year on year. string matching algorithms play a key role in many computer science problems, and in the implementation of computer software. For this reason efficient string-matching algorithms should be used which use minimal computer storage and which minimize the searching response time. In this study, we propose a new algorithm called the Random string Matching Algorithm (RSMA). RSMA combines our enhanced preprocessing phase from the Berry Ravindran algorithm with our proposed new searching phase procedure. This variety of searching order allows our proposed algorithm to reduce the number of comparison characters and enhances the searching response time. Experimental results show that the RSMA algorithm offers a smaller number of comparisons and offers improved elapsed searching time when compared to other well-known algorithms.

关键词： string searching algorithms

来源：评论

学校读者我要写书评

暂无评论

Variable-Stride Multi-Pattern Matching For Scalable Deep Packet Inspection

Variable-Stride Multi-Pattern Matching For Scalable Deep Pac...

引用

IEEE INFOCOM Conference 2009

作者： Hua, Nan Song, Haoyu Lakshman, T. V. Georgia Inst Technol Coll Comp Atlanta GA 30332 USA Alcatel Lucent Bell Labs Boulogne France

ISBN: (纸本)9781424435128

Accelerating multi-pattern matching is a critical issue in building high-performance deep packet inspection systems. Achieving high-throughputs while reducing both memory-usage and memory-bandwidth needs is inherently difficult. In this paper, we propose a pattern (string) matching algorithm that achieves high throughput while limiting both memory-usage and memory-bandwidth. We achieve this by moving away from a byte-oriented processing of patterns to a block-oriented scheme. However, different from previous block-oriented approaches, our scheme uses variable-stride blocks. These blocks can be uniquely identified in both the pattern and the input stream, hence avoiding the multiplied memory costs which is intrinsic in previous approaches. We present the algorithm, tradeoffs, optimizations, and implementation details. Performance evaluation is done using the Snort and ClamAV pattern sets. Using our algorithm, the throughput of a single search engine can easily have a many-fold increase at a small storage cost, typically less than three bytes per pattern character.

关键词： string searching algorithms

来源：评论

学校读者我要写书评

暂无评论

Fast variants of the backward-oracle-marching algorithm

Fast variants of the backward-oracle-marching algorithm

引用

作者： Fan, Hongbo Yao, Nianmin Ma, Haifeng Harbin Heilongjiang 150001 China

ISBN: (纸本)9780769540276

This study focuses on the faster exact single pattern string matching algorithms. In all solutions, two variants of BOM, EBOM and FBOM are very efficient. We improved them and presented two algorithms named Simplified-EBOM and Simplified-FBOM through removing the unnecessary branches and accomplishing the core calculation of the algorithm in a 1-dimensional array. The experimental results indicated that Simplified-EBOM is fast for short patterns and it is 12% faster than its basis algorithm on average. © 2010 IEEE.

关键词： string searching algorithms

来源：评论

学校读者我要写书评

暂无评论

Real-word spelling correction using google web 1T n-gram data set

Real-word spelling correction using google web 1T n-gram dat...

引用

ACM 18th International Conference on Information and Knowledge Management, CIKM 2009

作者： Islam, Aminul Inkpen, Diana Department of Computer Science SITE University of Ottawa Ottawa ON Canada

ISBN: (纸本)9781605585123

We present a method for correcting real-word spelling errors using the Google Web 1T n-gram data set and a normalized and modified version of the Longest Common Subsequence (LCS) string matching algorithm. Our method is focused mainly on how to improve the correction recall (the fraction of errors corrected) while keeping the correction precision (the fraction of suggestions that are correct) as high as possible. Evaluation results on a standard data set show that our method performs very well. Copyright 2009 ACM.

关键词： string searching algorithms

来源：评论

学校读者我要写书评

暂无评论

Tuning BNDM with g-grams 11

Tuning BNDM with g-grams

引用

11th Workshop on Algorithm Engineering and Experiments, ALENEX 2009 and 6th Workshop on Analytic Algorithmics and Combinatorics, ANALCO 2009

作者： ɰurian, Branislav Holub, Jan Peltola, Hannu Tarhio, Jorma S and T Varias s.r.o. Priemyselná 2 ŽilinaSK-010 01 Slovakia Department of Computer Science and Engineering Czech Technical University in Prague Karlovo nám. 13 Prague 2CZ-121 35 Czech Republic Department of Computer Science and Engineering Helsinki University of Technology P.O.B. 5400 HUTFI-02015 Finland

ISBN: (纸本)9781615671489

We develop bit-parallel algorithms for exact string matching. Our algorithms are variations of the BNDM and Shift-Or algorithms. At each alignment the algorithms read a q-gram before testing the state variable. In addition we apply reading a 2-gram in one instruction. Our experiments show that many of the new variations are substantially faster than any previous string matching algorithm on x86 processors for English and DNA data.

关键词： string searching algorithms

来源：评论

学校读者我要写书评

暂无评论

Tuning BNDM with q-Grams

Tuning BNDM with q-Grams

引用

11th Annual Workshop on Algorithm Engineering and Experiments, ALENEX 2009

作者： Ďurian, Branislav Holub, Jan Peltola, Hannu Tarhio, Jorma SandT Varias s.r.o. Priemyselná 2 SK-010 01 Žilina Slovakia Department of Computer Science and Engineering Czech Technical University in Prague Karlovo nám. 13 CZ-121 35 Prague 2 Czech Republic Department of Computer Science and Engineering Helsinki University of Technology P.O.B. 5400 FI-02015 Helsinki Finland

ISBN: (纸本)9780898719307

关键词： string searching algorithms

来源：评论

学校读者我要写书评

暂无评论

Comparison of exact string matching algorithms for biological sequences

引用

Communications in Computer and Information Science 2008年 13卷 417-426页

作者： Kalsi, Petri Peltola, Hannu Tarhio, Jorma Department of Computer Science and Engineering Helsinki University of Technology P.O. Box 5400 FI-02015 HUT Finland

ISBN: (纸本)9783540705987

Exact matching of single patterns in DNA and amino acid sequences is studied. We performed an extensive experimental comparison of algorithms presented in the literature. In addition, we introduce new variations of earlier algorithms. The results of the comparison show that the new algorithms are efficient in practice. © Springer-Verlag Berlin Heidelberg 2008.

关键词： string searching algorithms

来源：评论

学校读者我要写书评

暂无评论

High-speed string searching against large dictionaries on the Cell/BE processor

High-speed string searching against large dictionaries on th...

引用

10th Workshop on Advances in Parallel and Distributed Computational Models/22nd IEEE International Parallel and Distributed Processing Symposium

作者： Scarpazza, Daniele Paolo Villa, Oreste Petrini, Fabrizio IBM Corp Thomas J Watson Res Ctr Cell Solut Dept Yorktown Hts NY 10598 USA Politecn Milan Dipartimento Elettron & Informazione I-20133 Milan Italy

ISBN: (纸本)9781424416936

Our digital universe is growing, creating exploding amounts of data which need to be searched, filtered and protected. string searching is at the core of the tools we use to curb this explosion, such as search engines, network intrusion detection systems, spam filters, and anti-virus programs. But as communication speed grows, our capability to perform string searching in real-time seems to fall behind. Multi-core architectures promise enough computational power to cope with the incoming challenge, but it is still unclear which algorithms and programming models to use to unleash this power. We have parallelized a popular string searching algorithm, Aho-Corasick, on the IBM Cell/B.E. processor, with the goal of performing exact string matching against large dictionaries. In this article we propose a novel approach to fully exploit the DMA-based communication mechanisms of the Cell/B.E. to provide an unprecedented level of aggregate performance with irregular access patterns. We have discovered that memory congestion plays a crucial role in determining the performance of this algorithm. We discuss three aspects of congestion: memory pressure, layout issues and hot spots, and we present a collection of algorithmic solutions to alleviate these problems and achieve quasi-optimal performance. The implementation of our algorithm provides a worst-case throughput of 2.5 Gbps, and a typical throughput between 3.3 and 4.4 Gbps, measured on realistic scenarios with a two-processor Cell/B.E. system.

关键词： string searching algorithms

来源：评论

学校读者我要写书评

暂无评论

Fast string Matching with Space-efficient Word Graphs

Fast String Matching with Space-efficient Word Graphs

引用

International Conference on Innovations in Information Technology

作者： Yata, Susumu Morita, Kazuhiro Fuketa, Masao Aoe, Jun-ichi Univ Tokushima Inst Sci & Technol Tokushima Japan

ISBN: (纸本)9781424433964

string matching is one of the fundamentals in various text-processing applications such as text mining and content filtering systems. This paper describes a fast string matching algorithm using a compact pattern matching machine DAWG. A directed acyclic word graph (DAWG) is traditionally implemented with a 2-dimensional linked list or matrix. However, DAWGs with these structures have drawbacks, the lookup time or the linked list based one is slow and the space requirement of the matrix based one is large. Therefore, this paper proposes a novel DAWG based on a compacted double-array, which overcomes the drawbacks of traditional ones. Experimental results show that the novel DAWG is more efficient than traditional ones.

关键词： string searching algorithms

来源：评论

学校读者我要写书评

暂无评论

A Lightweight Multiple string Matching Algorithm

A Lightweight Multiple String Matching Algorithm

引用

International Conference on Computer Science and Information Technology

作者： Dai, Liuling Xia, Yuning Beijing Inst Technol Sch Comp Sci Beijing Lab Intelligent Informat Technol Beijing 100081 Peoples R China Tsinghua Univ Ctr Speech &Language Technol Beijing 100084 Peoples R China

ISBN: (纸本)9780769533087

string matching is a fundamental issue in computer science. This paper presents a lightweight string matching algorithm for short pattern matching, in which less than 20 keywords are often involved in the pattern set. The new algorithm makes use of condensed hash tables and computes the shift distance after each test by observing the character that immediately passes the test window. Experiments show that the new algorithm improves execution speed and decreases memory requirement. This algorithm is suitable for applications with small pattern set (i.e. containing up to 30 keywords), particularly for embedded equipments.

关键词： string searching algorithms

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：