项目作者: unruledboy

项目描述 :
WuManber text search/matching implementation using c#
高级语言: C#
项目地址: git://github.com/unruledboy/WuManber.git
创建时间: 2018-01-10T10:18:33Z
项目社区:https://github.com/unruledboy/WuManber

开源协议:Apache License 2.0

下载


WuManber

WuManber implementation using c#.

Wu/Manber is a very high performance string/text search/match algorithm.

Wu-Manber stand on the shoulders of their Multi Pattern Search predecessors: Aho and Corasick (with a linear time scanner based upon an automata approach), Commentz-Walter (who combined Aho-Corasick with the Boyer-Moore string search algorithm), and Baeza-Yates (with a slightly different combination of Aho-Corasick and Boyer-Moore-Horspool).

This implementation is based on the C++ version from here: http://blog.raymond.burkholder.net/index.php?/archives/362-C++-Implementation-of-Wu-Manbers-Multi-Pattern-Search-Algorithm.html

PPT explaination of the algorithm is here: https://www.slideshare.net/mailund/wu-mamber-string-algorithms-2007

Usage

The usage is very straight forward, please refer to the FunctionTest() in the Program.cs file.

Besides a list of words, you can pass in additional info like Id and Tag, just in case you would like to cross referene with your real world business.

中文

Wu/Manber是非常高效的字符串/文本搜索/查询/匹配算法。