0

我已经在 C#.NET 中实现了 Rabin-Karp 算法,遵循这个伪代码:

伪代码

问题是,模式与原始文本不匹配。我已经彻底浏览了代码,但我无法识别代码中的问题。有人可以告诉我代码中的错误吗?

static void Main(string[] args)
{
    string text = "ratcatpat catbats";
    string pattern = "cat";

    int d = text.Select(e => e).Distinct().Count();

    RabinCarp(text, pattern, d, 17);

    Console.ReadKey();
}

static void RabinCarp(string text, string pattern, int sizeOfAlphabet, int moduloValue)
{ 
    int rollingHashOf_P = 0;
    int rollingHashOf_T = 0;

    int lengthOfText = text.Length;
    int lengthOfPattern = pattern.Length;
    int h = (int)(Math.Pow(sizeOfAlphabet, lengthOfPattern - 1) % moduloValue);

    for (int i = 0; i < lengthOfPattern; i++)
    {
        rollingHashOf_P = (sizeOfAlphabet * rollingHashOf_P + (int)pattern[i]) % moduloValue;
        rollingHashOf_T = (sizeOfAlphabet * rollingHashOf_T + (int)text[i]) % moduloValue;
    }

    int diffNM = lengthOfText - lengthOfPattern;

    for (int i = 0; i <= diffNM; i++)
    {
        if (Math.Abs(rollingHashOf_P) == Math.Abs(rollingHashOf_T))
        {
            if (text.Substring(i, lengthOfPattern).Contains(pattern))
            {
                string message = "pattern identified";
                Console.WriteLine(message);
            }
        }   
        if (i < diffNM)
        {
            rollingHashOf_T = Math.Abs(sizeOfAlphabet * (rollingHashOf_T - (int)text[i] * h) + (int)text[i + lengthOfPattern]) % moduloValue;
        }
    }
}
4

1 回答 1

2

我不熟悉 Rabin-Karp 算法,但我很确定你应该和以下算法rollingHashOf_P一样进步rollingHashOf_T

if (i < diffNM)
{
    rollingHashOf_T = Math.Abs(sizeOfAlphabet * (rollingHashOf_T - (int)text[i] * h) + (int)text[i + lengthOfPattern]) % moduloValue;
    rollingHashOf_P = Math.Abs(sizeOfAlphabet * (rollingHashOf_P - (int)pattern[i] * h) + (int)pattern[i + lengthOfPattern]) % moduloValue;
}

在 OP 在下面的评论中分享了这个伪代码之后:

伪代码

很明显,以上是错误的。将其与帖子中的代码进行比较,尽管表明该错误可能rollingHashOf_T毕竟在推进中,正如它所说:

rollingHashOf_T = Math.Abs(sizeOfAlphabet * (rollingHashOf_T - 
  (int)text[i] * h) + (int)text[i + lengthOfPattern]) % moduloValue;

虽然伪代码表明它应该是:

rollingHashOf_T = Math.Abs(sizeOfAlphabet * (rollingHashOf_T - 
  (int)text[i + 1] * h) + (int)text[i + lengthOfPattern + 1]) % moduloValue;
于 2014-04-17T20:21:33.947 回答