c++ - 具有大致相同大小的操作数的大整数除法

Question

我正在尝试实现一个函数来计算两个大小大致相同的数字之间的除法。我使用 's 的数组unsigned int来存储每个数字的值（参见下面的类结构）。unsigned int我想为两者的数量之差不大于1的情况实现一个有效的除法函数。我研究了长除法算法（如维基百科中所述），但它太慢了。有没有处理这种特殊情况的有效算法？目前，我正在处理最多 100 位数字的数字。

class BigInt {
public:
    short sign; //sign of the number
    unsigned int size; //number of unsigned int's needed to store the number
    unsigned int* values; //values of the number

    ...
}

谢谢

score 1 · Accepted Answer

我相信 Knuth 算法 D 的修改版本（计算机编程艺术第 2 卷，第 4.3.1 节）应该可以在随机输入的恒定平均时间（最坏情况下为线性时间）中解决问题。显着的简化源于只需要初始迭代产生的第一个商字。

下面是我从 Hacker's Delight中毫不掩饰地改编通用实现的尝试。该实现假定 32 位字和 64 位中间/除法，但可以在更广泛的算术可用的地方自然扩展。

我确实担心该功能实际上未经测试，因此请更多地将其视为一个想法的萌芽，而不是完全成熟的实现。老实说，我什至不记得算法为什么起作用的微妙之处。

// A helper function for finding the position of the most significant set bit.
// Please plug in your intrinsic of choice here
static unsigned int find_first_bit(uint32_t value) {
#   ifdef _MSC_VER
    unsigned long index;
    (void) _BitScanReverse(&index, value);
    return index + 1;
#   else
    unsigned int count = 0;
    for(count = 0; value; ++count)
        value >>= 1;
    return count;
#   endif
}

// Multi-word division in 32-bit big-endian segments, returning the most significant
// word of the quotient.
uint32_t divide(const uint32_t *num, const uint32_t *den, size_t len) {
    // Skip past leading zero denominator digits. The quotient must fit in a single
    // 32-bit word and so only the preceeding numerator digit needs be examined
    uint32_t norm_den;
    uint64_t norm_num = 0;
    size_t top = 0;
    while(norm_den = den[top], !norm_den)
        norm_num = num[top++];
    // Please pad the input to insure at least three denominator words counting from
    // the first non-zero digit
    assert(len >= top + 3);
    // Divide the first two digits of the numerator by the leading digit of the
    // denominator as an initial quotient digit guess, yielding an upper bound
    // for the quotient at most two steps above the true value.
    // Simultaneously normalize the denominator with the MSB in the 31st bit.
    unsigned int norm = find_first_bit(norm_den);
    norm_num = norm_num << (64 - norm);
    norm_num |= ((uint64_t) num[top + 0] << 32 | num[top + 1]) >> norm;
    norm_den = ((uint64_t) norm_den << 32 | den[top + 1]) >> norm;
    // We are using a straight 64/64 division where 64/32=32 would suffice. The latter
    // is available on e.g. x86-32 but difficult to generate short of assembly code.
    uint32_t quot = (uint32_t) (norm_num / norm_den);
    // Substitute norm_num - quot * norm_den if your optimizer is too thick-headed to
    // efficiently extract the remainder
    uint32_t rem = norm_num % norm_den;
    // Test the next word of the input, reducing the upper bound to within one step
    // of the true quotient. See Knuth for proofs of this reduction and the bounds
    // of the first guess
    norm_num = ((uint64_t) num[top + 1] << 32 | num[top + 2]) >> norm;
    norm_num = (uint64_t) rem << 32 | (uint32_t) norm_num;
    norm_den = ((uint64_t) den[top + 1] << 32 | den[top + 2]) >> norm;
    if((uint64_t) quot * norm_den > norm_num) {
        --quot;
        // There is no "add-back" step try to avoid and so there is little point
        // in looping to refine the guess further since the bound is sufficiently
        // tight already
    }
    // Compare quotient guess multiplied by the denominator to the numerator
    // across whole numbers to account for the final quotient step.
    // There is no need to bother with normalization here. Furthermore we can
    // compare from the most to the least significant and cut off early when the
    // intermediate result becomes large, yielding a constant average running
    // time for random input
    uint64_t accum = 0;
    do {
        uint64_t prod = (uint64_t) quot * *den++;
        accum = accum << 32 | *num++;
        // A negative partial result can never recover, so pick the lower
        // quotient. A separate test is required to avoid 65-bit arithmetic
        if(accum < prod)
            return --quot;
        accum -= prod;
        // Similarly a partial result which spills into the upper 32-bits can't
        // recover either, so go with the upper quotient
        if((uint64_t) accum >= 0x100000000)
            return quot;
    } while(--len);
    return quot;
}

score 1 · Accepted Answer

假设您想要一个整数结果，您应该能够相当快地执行此操作。

以小数开头的十进制数可能最容易想到这一点。

让我们考虑一些较小的数字，例如：123456789/54321876。其中一个比另一个多一个数字，因此最大可能的结果大约是 10。

如果我们将两边相除，那么（比如说）较大的数字剩下三个数字，较小的数字剩下两个数字，我们得到123/54，这会产生相同的整数结果，就好像我们在整个过程中都以全精度完成了工作一样。

在您的情况下，除了十进制数字之外，您几乎可以做同样的事情，您将在较大的项目中保留三个精确的单词，在较小的项目中保留两个单词（实际上，即使这是保守的 - 2 和 1单词分别可能就足够了，但我现在懒得证明这一点，如果你这样做，你可能需要更加小心四舍五入）。

所以是的，您认为这是一种特殊情况，您可以根据几乎相同大小的数字来节省工作，这是正确的。

c++ - 具有大致相同大小的操作数的大整数除法

2 回答 2

Related

Reference