java - 哪个更快，String 或 Integer 作为 Java 中的 hashkey？

Question

我正在解决一个问题，我遇到了执行时间变得太大的问题，现在我正在寻找可能的优化。

问题：使用 String 或 Integer 作为 haskey 在性能上是否有任何（相当大的）差异？

问题是我有一个图表，其中节点存储在一个以字符串为键的哈希表中。例如，键如下 - “0011”或“1011”等。现在我也可以将它们转换为整数，如果这意味着执行时间的改进。

score 3 · Accepted Answer

整数将比字符串表现更好。以下是两者的哈希码计算代码。

整数哈希码实现

/**
     * Returns a hash code for this <code>Integer</code>.
     *
     * @return  a hash code value for this object, equal to the 
     *          primitive <code>int</code> value represented by this 
     *          <code>Integer</code> object. 
     */
    public int hashCode() {
    return value;
    }

字符串哈希码实现

 /**
     * Returns a hash code for this string. The hash code for a
     * <code>String</code> object is computed as
     * <blockquote><pre>
     * s[0]*31^(n-1) + s[1]*31^(n-2) + ... + s[n-1]
     * </pre></blockquote>
     * using <code>int</code> arithmetic, where <code>s[i]</code> is the
     * <i>i</i>th character of the string, <code>n</code> is the length of
     * the string, and <code>^</code> indicates exponentiation.
     * (The hash value of the empty string is zero.)
     *
     * @return  a hash code value for this object.
     */
    public int hashCode() {
    int h = hash;
    if (h == 0) {
        int off = offset;
        char val[] = value;
        int len = count;

            for (int i = 0; i < len; i++) {
                h = 31*h + val[off++];
            }
            hash = h;
        }
        return h;
    }

score 2 · Accepted Answer

如果您有性能问题，那么问题不太可能是由于 HashMap/HashTable 造成的。虽然散列字符串比散列整数稍微贵一些，但差别很小，而且 hashCode 被缓存，因此如果您使用相同的字符串对象不会重新计算它，您不太可能从首先将其转换为整数获得任何显着的性能优势。

在其他地方寻找性能问题的根源可能更有成效。您是否尝试过分析您的代码？

score 1 · Accepted Answer

速度有区别。HashMaps 会使用 hashCode 根据该代码计算桶，Integer 的实现比 String 的实现简单得多。

话虽如此，如果您在执行时间方面遇到问题，您需要进行一些适当的测量并分析自己。这是找出执行时间问题的唯一方法，使用整数而不是字符串通常对性能的影响很小，这意味着您的性能问题可能在其他地方。

例如，如果您想做一些适当的微基准测试，请查看这篇文章。还有许多其他资源可用于分析等。

java - 哪个更快，String 或 Integer 作为 Java 中的 hashkey？

3 回答 3

Related

Reference