来自google blogspot中的引用,
"In fact, we found even more than 1 trillion individual links, but not all of
them lead to unique web pages. Many pages have multiple URLs with exactly the same
content or URLs that are auto-generated copies of each other. Even after removing
those exact duplicates . . . "
Google 如何检测那些完全重复的网页或文档?对 Google 使用的算法有任何想法吗?