Background-informations to the supplemental index
This week, Google was granted a patent that was already filed in 2003, which is obviously describing the functioning of the supplemental-index. The patent talks about how the index can be divided into meaningful partitions to increase the speed and efficiency of the whole system. The patent goes into the specific technical processes in detail – which pages go into which index, how are the indexes joined during searches, how are snippets created and other informations. Interesting in this context is that the PageRank is considered for the decision into which index a site is pushed. The justification being that the PageRank is a good and query-independent pointer for the importance of a site; a statement that Cutts has already made in his blog but which nobody really believed. However, we have to remember that this patent is already 4 years old. Last year, the technology behind the supplemental index has seen a general overhaul and has probably been changed noticeably. Those who are interested in some background-informations, who have willingly read “Anatomy of a Large-Scale Hypertextual Web Search Engine” by Page and Brin for example, should take a look at the aforementioned patent. By the way, the query "-site:domain.tld* +site:domain.tld" seems to be working to find results in the supplemental index for quite a few domains, the rat-race continues ...This posting is older than 30 days and therefore closed for new comments.