Background-informations to the supplemental index

Johannes Beus
This week, Google was granted a patent that was already filed in 2003, which is obviously describing the functioning of the supplemental-index. The patent talks about how the index can be divided into meaningful partitions to increase the speed and efficiency of the whole system. The patent goes into the specific technical processes in detail – which pages go into which index, how are the indexes joined during searches, how are snippets created and other informations. Interesting in this context is that the PageRank is considered for the decision into which index a site is pushed. The justification being that the PageRank is a good and query-independent pointer for the importance of a site; a statement that Cutts has already made in his blog but which nobody really believed. However, we have to remember that this patent is already 4 years old. Last year, the technology behind the supplemental index has seen a general overhaul and has probably been changed noticeably. Those who are interested in some background-informations, who have willingly read “Anatomy of a Large-Scale Hypertextual Web Search Engine” by Page and Brin for example, should take a look at the aforementioned patent. By the way, the query "-site:domain.tld* +site:domain.tld" seems to be working to find results in the supplemental index for quite a few domains, the rat-race continues ...

Johannes Beus

Johannes Beus, Founder and CEO of SISTRIX, has been interested in the optimisation of websites for searchengines since 2001. In 2003 he started to regularly publish summaries of his evaluations and share his thoughts on the SEO-sector on one of the oldest German SEO-blogs.
Johannes Beus - on Wed (08/08/2007) at 21:58 PM

Add Comment

more
This posting is older than 30 days and therefore closed for new comments.