Index-monster

Johannes Beus
The amount of pages indexed in Google are especially important for projects that work mainly in the long-tail-area. More pages, meaning content, being searchable means a better chance of being found by a multitude of different searchqueries. While in the old days Google indexed every page that the crawler could find, they now have a rather efficient algorithm in place which rates the maximum number of indexable pages on an array of diverse signals. I have taken a look at how many sites I can come up with on the top of my head which have masses of pages in the index and made a chart of them:

#DomainPages
1yahoo.com339.000.000
2yahoo.co.jp171.000.000
3myspace.com136.000.000
4blogspot.com120.000.000
5ebay.com111.000.000
6youtube.com105.000.000
7msn.com86.200.000
8wikipatents.com (fluctuating strongly?)67.000.000
9amazon.com53.300.000
10amazon.de52.600.000
11ebay.de51.700.000
12flickr.com50.200.000
13alibaba.com49.200.000
14wordpress.com46.600.000
15live.com45.700.000
16aol.com45.100.000
17livejournal.com44.600.000
18rootsweb.com41.900.000
19meetup.com41.600.000
20amazon.ca41.400.000
43google.com25.600.000
46chefkoch.de23.300.000
81meinestadt.de13.000.000
99yatego.com10.900.000
114cylex.de9.740.000

Besides the first 20 there are 5 more entries which are pretty interesting from a German point of view. The data comes from a “site:domain.tld”-query that I finished just now. Those of you who know more domains that fit this list are welcome to post them in the comments.
Johannes Beus - on Mon (04/14/2008) at 22:25 PM

Add Comment

more
This posting is older than 30 days and therefore closed for new comments.