Google: A Trillion URLs and counting


The Google blog notes how huge the web is now, with Google indexing over a trillion unique URLs. As they note in the article, the actual number of indexable URLs is, in one sense, infinite. For example, calendar pages in many applications will keep appearing automatically as you scroll forward, continuing through the years until... the singularity and beyond. Of course, Google does not index many of these “empty” URLs, or even a lot of junk or redundant content, so the true number of unique URLs on the web is actually well above a trillion.

I think a fun question is this: What will the information landscape look like in, say, 20 years, when we should have the ability to pour *everything* from the past and the present online? Questions might take a different form if we had access to every reference on a topic that has ever been produced. Algorithms will be used to sort through the oceans of content much as Google does now, but with far more precision and better comprehension of the whole mess.

About JoeDuck

Internet Travel Guy, Father of 2, small town Oregon life. BS Botany from UW Madison Wisconsin, MS Social Sciences from Southern Oregon. Top interests outside of my family's well being are: Internet Technology, Online Travel, Globalization, China, Table Tennis, Real Estate, The Singularity.
This entry was posted in Artificial Intelligence, companies, computers, internet, Science & Technology, web, Web 2.0.
