Wired on Google’s Algorithm

Wired has a fascinating look at Google’s search algorithm and how it has developed. I may not have mentioned this, but I really love Google. What hooked me on this one, though, was the quote on The Daily Dish, which made it sound like linguistics might come up:

Google’s synonym system understood that a dog was similar to a puppy and that boiling water was hot. But it also concluded that a hot dog was the same as a boiling puppy. The problem was fixed in late 2002 by a breakthrough based on philosopher Ludwig Wittgenstein’s theories about how words are defined by context. As Google crawled and archived billions of documents and Web pages, it analyzed what words were close to each other. “Hot dog” would be found in searches that also contained “bread” and “mustard” and “baseball games” — not poached pooches. That helped the algorithm understand what “hot dog” — and millions of other terms — meant. “Today, if you type ‘Gandhi bio,’ we know that bio means biography,” Singhal says. “And if you type ‘bio warfare,’ it means biological.”

Article link (via Daily Dish)

(Not much linguistics in there, but very interesting.)
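
For the curious, here is a toy sketch of the "words are defined by context" idea from the quoted passage. It is not Google's actual method, which the article does not spell out. It only counts which words share a document with each candidate meaning and picks the candidate whose neighbors match the rest of the query. The tiny document list and the names `context_profile` and `expand` are made up for illustration.

```python
from collections import Counter

# Made-up stand-in for "billions of documents"; purely illustrative.
DOCUMENTS = [
    "gandhi biography covers his life and early years",
    "a biography of gandhi and his life in india",
    "biological warfare uses agents as weapons",
    "the treaty bans biological weapons and chemical warfare",
]


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately crude tokenizer)."""
    return text.lower().split()


def context_profile(word, documents):
    """Count the words that appear in the same document as `word`."""
    profile = Counter()
    for doc in documents:
        tokens = tokenize(doc)
        if word in tokens:
            profile.update(t for t in tokens if t != word)
    return profile


def expand(query, short_form, candidates, documents):
    """Pick the candidate whose context best matches the rest of the query.

    Returns None when no candidate shares any context with the query.
    """
    context = [t for t in tokenize(query) if t != short_form]
    scores = {}
    for candidate in candidates:
        profile = context_profile(candidate, documents)
        # A Counter returns 0 for words it has never seen.
        scores[candidate] = sum(profile[t] for t in context)
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


if __name__ == "__main__":
    senses = ["biography", "biological"]
    print(expand("gandhi bio", "bio", senses, DOCUMENTS))
    print(expand("bio warfare", "bio", senses, DOCUMENTS))
```

With these four documents, "gandhi" only ever appears alongside "biography" and "warfare" only alongside "biological", so the two queries from Singhal's example come out the way he describes. A real system would need far more data and a much better notion of "close to each other" than "in the same document."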