Friday, May 13, 2011

Google’s Real-Life Babelfish Will Translate the World

Before the Internet, the main source material for these translations had been corpuses such as UN documents that had been translated into multiple languages. But the web had produced an unbelievable treasure trove — and Google's indexes made it easy for its engineers to mine billions of documents, unearthing even the most obscure efforts at translating one document or blog post from one language to another. Even an amateurish translation could provide some degree of knowledge, but Google's algorithms could figure out which translations were the best by using the same principles that Google used to identify important websites. "At Google," says Och, with dry understatement, "we have large amounts of data and the corresponding computation of resources we need to build very, very, very good systems."
 -- Steven Levy, from his book In the Plex: How Google Thinks, Works, and Shapes Our Lives, via an excerpt published at Gizmodo (emphasis added)

1 comment:

Anonymous said...

Absolutely right. Google owns more data than anyone in history ever has. Let's just come they do some good things with it!