The Polyglot Internet

Prepared for the World Economic Forum Global Agenda Council on the Future of the Internet by Ethan Zuckerman, October 30, 2008

The first wave of the Internet revolution changed expectations about the availability of information. Information that was stored in libraries, locked in government vaults or available only to subscribers was suddenly accessible to anyone with an internet connection. A second wave has changed expectations about who creates information online. Tens of millions of people are contributing content to the modern Internet, publishing photos, videos and blogposts to a global audience.

The globalization of the Internet has brought connectivity to almost 1.3 billion people. The Internet that results from globalization and user-authorship is profoundly polyglot. Wikipedia is now available in more than 210 languages, which implies that there are communities capable of authoring content in those tongues. Weblog search engine Technorati sees at least as many blogposts in Japanese as in English, and some scholars speculate that there may be as much Chinese content created on sites like Sina and QQ as on all English-language blogs combined.

A user who joins the Internet today is far more likely to encounter content in her own language than she would have been had she joined ten years ago. But each internet user is able to participate in a smaller percentage of the total interactions and conversations than an English-speaking internet user could in 1997, when English was the dominant language of the net.

There’s a danger of linguistic isolation in today’s internet. In an earlier, English-dominated internet, users were often forced to cross linguistic barriers and interact in a common language to share ideas with a wider audience. In today’s internet, there’s more opportunity for Portuguese, Chinese, or Arabic speakers to interact with one another, and perhaps less incentive to interact with speakers of other languages. This in turn may fulfill some of the predictions put forth by those who see the Internet acting as an echo-chamber for like-minded voices, not as a powerful tool to encourage interaction and understanding across barriers of nation, language and culture.

For the Internet to fulfill its most ambitious promises, we need to recognize translation as one of the core challenges to an open, shared and collectively governed internet. Many of us share a vision of the Internet as a place where the good ideas of any person in any country can influence thought and opinion around the world. This vision can only be realized if we accept the challenge of a polyglot internet and build tools and systems to bridge and translate between the hundreds of languages represented online.

Machine translation will not solve all our problems. While machine translation systems continue to improve, they are well below the quality threshold necessary to enable readers to participate in conversations and debates with speakers of other languages. The best machine translation systems still have difficulty with colloquial and informal language, and are most reliable in translating between Romance languages. The dream of a system that creates fully-automated, high-quality translations in important language pairs like English/Hindi still appears a long way off.

While there is a profound need to continue improving machine translation, we also need to focus on enabling and empowering human translators. Professional translation continues to be the gold standard for the translation of critical documents. But it is too expensive for web surfers who simply want to understand what peers in China or Colombia are discussing and to take part in those discussions.

The polyglot internet demands that we explore the possibility and power of distributed human translation. Hundreds of millions of internet users speak multiple languages; some percentage of these users are capable of translating between them. These users could be the backbone of a powerful, distributed peer production system able to tackle the audacious task of translating the internet.

We are at the very early stages of the emergence of a new model for translation of online content – “peer production” models of translation. Yochai Benkler uses the term “peer production” to describe new ways of organizing collaborative projects beyond conventional arrangements like corporate firms. Individuals have a variety of motives for participating in translation projects: sometimes an explicit interest in building intercultural bridges, sometimes fiscal reward or personal pride. In the same way that open source software is built by programmers fueled both by personal passion and by support from multinational corporations, we need a model for peer production translation that enables multiple actors and motivations.

To translate the internet, we need both tools and communities. Open source translation memories will allow translators to share work with collaborators around the world; translation marketplaces will let translators and readers find each other through a system like Mechanical Turk enhanced with reputation metrics; browser tools will let readers seamlessly translate pages into the highest-quality version available and request future human translations. Making these tools useful requires building large, passionate communities committed to bridging in a polyglot web, to preserving smaller languages and to making tools and knowledge accessible to a global audience.
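As a rough illustration of how a shared translation memory and a "highest-quality version available" lookup might fit together, here is a minimal sketch. Everything in it is hypothetical: the `Translation` record, the `reputation` score, the similarity threshold, and the rule of preferring human work over machine output are assumptions made for the example, not a description of any existing tool.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import List, Optional


@dataclass
class Translation:
    source: str        # original sentence
    target: str        # translated sentence
    translator: str    # hypothetical contributor name
    reputation: float  # hypothetical reputation score in [0, 1]
    human: bool        # True for human work, False for machine output


class TranslationMemory:
    """Toy shared store of past translations (illustrative only)."""

    def __init__(self) -> None:
        self._entries: List[Translation] = []

    def add(self, entry: Translation) -> None:
        self._entries.append(entry)

    def best_match(self, text: str, min_similarity: float = 0.8) -> Optional[Translation]:
        """Return the best stored translation for `text`, or None.

        Candidates must be at least `min_similarity` similar to the query.
        Among those, human translations win over machine ones, then closer
        matches, then higher reputation. This ordering is an assumption.
        """
        best: Optional[Translation] = None
        best_key = None
        for entry in self._entries:
            similarity = SequenceMatcher(None, text.lower(), entry.source.lower()).ratio()
            if similarity < min_similarity:
                continue
            key = (entry.human, similarity, entry.reputation)
            if best_key is None or key > best_key:
                best, best_key = entry, key
        return best


tm = TranslationMemory()
tm.add(Translation("Good morning", "Bom dia (MT)", "machine", 0.5, False))
tm.add(Translation("Good morning", "Bom dia", "ana", 0.9, True))

match = tm.best_match("good morning")
print(match.target if match else "no translation yet; request one")
```

A browser tool built along these lines could show the returned translation when one exists and, when `best_match` returns `None`, queue a request for a human translator.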

If we do not address the problems of the polyglot internet, we introduce another possible way our shared internet can fragment. There are competing – and likely incompatible – visions for future governance of the internet. As the internet becomes less of a global, shared space and more of a Chinese or Arabic or English space, we lose incentives to work together on common, compatible frameworks and protocols. We face the real possibility of the internet becoming multiple internets, divided first by languages, but later by values, norms and protocols.

The internet is the most powerful tool created by humans to allow connection, collaboration and understanding between people of different nations, races and cultures. For the internet to reach its potential in bridging human differences, we need to make the problems of language and translation central to our conversations about the future of the internet.

50 Responses to The Polyglot Internet

  1. Pingback: …My heart’s in Accra » The polyglot internet

  2. jose murilo says:

    You nailed it, again.
    Many thanks, Ethan.

  3. Great essay, and it sparked me to consider one unavailable opportunity — taking advantage of the Google Books Search settlement, at http://snurl.com/4xw20 [blogs_lib_berkeley_edu] . The blog discusses the possibility of creating multiple rough machine translations of the Google Book Search content for search and discovery (as opposed to presuming the ability to create new authorized translations without the rightsholders’ permissions).

  4. Pingback: …My heart’s in Accra » The weekend in Dubai

  5. Pingback: Web novinarstvo » Nova era veb-komunikacije u kojoj će Internet postati poliglota

  6. Pingback: What Trend Is the Biggest Threat to the Future of the Internet? | Blurring Borders

  7. gabyd says:

    do you think that maybe images, i mean photographs & videos, will gain then more and more place?
