情侣自拍

Skip to content
  • Shop ATAware
  • Contact Us
  • Log In Welcome,
情侣自拍 (ATA)
Find a Translator or Interpreter
  • Client Assistance
    • Find a Translator Button
      • Find a Language Professional
    • Client Resources
      • Why Should I Hire a Professional?
      • Translator vs. Interpreter
      • Buying Language Services
      • What is Machine Translation?
    • More Client Resources
      • Why Hire an ATA-Certified Translator?
      • Need a Certified Translation?
      • The ATA Compass Blog
      • Know Your Rights to Language Access
  • Certification
    • Register Buttons
      • Order Practice Test

      • Register for Exam
    • Client Resources
      • Why Hire an ATA-Certified Translator?
      • What is a Certified Translation?
    • About the Exam
      • How to Prepare
      • Practice Test
      • How the Exam is Graded
      • Exam Schedule
      • Need More Information?
    • Already Certified?
      • Put Your Credentials To Work
      • Continuing Education Requirement
  • Career Support
    • Event Buttons
      • Visit ATA67

      • Upcoming Webinars
    • For Newcomers
      • Student Resources
      • Starting Your Career
      • The Savvy Newcomer Blog
    • For Professionals
      • Growing Your Career
      • Business Strategies
      • Next Level Blog
      • Client Outreach Kit
      • Mentoring Opportunities
    • More Resources
      • Educators and Trainers
      • Tools and Technology
      • Publications
      • School Outreach
  • Events
    • Event Buttons
      • Visit ATA67

      • Upcoming Webinars
        听
    • Events
      • Annual Conference
      • Free Events for ATA Members
      • Certification Exam Schedule
    • More Events
      • Virtual Workshops and Events
      • Upcoming Webinars
      • Webinars On Demand
      • Calendar of Events
  • News
    • Industry News
    • Advocacy and Outreach
    • The ATA Chronicle
    • The ATA Podcast
    • ATA Newsbriefs
    • Press Releases
  • Member Center
    • Member Buttons
      • Join ATA

      • Renew Now
    • Member Resources
      • Join ATA
      • Renew Your Membership
      • Benefits of Membership
      • Divisions & Special Interest Groups
      • Chapters, Affiliates, and Other Groups
      • Get Involved
      • Member Discounts
      • Shop ATAware
    • Already a Member?
      • Connect with Members
      • Credentialed Interpreter Designation
      • Become a Voting Member
      • Submit Member News
      • Submit Your Event
      • Contact Us
  • About Us
    • Contact Button
      • Contact ATA

    • About ATA
      • Who We Are
      • Honors and Awards Program
      • Advertise with Us
      • Media Kit
    • How ATA Works
      • Board of Directors
      • Committees
      • Policies & Procedures
      • Code of Ethics
      • ATA Team
  • Join ATA
  • Renew Your Membership
  • Contact Us
  • Find a Translator or Interpreter
  • Search for:
March 21, 2017

Interview with Mike Dillinger, Machine Translation Pioneer

Resources
Source: The ATA Chronicle

Translation has been an integral part of our human evolution ever since we developed the ability to speak. But with the arrival of the industrial age, the spread of globalization, and the birth of technology, it was inevitable that someone would eventually wonder if machines were capable of performing this task. Research began in earnest in the early 1950s, and great progress has obviously been made since then. Machine translation, or MT, is the name for both the technology and for an established subfield of linguistic engineering that creates systems to translate text or speech from one human language to another.

My guest this time has spent some 20 years deeply immersed in the theory and practice of MT. Mike Dillinger currently manages taxonomies and MT at LinkedIn. He is a member of the External Advisory Board at the ADAPT Centre (a leading center for MT research) in Dublin, Ireland, and advises startups in the U.S., Israel, Australia, and Brazil. He has twice been president of the Association for Machine Translation in the Americas, and has experience with all phases of planning, development, deployment, and evaluation of MT systems. He led LinkedIn鈥檚 and eBay鈥檚 first launches of full-scale production MT, spearheaded the development of the first commercial MT-translation memory (TM) tool integration (Star Transit with Logos MT), and developed an interactive speech-to-speech MT system.

Mike earned a PhD from McGill University, in Montreal, for research on the cognitive processes of simultaneous interpreting and comprehension of technical content, as well as degrees in linguistics in the U.S., Canada, and Brazil. He is also an experienced translator and interpreter who has worked in English, Portuguese, Spanish, and French. Mike wants organizations everywhere to enable global communication by developing content that鈥檚 understandable and translatable, and by deploying MT effectively.

Thank you for spending time with us, Mike. Let鈥檚 start with a little history. Can you summarize the highlights of MT鈥檚 development?

A research collaboration between Georgetown University in Washington, DC, and IBM resulted in the first widely-known MT system from Russian into English. It was reported on the front page of The New York Times on January 8, 1954, and, of course, led to the hasty conclusion that the problem of automatic translation was essentially solved.1

Fun facts: it wasn鈥檛 a real MT system, just a proof of concept, with about two dozen rules and fewer than 200 words in its dictionary. It was basically what we now call a hybrid system, in which rules used the most probable word senses. It was, however, a spectacularly effective system because it convinced the U.S. and Russian governments to pour millions of dollars into research to 鈥渇inish鈥 developing the technology. The U.S. first used MT to translate Russian scientific publications to track their technology development during the Cold War. Later, MT (from Logos Corp) was used to translate helicopter repair manuals into Vietnamese.

Fast forward through the 1970s, when U.S. government funding dried up and Europe and Japan took the lead in MT research, trying to capture translators鈥 knowledge in rules stored on a computer鈥斺渞ule-based MT.鈥 In the 1980s, again at IBM, a new statistical approach emerged in which the software would calculate the likelihood of one particular translation based on many examples of human translations鈥斺渟tatistical MT.鈥 Again, this new approach convinced the U.S. government to provide financial support, and MT took off as a research area in the 1990s.

In the early 2000s, Google formed a group to work on MT, and companies like Language Weaver started to offer statistical MT systems as products. Since 2007, much progress has been made in making MT technology more easily available, both to researchers (with the Moses toolkit) and to translators, with Microsoft Translators Hub, Google鈥檚 Translator Toolkit, and products like Kantan MT. Most recently, significant progress has been made on two fronts:

  1. MT systems can now be 鈥渁daptive.鈥 In other words, the systems get updated every time a human makes a correction. (One example of this can be found at www.lilt.com.)
  2. A newer research approach called 鈥渘eural鈥 MT is improving how MT systems leverage information in the context of a sentence being translated.
Most translators are, of course, very familiar with MT, and use it in their daily work in one way or another. Just to get us all on the same page, please give us your definition of MT: a description of how it works and how it differs from Translation Memory (TM).

Both technologies (TM and MT) have the same job: to provide possible translations for expert review. TM technology focuses on reusing whole segments (mostly sentences). But you don鈥檛 get anything if most of an incoming sentence doesn鈥檛 match, unless you ask for an 鈥渁ssembled鈥 translation that guesses piece by piece. MT just produces assembled translations, but in a much more sophisticated way. That鈥檚 it.

Speed is obviously one of MT’s great advantages. How fast can a machine translate, compared to an average human translator?

The average human translator delivers something like 2,000 words per day. An average MT system can produce something like 2,000,000 words per day, and engineers know how to increase that rate with super-fast TMs.

Can you give us an idea of the volume involved? How many words are being processed by MT programs on an average day in the U.S. and worldwide? Is it possible to compare that with the volume of human translation?

Let鈥檚 take one example: Google Translate. It processes about 100 billion words per day, in 103 languages, for 500 million users around the world. That鈥檚 equivalent to the output of 50 million translators every day.

Where is MT mainly used? In what areas, what sectors?

MT is mainly used in two scenarios:

  1. The 鈥渘o-other-choice-but-MT鈥 scenarios (e.g., translating search queries, e-mail, tweets, and user feedback), ecommerce, informal 鈥淚鈥檓-just-curious translations,鈥 translations of customer support information, and espionage. What they all have in common is that the volume of source documents is far too large for humans to tackle, and the information in the source documents is too ephemeral or not very valuable. It鈥檚 also really horrible translation work. Can you imagine translating search queries or e-mail all day! In this sense, MT is doing us a favor! This kind of MT actually creates more work for human translators. It helps organizations identify important information鈥攕erving a triage function鈥攖hat humans often have to translate.
  2. The 鈥淚鈥檓-in-a-hurry鈥 scenarios, such as big localization projects for multinational contracts and project launches at global companies. Again, there鈥檚 usually far too much content and too little time for normal translation processes, so we use MT to pre-translate. When we set things up correctly, MT can make the whole process about four times faster.
How has MT impacted technical writing, and source-material writing in general?

I鈥檝e worked quite a bit with tech writers at a range of companies, and it鈥檚 clear to me that the impact of MT itself has been very small. The impact of TM technology and translation pricing, however, has been substantial. To save money (reason #1!), decrease project turnaround time, and increase readability for end users, many companies have adopted tools like Acrolinx and content management systems to improve the consistency and reuse of their source content.

In the early days of TM and MT, some translators were apprehensive, scared that technology was about to make them redundant. You had a great response to that. You said: translation technologies are translator 鈥渁ccelerators,鈥 not translator 鈥渞eplacements.鈥 Please expand on that idea.

Translation technologies today perform pretty poorly for most kinds of content, especially when writers don鈥檛 write consistently or clearly. TM and MT tools simply aren鈥檛 mature enough to be 鈥渓et out of the house鈥 on their own when the goal is publishable content. In any scenario where the source content is valuable or the reader鈥檚 understanding is important, we continue to need human translators. By the way, the amount of economically valuable content that should really be translated is estimated to be at least 10 times what we are doing today. That means that we need to find ways to accelerate the translator鈥檚 productivity, so we use MT as a translator accelerator.

You have also said that there are two ways for translators to work with MT: post-editing and turbo-translating. Please tell us more about those two methods.

In post-editing, someone else controls the MT system that provides you with draft translations. Often, these individuals don鈥檛 know what they鈥檙e doing or aren鈥檛 paying attention to the same things as you. So, you don鈥檛 always get the best possible candidate translations with which to work. That鈥檚 why you have to be extra careful when you accept post-editing jobs, since not all MT output is of the same quality.

In 鈥渢urbo-translating鈥 you know enough about MT to control the process and create your own draft translations. This means you can better manage the resources the system uses, you control the trade-off between speed and accuracy, and you can predict the kinds of problems you鈥檒l find and prepare for them. The older rule-based MT systems offered more direct control over different linguistic parameters, so they were better for turbo-translating. Statistical systems are usually black-box systems that we can鈥檛 control much.

Actually, I think there is a third way to work with MT, but it鈥檚 not available yet. I call this new option 鈥渉ybrid-intelligence鈥 translation. The idea is to leverage the strengths of both machine intelligence and human expertise by letting the humans 鈥渄rive鈥 the machine. This approach is like fly-by-wire systems for pilots: the pilot is definitely in charge and the system works out thousands of routine details to allow the pilot to focus on the important things. Adaptive MT is the first step in this direction, and I think there are many more things we can do to allow translators to 鈥減ilot鈥 their MT systems.

The post-editing field looks like a future growth area for many in our profession. Is that how you see it? Is post-editing something any translator can do? What are the requirements?

Yes, post-editing will surely continue to grow, and very rapidly. There鈥檚 some controversy concerning your second question. I believe that anyone who can correct a junior translator鈥檚 work can correct MT output. Researchers have already documented huge differences in how quickly people can do post-editing, and a large part of that is due to experience rather than special training. I still can鈥檛 identify any specific training that post-editing requires.

Do all post-editing clients want the same 鈥減roduct鈥 from a translator, or are there different standards or levels? Do they all want a translator鈥檚 best work, or do some want something that鈥檚 just 鈥済ood enough鈥?

This is actually the beauty of post-editing. Whereas before MT we could either provide a first-rate translation or none at all, now we can calibrate the translation quality more precisely to the client鈥檚 needs. For a while, we saw people ask for 鈥渓ight鈥 post-editing and 鈥渇ull鈥 post-editing. 鈥淟ight鈥 post-editing is an effort to find and fix only the most misleading and blatantly incorrect translations (e.g., missing negation). 鈥淔ull鈥 post-editing is the task of bringing MT output up to your usual, high standard for human translation quality. Unfortunately, it鈥檚 hard for translators and clients to agree on when we鈥檙e done with 鈥渓ight鈥 post-editing, so it鈥檚 a headache to manage. 鈥淢edium鈥 post-editing has appeared as an option that鈥檚 easier to manage: fix only terminology and grammar, and don鈥檛 worry about style and tone versus 鈥渇ull鈥 post-editing, where we have to fix everything.

We read that improvements in MT technology for spoken language applications are being driven by the interpreting requirements of military operations overseas. How will those improvements impact civilian interpreters working in our usual environments (medical, legal, corporate, etc.)?

Military operations overseas have incredibly demanding requirements for speech-to-speech MT. This technology has to interpret between uneducated speakers of unusual dialects of unheard-of languages and more educated speakers of varying dialects of English using machines with no Internet connection in extreme weather conditions. The equipment has to be light enough to carry along with a 40-pound backpack and robust enough for a truck to drive over, with batteries that last for weeks. And speech recognition has to work in the middle of traffic and gunfire. Oh, and we have to build the system with next to no example sentences (data collection in a war zone isn鈥檛 easy) and make sure that it can cover a wide range of topics. If we can make progress on any of these fronts, then civilian interpreting technology will certainly improve. Right now, soldiers in the field don鈥檛 have any MT systems to use. They rely on human interpreters.

Is there a healthy level of international cooperation in MT development? At what level does that occur: government, military, industry, or academia?

Yes! The International Association for Machine Translation holds its MT Summit every other year to gather together the global MT community. Researchers, both in industry and academia, collaborate routinely across national boundaries. Multinational companies hire people from around the world. I鈥檝e consulted for the European Community and worked on MT projects in at least five countries.

How does a machine learn鈥攁nd keep learning鈥攈ow to translate? How does it incorporate new words, phrases, and terminology into its repertoire? How does it expand its ability to process syntactical shifts and other linguistic features as language evolves?

Machines 鈥渓earn鈥 by ingesting, analyzing, and storing information about example human translations. Notice the scare quotes: machines only 鈥渒now鈥 what they鈥檝e seen. An MT system has a huge database of all the words it has ever seen, all the translations for each word it has ever seen, and all the contexts in which both the word and its translations have occurred. It 鈥渓earns鈥 by adding more human examples to this database and by recalculating the most likely translations for each source sequence. It evolves by acquiring more information about some words and sentences than about others. We have to feed the system with more example translations continuously. Lots and lots of example translations, until we get good coverage of the words and sentence types that we need for a specific project. And we need linguists to do this kind of 鈥渇eeding鈥 work.

As the volume of available content requiring translation increases exponentially daily, it鈥檚 clear that human translators cannot supply the need. Can current levels of MT keep up?

MT can keep up in terms of quantity, but not quality. The challenge is the increase in valuable content, which MT can鈥檛 handle well and is already too much for humans to handle. Today, a great deal of valuable content simply goes untranslated.

We read that 14 languages are required to reach 90% of the world鈥檚 most economically active populations, but most websites can only deliver content in about seven. What are those seven languages? Which languages will be next as capacity increases?

These are what companies call the Tier 1 languages. Although the list varies from company to company, it usually includes English, Simplified Chinese, Spanish, Russian, Japanese, German, and French. The next batch, the Tier 2 languages, varies more according to each company鈥檚 international strategy, but usually includes Korean, Arabic, Italian, Indonesian, Dutch, Traditional Chinese, and the Scandinavian languages.

Finally, please tell us what developments in MT we can expect to see in the near-to-medium future, and where translators and interpreters fit in that evolving scenario.

In my opinion, the most interesting areas of MT research are domain adaptation, neural MT, and hybrid-intelligence systems.

Domain adaptation is the part of an MT system that tries to pick and choose which translations, of all the millions of translations the system has seen, will be most relevant for your specific project. There鈥檚 some really fascinating work going on in Europe to make this adaptation faster and more accurate so that we get much better candidate translations.

Neural MT, the hottest, latest fashion in research circles, uses much more contextual information far more efficiently. It promises far better word sense disambiguation so that we get more accurate word choice in the candidate translations it proposes. It鈥檚 also producing grammatically better candidate translations. Neural MT is already showing up in online MT systems and many more improvements are sure to come.

Hybrid-intelligence translation systems will someday let the translator 鈥渄rive.鈥 The main assumption is that for the foreseeable future, MT won鈥檛 be able to do publication-level translation of valuable information on its own. So, we must find ways to merge the things that MT systems do well with the things that only human translators can do well. The first systems of this type we call adaptive MT, which is built bottom-up for translators: when you (or your team) correct a translation, your corrections are applied immediately to the remaining unreviewed sentences. An adaptive MT system 鈥渓earns鈥 much more relevant and more reliable information (for that particular project), learns it much faster, and presents it back to the translator much more quickly than ever before. In future systems, translators will not only correct a machine鈥檚 output; they will also teach it linguistic rules, stylistic preferences, and project-specific idiosyncrasies.

Thank you, Mike, for this highly illuminating review of your fascinating field.

Notes
  1. Plumb, Robert K. 鈥淩ussian Is Turned into English By a Fast Electronic Translator,鈥 The New York Times (January 8, 1954), 1, .

    Also see: Hutchins, John. 鈥淭he Georgetown-IBM Experiment Demonstrated in January 1954鈥 (Association for Machine Translation in the Americas), .


Tony Beckwith was born in Buenos Aires, Argentina, spent his formative years in Montevideo, Uruguay, then set off to see the world. He moved to Texas in 1980 and currently lives in Austin, Texas, where he works as a writer, translator, poet, and cartoonist. Contact: tony@tonybeckwith.com.

Sources for More Information on Machine Translation


ADAPT Centre


Association for Machine Translation in the Americas


International Association of Machine Translation


Lilt

 

Share this

Posts navigation

← Congratulations!
Advocacy Matters →

Latest Posts

  • What Is Audio Description? May 11, 2026
  • Coming Soon: ATA Microcredential Series May 4, 2026
  • Introducing the ATA Learning Hub! May 4, 2026
  • Member News May 4, 2026
  • Texas Court Interpreter Detained by ICE at Airport Says She鈥檚 Been 鈥楬umiliated and Treated Like a Criminal鈥 May 4, 2026
  • U.S. Department of Education Dissolving Federal Office Serving English Learners May 4, 2026
  • Northern Ireland to Offer Free Sign Language Classes for Deaf Children May 4, 2026
  • Washington State Passes Law to Promote Consistent Language Access May 4, 2026
  • Breaking News: Texas Interpreter Meenu Batra Released from ICE Custody May 4, 2026
  • Newsbriefs: April 30, 2026 May 1, 2026

Topics

  • Advocacy & Outreach
  • Annual Conference
  • Book Reviews
  • Business Strategies
  • Certification Exam
  • Certification Program
  • Client Assistance
  • Educators and Trainers
  • Growing Your Career
  • Industry News
  • Interpreting
  • Member Benefits
  • Member News
  • Mentoring
  • Networking
  • Public Outreach
  • Publications
  • Resources
  • School Outreach
  • Specializations
  • Starting Your Career
  • Student Resources
  • Tools and Technology
  • Translation
Find a Translator听 or Interpreter
ata_logo_footer

情侣自拍
211 N. Union Street, Suite 100
Alexandria, VA 22314

Phone +1-703-683-6100
Fax +1-703-778-7222

  • Certification
  • Career and Education
  • Client Assistance
  • Events
  • News
  • Member Center
  • About Us
  • Contact Us
  • Sitemap
  • Privacy Policy
  • Accessibility Statement
  • Submit Feedback

漏 2026 -听情侣自拍

Find a Translator or Interpreter
Scroll To Top
By clicking accept or closing this message and continuing to use this site, you agree to our use of cookies.