logo
Published on A Technical Communication Community (http://www.techwr-l.com)

Good, Fast, Cheap: Translation Memory Systems Offer the Potential for All Three

By ejray
Created 2007-05-29 04:57

by Deborah S. Ray and Eric J. Ray

Boss: Beginning in three months, we'll be making our products available to markets in Europe and Asia. We'll need all Getting Started Guides, Tutorials, User Documentation, Advanced System Reference Manuals, Online Help, and marketing materials translated into German, French, Spanish, Italian, and Japanese. I need to see a translation plan by Friday morning.

You: Consider it done!

For technical communicators exploring translation services, a relatively new technology can help provide consistency among translated documents, make the translation process more efficient, and make translation projects cost effective. Translation memory systems assist human translators by following along as a document is translated, creating a database of translated material and terminology, and allowing translators to access previously translated material easily. Using this technology, translators can translate, save, and reuse material, making the resulting translations highly consistent and the overall process more efficient and cost effective than working without this technology.


In this exploratory article, we explain the evolution toward translation memory systems, discuss why and when they're particularly useful for helping translate technical documentation, and offer guidelines for determining whether translation memory systems are appropriate for your translation needs.

Reprinted with permission from Technical Communication [0], the journal of the Society for Technical Communication [1]. Originally published in May 1999 (V. 46, No. 2).


The Evolution Toward Tranlation Memory Systems


More than three decades ago, translation technologies emerged in scientific and technical fields, attempting to automate translation among Indo-European languages. Initially, translation technology development focused on machine translation systems, which work by translating basic words, complex words, and sentence structures from one language into another. At a basic level, these early systems could translate words, conjugate verbs, inflect articles and adjectives, and rearrange word order as necessary. And, as the technology advanced, machine translation systems could also recognize a variety of idioms and phrases, as well as improved translation of verb tenses and articles.


The problems of machine translation systems
Researchers quickly discovered, however, problems in automating language translation. For example, early machine translation systems required enormous computing power-far more than was readily available to most users at that time. Because of this, machine translation technology was not widely available, practical, or affordable as a translation tool. So, for a long time, only companies who were researching the technologies or who could afford the mainframe equipment used these systems.


More importantly, despite technological advances and improvements, machine translation systems still could not adequately account for differences in language structure, differences in how languages handle singularity and plurality, ambiguity of pronoun antecedents, the variety of word contexts, and the majority of idioms and nuances inherent in all languages. The resulting machine translations were crude at best, often not even adequately producing the gist of a document.


Over the years, machine translation technology has improved somewhat and is much more widely available than it was even 10 years ago. Today, because of advances in computing technology, machine translation systems are available as software for average PCs and even available on Web sites, making the technology accessible to most companies needing or performing translation services. Figure 1, for example, shows a basic machine translation system that requires people only to enter text and to specify the originating and target languages.


In addition to "gisting" machine translation systems, such as the one in Figure 1 that provides users with the general idea of the material, other, more advanced machine translation systems are also available. These more advanced systems can provide an adequate starting point for professional translators and accommodate specialized vocabularies.


Today's machine translation technology, however, is still far from meeting the initial goal of automating translation. As the following example shows, the same factors that affected translation quality and accuracy thirty years ago still affect them today: language is simply too variable to be automatically translated with the quality and accuracy of the original. For example, the English sentence:


Technical communicators working in a networked environment should consider a new technology that can make developing, maintaining, and reusing structured documents easier than ever before.


is translated into German as:


Die technischen Verbindungen, die in einem vernetzten Klima arbeiten, sollten eine neue Technologie betrachten, die das Entwickeln, das Beibehalten und der strukturierten Dokumente ueberhaupt vorher wiederverwenden bilden kann leicht als.


which re-translated into English reads:


The technical connections, which operate in an interlaced climate, should regard a new technology, those developing, maintaining and the structured documents at all beforehand reuse to form can easily as.


Although word order, tense, and complex grammar pose problems for most machine translation systems, industry-specific terminology and context-dependent meanings, both prevalent in technical documentation, particularly hinder the translation accuracy in this example. In particular, "environment" is incorrectly translated here; however, if "environment" were used to describe a machine room or facility, then the translation (Klima, here) would be more accurate. Even more advanced machine translation systems cannot always definitively identify contexts and translate appropriately.


So, although current machine translations can give readers the overall gist of a document, provide a starting point or rough draft for translation, or automate translations of simplistic documents (like parts lists or specification sheets), they are still far from the automated, publication-quality translations hoped for years ago.


The advent of translation memory systems


Translation technologies continued to emerge, surging again in the 1980's when computing equipment and resources became more widely available. At that time, large translation agencies began using specialized, automated tools to help their translators work more efficiently and consistently. These tools assisted translators by storing previously-translated materials, providing databases of specialized terminology, and allowing translators to access this information for subsequent jobs. These specialized tools marked a different direction in the development of translation technologies: rather than automating the translation process, these technologies were intended to assist human translators, allowing them to work more efficiently, with less repetition, and with more consistency. And, with that, translation memory systems emerged.


Using a translation memory system, a translator can translate, save, and reuse translated segments. The translation memory system, which is usually integrated into a word processing program, follows along as a document is translated and saves translated sentences and passages. Then, when the translator comes across identical or similar material, the translation memory system lets the translator reuse the previously-translated material. As Figure 2 shows, the program allows translators to reuse previous translations, modify them as necessary, or choose from suggested translations.


What's significant to technical communicators is that translation memory systems are particularly suitable for helping translators with highly-technical documentation, large documentation sets, multiple related documents, and documents with specialized vocabularies. Using this technology, translators can offer highly consistent, efficient, and cost-effective translations:


First, translation memory systems can help ensure that the translated documents are consistent, including common definitions, phrasings, terminology, and so on. This is particularly valuable when different translators are working on a single project and when translation jobs span many different documents. The database of translations and terminology that's developed throughout the process helps promote consistency within a documentation set, as well as consistency in subsequent translation projects.


Second, translation memory systems can help speed the overall translation process. Because these systems remember previously-translated material, they eliminate the need for re-translating boilerplate text, instructions that are identical across different documentation series, glossaries of terms common to an industry or documentation set, or warnings and cautions present throughout documentation. The translator can translate the material one time, then reuse or tweak it at subsequent instances.


Third, translation memory systems not only make the translation process consistent and efficient, but also cost effective for long-term translation needs. For example, given a documentation set, a translator might need to translate 100% of the first manual, but only 80% of the next manual, 75% of the next, and so on. Likewise, a warning message could be translated one time and reused throughout an entire documentation set or series of documents for a product. Additionally, because the translation memory system builds a database of terminology and phrasing specific to a product, company, or industry, it makes subsequent translation jobs even more cost effective because a portion of the material is already translated.

Considerations for Translation Memory Systems


Although translation memory systems can help ensure consistency, improve efficiency, and reduce costs, they are not ideal for all documents or situations. In particular, translation memory systems are less effective for stand-alone or short documents, for document sets that are produced using varied writing styles, or for documents that include multiple terminology contexts. In these cases, the translator would likely translate the material from scratch and not benefit at all from the memory aspects that these systems provide.


Additionally, these systems are not ideal for documents that contain what's called "fuzzy matches" in the source material, where non-identical text is used to convey identical material. One document, for example, might refer to a drop-down menu, and another to a select list. Although the translation memory system might suggest the same translation, it would most likely require the translator to re-translate the second instance or choose from suggested translations. Either way, the fuzzy matches in the source documents are not ideal for translation through memory systems because the system does not eliminate or reduce the amount of work the human translator must do.


If you are exploring translation services, consider whether translation memory systems can help:

Consider the documents



Consider the translator or translation agency



Consider the drawbacks



XML reaches translation memory systems


A new translation memory system database format is now available that should help translation databases become more portable. The TMX (Translation Memory eXchange) format is an XML-compliant markup language used to encode translation memory databases for exchange among products from different vendors. Although this format is not yet universally supported, it shows promise of helping the translation industry more easily exchange databases.


Conclusion


For technical communicators exploring translation services, translation memory systems can help translators provide consistent translations, both quickly and cost effectively in the long term. Translation memory systems are a newer translation technology that's used to assist human translators. Rather than actually doing translations, as machine translation systems do, translation memory systems follow along as a document is being translated, creates a database of translations and terminology, and allows translators to reuse previously-translated material. Translation memory systems are ideal for helping translators process large documentation sets and multiple, related documents, as well as accommodating long-term translation needs. Because identical and similar material can be reused, the overall translation is highly consistent, efficient, and cost effective.

For Further Reading


General information



Translation Memory Systems



Acknowledgment



We'd like to thank Bill Burns of International Language Engineering for his insights, suggestions, and feedback.


Figure 1.

Today's machine translation services are widely available and easy to use.

Figure 2.

Translation memory systems allow translators to reuse or tweak previously-translated material, making the translation process efficient and cost effective.


Technology [11]

Writing and Editing [11]

AttachmentSize
translationfigure1.jpg [12]22.43 KB
translationfigure2.jpg [13]26.33 KB
  • [13]
  • [13]

Source URL:
http://www.techwr-l.com/articles/general/translationmemorysystems