CAT is a term for computer-assisted translation. It is a software that constantly stores (when active) pairs of translated and original texts in its translation memory (TM). When a translator translates a document, the software suggests all the previously translated texts of his TM.

CAT is a term that is not used for machine translation, which is a different thing. CAT software generally works hand in hand with editors such as MS Word or Openoffice.org; compares the text stored in TM with that of the document and gives suggestions to the translator.

There are many commercial CAT tools on the market today such as Trados, minimalist Wordfast or Metatexis. Wordfast, for example, is not standalone software and requires MS Word (it works like a macro).

Anaphraseus: CAT with OpenOffice.org

Anaphraseus as an extension for OpenOffice.org is installed directly from the OOo menu: Tools | Extension Manager, where you just click on the Get more extensions here … text. On the OOo extensions website, which appears in your default browser after clicking the text above, you will find Anaphraseus.

You will get a file with an OXT extension (some previous versions used a Zip format). After downloading it, click the Add button in the OOo Extension Manager window and browse for the OXT file on your PC. When done, read the license and scroll down to accept it. After clicking the OK button, Anaphraseus will be permanently moved to your OOo Extension Manager.

Anaphraseus does not depend on the platform but on OOo. It works in OpenOffice, it doesn’t matter if you work with Windows, Linux, FreeBSD, OS / 2 or even OpenBSD. This nice CAT extension is compatible with OpenOffice.org 2.1 and higher; StarOffice 8, Update 5 or higher, and will give you the following capabilities:

* Term of recognition

* Blurred search

* Export / Import Unicode UTF-16 TMX

* Plain text and Unicode UTF-16TM

* User glossary

After installation, you must restart OpenOffice.org for the changes to take effect. You will then see a new panel with the Anaphraseus icons appearing in the OOo environment.

To carry out your first translation, create (or import) your 1) Translation Memory; 2) open a document with the source language you want; and 3) start translating it by clicking Alt + Down button in Anaphraseus icon panel.

Translation memory

The first step is to create (or possibly import) your TM. To do this, click on the Anaphraseus Settings icon, select New and then enter the relevant TMX codes, which you will need for cases where you decide to export your TM later (see the Little Glossary at the bottom). You will need a separate TM for each language combination, for example if you translate from German to Polish, this combination, i.e. this TM is not good for translation from Polish to German.

The software allows you to work with many translation memories. You can use them for any translation job; for example, bible-italian2eng.txt (Italian to English) or bible-eng2italian.txt (English to Italian) will be your TMs for Bible translation projects (both Wordfast and Anaphraseus use the TXT format in their TMs).

A small CAT glossary

TMX

In computer-assisted translation, you use the Translation Memory Exchange (TMX) (XML) format because translators often need to migrate (export / import) their TMs to a variety of CAT tools they use. It is the right of the translator to choose any CAT software and in case a group of translators work on a project, they can share their TMs. Many CAT tools use their own (proprietary) translation memory formats and TMX helps translators and translation agencies to share their translation memories easily. For example, you export your TM from a proprietary CAT application format (like Trados, etc.) to TMX format and then import this TMX format to Anaphraseus (or any other CAT tool).

TMX is a type of database with multiple codes identifying languages ​​(CS-01 for Czech language, EN-US for US English, etc.).

Unicode

If Anaphraseus asks if you want to use Unicode, you should be aware that CAT software may have trouble displaying words with diacritics like those used in Eastern European languages. By the term “show” I mean that once the source and target sentences reach the TM, Anaphraseus will compare the source sentence in the document with that of your TM and show you the target sentence if it meets certain criteria . With Unicode font it will correctly display all fonts. If you don’t work with Western-type languages, it’s always a good idea to use Unicode.

Clean up

The term “clean document” in CAT terminology means that you delete the original text (source) of the document, which remains there for editing purposes. Both the source and destination segments are delimited with colored markers such as {0> and you cannot remove them from the document (of course you can, but only by “cleaning the document”). Authors of CAT tools know that translators must compare the original text with the translated text even after the translation is complete. In addition to the fact that Anaphraseus (and many other CAT tools) save sentence pairs in their TM, you will also have these pairs embedded in the document until you clean it up.

If the document has not yet been cleaned up, you can always click the Down Arrow button on the Anaphraseus toolbar, compare the source (original) text with the translated text, and continue editing. When you’re done, choose CLEAN. The software will ask if you want to update your translation memory. All color markers and source sentences will disappear from the document and you will only see your final work (the text you translated).

conclusion

Anaphraseus does not have all the functions of commercial applications (such as Pandora’s box, etc.), but not all translators need comprehensive solutions every hour and every day. OpenOffice.org has thus become not only a complex and very useful tool for translators, but also a small star on your way to freedom.

Related Post

Leave a Reply

Your email address will not be published. Required fields are marked *