Implementing Multilingual Support with Amazon Translate - Real-Time Translation and Custom Terminology
Learn about real-time translation with Translate, improving translation quality with custom terminology, and leveraging batch translation for large-scale document processing.
Translate Overview
Amazon Translate is a neural machine translation service that covers 75 languages. The real-time TranslateText API accepts up to 10,000 bytes of UTF-8 text per request, and real-time document translation accepts files of up to 100 KB. Use the real-time API to add multilingual support to web applications and chat, and use batch translation to process large volumes of documents in one job. The engine is comparable to Google Translate or DeepL, but runs on AWS infrastructure.
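A minimal sketch of a real-time call with boto3 follows. It assumes the 10,000-byte TranslateText limit and splits longer input on line breaks before calling the API. The helper names (make_client, split_for_translate, translate) are hypothetical, and AWS credentials and a region are assumed to be configured in the environment.

```python
from typing import List

MAX_BYTES = 10_000  # assumed TranslateText limit, measured in UTF-8 bytes


def make_client():
    """Create a Translate client (hypothetical helper; needs boto3 and AWS credentials)."""
    import boto3  # imported lazily so the helpers below work without boto3 installed
    return boto3.client("translate")


def split_for_translate(text: str, max_bytes: int = MAX_BYTES) -> List[str]:
    """Group lines into chunks whose UTF-8 size stays within max_bytes.

    A single line longer than max_bytes is returned as its own oversized
    chunk; a real application would need to split it further.
    """
    chunks, lines, size = [], [], 0
    for line in text.split("\n"):
        line_size = len(line.encode("utf-8"))
        extra = line_size + (1 if lines else 0)  # +1 for the joining newline
        if lines and size + extra > max_bytes:
            chunks.append("\n".join(lines))
            lines, size = [], 0
            extra = line_size
        lines.append(line)
        size += extra
    if lines:
        chunks.append("\n".join(lines))
    return chunks


def translate(client, text: str, source_lang: str, target_lang: str) -> str:
    """Translate text chunk by chunk and rejoin the results with newlines."""
    results = []
    for chunk in split_for_translate(text):
        if not chunk.strip():
            results.append(chunk)  # nothing to translate, skip the API call
            continue
        response = client.translate_text(
            Text=chunk,
            SourceLanguageCode=source_lang,  # "auto" enables language detection
            TargetLanguageCode=target_lang,
        )
        results.append(response["TranslatedText"])
    return "\n".join(results)
```

With credentials in place, a call such as translate(make_client(), "Hello, world", "en", "ja") would return the Japanese text. Splitting on line breaks keeps sentences intact, which generally matters more for translation quality than filling each request to the limit.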
Custom Terminology and Batch Translation
Custom terminology defines term pairs in CSV, TSV, or TMX format so that the specified target terms are always used during translation. For example, mapping the product name "Amazon Aurora" to itself keeps it in its original form instead of letting the engine translate it. Batch translation is asynchronous: it reads documents from an input folder in an S3 bucket and writes the translated results to an output folder. It supports plain text, HTML, DOCX, PPTX, XLSX, and XLIFF, and preserves the tag structure of marked-up formats during translation.
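The terminology workflow can be sketched as below, assuming a CSV file whose header row holds the language codes and whose remaining rows hold term pairs. The function names and the terminology name passed in are hypothetical; import_terminology and the TerminologyNames parameter are the boto3 calls this sketch relies on.

```python
import csv
import io
from typing import Dict


def build_terminology_csv(source_lang: str, target_lang: str,
                          terms: Dict[str, str]) -> bytes:
    """Build a terminology CSV: a header of language codes, then one pair per row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow([source_lang, target_lang])
    for source_term, target_term in terms.items():
        writer.writerow([source_term, target_term])
    return buffer.getvalue().encode("utf-8")


def upload_terminology(client, name: str, csv_bytes: bytes) -> None:
    """Create the terminology, or overwrite it if the name already exists."""
    client.import_terminology(
        Name=name,
        MergeStrategy="OVERWRITE",
        TerminologyData={"File": csv_bytes, "Format": "CSV"},
    )


def translate_with_terms(client, text: str, source_lang: str,
                         target_lang: str, terminology_name: str) -> str:
    """Translate text while applying a previously uploaded terminology."""
    response = client.translate_text(
        Text=text,
        SourceLanguageCode=source_lang,
        TargetLanguageCode=target_lang,
        TerminologyNames=[terminology_name],
    )
    return response["TranslatedText"]
```

Mapping a term to itself, as in build_terminology_csv("en", "ja", {"Amazon Aurora": "Amazon Aurora"}), is the usual way to keep a product name untranslated.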
Parallel Data and Active Custom Translation
Parallel data (parallel corpora) provides pairs of source text and reference translations in CSV, TSV, or TMX format, adapting the translation output to domain-specific expressions. The benefit tends to be largest in fields with heavy specialized terminology, such as technical, legal, and medical documents. Active Custom Translation (ACT) applies parallel data to batch translation jobs at run time, without training a separate custom model, and produces more natural, context-aware translations than custom terminology alone.
Profanity masking detects profane words and phrases and replaces them with a fixed masking string. Formality settings control the register of the output (formal or informal), so you can use one tone for business documents and another for casual chat. Both settings are available only for a subset of language pairs.
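As an illustration, the sketch below assembles the request for a batch job that uses parallel data together with the formality and profanity settings. It assumes the parallel data resource has already been created and that an IAM role with access to both S3 locations exists. The function names, job name, bucket URIs, and role ARN are hypothetical placeholders.

```python
from typing import Any, Dict, Optional


def build_batch_request(
    job_name: str,
    input_s3_uri: str,
    output_s3_uri: str,
    role_arn: str,
    source_lang: str,
    target_lang: str,
    content_type: str = "text/html",
    parallel_data_name: Optional[str] = None,
    formality: Optional[str] = None,  # "FORMAL" or "INFORMAL"
    mask_profanity: bool = False,
) -> Dict[str, Any]:
    """Build keyword arguments for start_text_translation_job."""
    request: Dict[str, Any] = {
        "JobName": job_name,
        "InputDataConfig": {"S3Uri": input_s3_uri, "ContentType": content_type},
        "OutputDataConfig": {"S3Uri": output_s3_uri},
        "DataAccessRoleArn": role_arn,
        "SourceLanguageCode": source_lang,
        "TargetLanguageCodes": [target_lang],
    }
    if parallel_data_name:
        # Supplying parallel data turns the job into an ACT job.
        request["ParallelDataNames"] = [parallel_data_name]
    settings: Dict[str, str] = {}
    if formality:
        settings["Formality"] = formality
    if mask_profanity:
        settings["Profanity"] = "MASK"
    if settings:
        request["Settings"] = settings
    return request


def start_batch_job(client, request: Dict[str, Any]) -> str:
    """Submit the job and return its ID; the job itself runs asynchronously."""
    return client.start_text_translation_job(**request)["JobId"]
```

Keeping request construction separate from submission makes the configuration easy to inspect or unit-test before any job is started.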
Translate Pricing
Translate pricing is based on the number of characters submitted for translation, with real-time translation costing approximately $15 per million characters. Standard batch translation uses the same per-character rate but lets you process large document volumes in bulk via S3. The free tier covers 2 million characters per month for the first 12 months. There is no additional charge for using custom terminology. Active Custom Translation is not billed by training time; batch jobs that use parallel data are charged at a higher per-character rate, approximately $60 per million characters. You can reduce costs by stripping markup and metadata that does not need translating before you submit text, and by adding a caching layer so that identical text is not translated twice.
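The arithmetic and the caching idea can be sketched as follows, using the approximate $15 per million characters rate and the 2 million character free tier quoted above. The class and function names are hypothetical, and the in-memory dictionary stands in for whatever cache store a real application would use.

```python
import hashlib
from typing import Callable, Dict, Tuple

PRICE_PER_MILLION_USD = 15.0  # approximate standard rate quoted in the text
FREE_TIER_CHARS = 2_000_000   # monthly free tier during the first 12 months


def estimate_monthly_cost(chars: int, free_tier: bool = False) -> float:
    """Estimate the monthly charge in USD for a given character count."""
    billable = max(0, chars - (FREE_TIER_CHARS if free_tier else 0))
    return billable * PRICE_PER_MILLION_USD / 1_000_000


class CachedTranslator:
    """Wrap a translate function so identical requests are only sent once."""

    def __init__(self, translate_fn: Callable[[str, str, str], str]):
        self._translate_fn = translate_fn
        self._cache: Dict[Tuple[str, str, str], str] = {}
        self.billed_chars = 0  # characters actually sent to the service

    def translate(self, text: str, source_lang: str, target_lang: str) -> str:
        digest = hashlib.sha256(text.encode("utf-8")).hexdigest()
        key = (digest, source_lang, target_lang)
        if key not in self._cache:
            self._cache[key] = self._translate_fn(text, source_lang, target_lang)
            self.billed_chars += len(text)
        return self._cache[key]
```

At the quoted rate, 10 million characters in a month comes to about $150, or about $120 while the free tier applies. Hashing the text keeps cache keys small when the same long passages recur, for example UI strings or repeated boilerplate.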
Summary
Amazon Translate enables multilingual support across 75 languages using neural machine translation. Custom terminology controls how product names and specialized terms are translated, while Active Custom Translation adapts batch translation output to a domain through parallel data. Formality settings let you switch between formal and informal tones, and batch translation automates the multilingual conversion of large content volumes.