Explore tens of thousands of sets crafted by our community.
Machine Translation Terms
15
Flashcards
0/15
Corpus
A large and structured set of texts used in machine translation for statistical analysis and hypothesis testing.
BLEU Score
Bilingual Evaluation Understudy, a metric for evaluating the quality of text which has been machine-translated from one natural language to another.
Parallel Corpus
A collection of texts, each of which is translated into one or more languages, aligned with their translations.
Phrase-Based Translation
A type of statistical machine translation where translations are based on phrases (chunks of words) rather than individual words.
Decoder
A component of a machine translation system that generates the target language translation from a model's representations.
Encoder
A component of a machine translation system that converts the source language input into a meaningful, fixed-sized representation.
NMT
Neural Machine Translation, a machine translation approach that uses a large neural network capable of learning to translate between languages from large datasets.
SMT
Statistical Machine Translation, a machine translation technique that uses statistical models to generate translations based on analysis of bilingual text corpora.
Back-Translation
A method where a text is translated from a source language to a target language, and then back to the source language by a different system or manually to check translation accuracy.
Attention Mechanism
A component in neural machine translation models that allows the model to focus on different parts of the input sequence when generating each word of the output sequence.
Tokenization
The process of dividing a sequence of textual material into smaller parts called tokens, such as words or phrases, often as a first step in machine translation.
Domain Adaptation
The process by which a machine translation system is adapted to better perform on a specific subject or field, often by using a specialized corpus.
Post-Editing
The process of making corrective changes to a translated text produced by a machine translation system, typically done by a human translator.
MT Evaluation Metrics
Standards or measures used to assess the quality and performance of machine translation outputs, such as BLEU, METEOR, and TER.
Machine Translation
The process of using automated software to translate text or speech from one language to another.
© Hypatia.Tech. 2024 All rights reserved.