# Machine Translation Approaches

**[Rule-based machine translation](https://en.wikipedia.org/wiki/Rule-based_machine_translation) (RBMT)** generates translations on the basis of morphological, syntactic, and semantic analysis of both the source and the target languages.

- Direct systems (Dictionary-based translation)
- Transfer-based MT
- Interlingua-based MT

**[Corpus](https://en.wikipedia.org/wiki/Text_corpus)-based machine translation (CBMT)** generates translations on the basis of the analysis of [bilingual text](https://en.wikipedia.org/wiki/Bilingual_text) corpora.

- Statistical MT
- Example-based MT



> RBMT belongs to the domain of rationalism and CBMT to that of empiricism. Given large-scale and fine-grained linguistic rules, RBMT systems are capable of producing translations of reasonable quality, but constructing the system is very time-consuming and labor-intensive because such linguistic resources need to be hand-crafted, which is frequently referred to as the [knowledge acquisition](https://en.wikipedia.org/wiki/Knowledge_acquisition) problem. Moreover, it is very difficult to correct the input or add new rules to the system to generate a translation. By contrast, adding more examples to a CBMT system can improve the system since it is based on the data, though the accumulation and management of the huge bilingual data corpus can also be costly.



## Rule-based Machine Translation (RBMT)

- "Classical Approach" of MT
- Based on **linguistic information** about the source and target languages, mainly retrieved from (unilingual, bilingual or multilingual) **dictionaries** and **grammars** covering the main semantic, morphological, and syntactic regularities of each language.

### Types of RBMT

**Vauquois Triangle**

<img src="https://www.setthings.com/wp-content/uploads/2015/01/Direct_translation_and_transfer_translation_pyramind-1.jpg" alt="Vauquois triangle (direct translation and transfer translation pyramid)" style="zoom: 67%;" />

#### Direct Systems (Dictionary-based translation)

- Translation by **dictionary lookups**
  - Word by word, usually without considering how the meanings of the words relate to each other
- Earliest approach to MT

- Translation memory
  - Store many translations
  - Present **closest** match for new sentence
  - Useful only when the same or a similar sentence is expected to show up again, but the quality is then high
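
Both ideas above can be sketched in a few lines of Python. This is only a minimal illustration: the dictionary entries, the stored sentence pairs, and the helper names `translate_word_by_word` and `closest_memory_match` are made up for this example, and string similarity from the standard library's `difflib` stands in for whatever matching a real translation memory uses.

```python
import difflib

# Hypothetical toy English -> German dictionary (illustrative entries only).
DICTIONARY = {"the": "das", "house": "Haus", "is": "ist", "small": "klein"}

def translate_word_by_word(sentence):
    """Direct translation: look up each word; unknown words are kept as-is."""
    return " ".join(DICTIONARY.get(w, w) for w in sentence.lower().split())

# Hypothetical translation memory: previously translated sentence pairs.
MEMORY = {
    "the house is small": "das Haus ist klein",
    "the door is open": "die Tür ist offen",
}

def closest_memory_match(sentence, cutoff=0.6):
    """Return (stored source, stored translation) of the closest match, or None."""
    matches = difflib.get_close_matches(
        sentence.lower(), list(MEMORY), n=1, cutoff=cutoff
    )
    if not matches:
        return None
    return matches[0], MEMORY[matches[0]]

print(translate_word_by_word("The house is small"))
print(closest_memory_match("the house is very small"))
```

The word-by-word function ignores context entirely, which is why direct systems struggle with ambiguity and word order. The memory lookup only helps when a stored sentence is close enough to the new one.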

#### Transfer-based MT / Transfer RBMT Systems

- One of the most widely used rule-based methods of machine translation

- Steps:

  1. Analyze into syntactic structure (i.e. parsing)

     - Morphological analysis -> root form

     - Dependency structure

  2. Transfer source language structure into target language

     - Reordering, inserting and deleting

     - Converting

     - Also lexical transfer

  3. Generate into target language

     - Full word form

     - Agreement: case, number, gender

- Problems: high effort for writing grammars, lack of semantics
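
The three steps can be sketched as a toy English-to-Spanish pipeline for a single noun phrase. Everything here is an assumption made for illustration: the lexicons, the single reordering rule (adjective before noun becomes noun before adjective), and the function names `analyze`, `transfer`, and `generate`. A real transfer system works on full parse trees and far larger grammars.

```python
# Hypothetical toy lexicons; entries are illustrative only.
ANALYSIS_LEXICON = {  # surface form -> (root form, part of speech)
    "the": ("the", "DET"),
    "red": ("red", "ADJ"),
    "house": ("house", "NOUN"),
    "houses": ("house", "NOUN"),
    "car": ("car", "NOUN"),
}
PLURAL_FORMS = {"houses"}
TRANSFER_LEXICON = {"the": "el", "red": "rojo", "house": "casa", "car": "coche"}
NOUN_GENDER = {"casa": "f", "coche": "m"}
DETERMINERS = {("m", "sg"): "el", ("f", "sg"): "la",
               ("m", "pl"): "los", ("f", "pl"): "las"}

def analyze(sentence):
    """Step 1: morphological analysis into root form, part of speech, number."""
    tokens = []
    for word in sentence.lower().split():
        lemma, pos = ANALYSIS_LEXICON[word]
        number = "pl" if word in PLURAL_FORMS else "sg"
        tokens.append({"lemma": lemma, "pos": pos, "number": number})
    return tokens

def transfer(tokens):
    """Step 2: lexical transfer plus one reordering rule (ADJ NOUN -> NOUN ADJ)."""
    out = [dict(t, lemma=TRANSFER_LEXICON[t["lemma"]]) for t in tokens]
    i = 0
    while i < len(out) - 1:
        if out[i]["pos"] == "ADJ" and out[i + 1]["pos"] == "NOUN":
            out[i], out[i + 1] = out[i + 1], out[i]
            i += 2
        else:
            i += 1
    return out

def generate(tokens):
    """Step 3: full word forms with gender and number agreement."""
    noun = next(t for t in tokens if t["pos"] == "NOUN")
    gender, number = NOUN_GENDER[noun["lemma"]], noun["number"]
    plural = "s" if number == "pl" else ""
    words = []
    for t in tokens:
        if t["pos"] == "DET":
            words.append(DETERMINERS[(gender, number)])
        elif t["pos"] == "ADJ":
            words.append(t["lemma"][:-1] + ("a" if gender == "f" else "o") + plural)
        else:  # NOUN
            words.append(t["lemma"] + plural)
    return " ".join(words)

def translate(sentence):
    return generate(transfer(analyze(sentence)))

print(translate("the red houses"))
print(translate("the red car"))
```

Even this tiny fragment needs hand-written lexicons and rules for every phenomenon it covers, which hints at the grammar-writing effort noted above.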

#### Interlingua-Based MT / Interlingual RBMT Systems

- Language independent structure

- Steps

  1. Analyze source language into interlingua (an abstract language-independent representation)

  2. Generate into target language

- 👍 Advantages

  - For n languages, only n analysis and n generation grammars are needed instead of n(n-1) transfer components
  - Translate back into source language -> paraphrase
  - Handle ungrammatical speech

- 👎 Disadvantages

  - Time consuming (expensive!)
  - Representation should not contain ambiguities
  - Realistic only for limited domains
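
To make the component-count advantage listed above concrete, the following small sketch compares the two formulas for a few values of n. The function names are chosen here for illustration only.

```python
def transfer_components(n):
    """One transfer component per ordered language pair: n(n-1)."""
    return n * (n - 1)

def interlingua_components(n):
    """One analysis grammar plus one generation grammar per language: 2n."""
    return 2 * n

for n in (2, 3, 5, 10):
    print(n, transfer_components(n), interlingua_components(n))
```

The two counts are equal at n = 3 (six components each). Beyond that the transfer count grows quadratically while the interlingua count grows linearly, for example 90 versus 20 at n = 10.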



## Corpus-based MT (CBMT)

### Statistical MT (SMT)

- Translations are generated on the basis of statistical models whose parameters are derived from the analysis of bilingual text corpora.

- Automatically align words and phrases within sentence pairs in a parallel corpus
- Probabilities are determined automatically by training statistical models using the parallel corpus
- Using statistical models
  - Create many alternatives, called **hypotheses**
  - Give a score to each hypothesis
  - Select the **best** -> search
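
The hypothesize, score, and search loop can be sketched as follows. This is a simplified illustration under several assumptions: the probabilities are invented rather than trained, the score combines a word translation probability with a bigram language model in log space, word order is kept monotone, and the search is an exhaustive enumeration rather than the beam search a real decoder would typically use.

```python
import itertools
import math

# Hypothetical probabilities; a real system estimates them from a parallel corpus.
TRANSLATION_PROBS = {  # source word -> {target word: P(target | source)}
    "das": {"the": 0.7, "that": 0.3},
    "haus": {"house": 0.8, "home": 0.2},
}
BIGRAM_PROBS = {  # (previous word, word) -> P(word | previous word)
    ("<s>", "the"): 0.5, ("<s>", "that"): 0.2,
    ("the", "house"): 0.3, ("the", "home"): 0.05,
    ("that", "house"): 0.1, ("that", "home"): 0.02,
}

def hypotheses(source_words):
    """Create all alternatives: one translation option per source word."""
    options = [list(TRANSLATION_PROBS[w].items()) for w in source_words]
    return list(itertools.product(*options))

def score(hypothesis):
    """Log-probability under the translation model and the bigram language model."""
    total, prev = 0.0, "<s>"
    for word, translation_prob in hypothesis:
        lm_prob = BIGRAM_PROBS.get((prev, word), 1e-6)  # small floor for unseen bigrams
        total += math.log(translation_prob) + math.log(lm_prob)
        prev = word
    return total

def decode(sentence):
    """Search: return the highest-scoring hypothesis as a string."""
    best = max(hypotheses(sentence.lower().split()), key=score)
    return " ".join(word for word, _ in best)

print(decode("das Haus"))
```

Enumerating every hypothesis is only feasible for toy inputs, since the number of alternatives grows exponentially with sentence length.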

### Example-based MT (EBMT)

- Characterized by its use of a bilingual corpus with parallel texts as its main knowledge base
- Main idea: Translation by analogy
- Tasks:
  - example acquisition
  - example base and management
  - example application
  - synthesis
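
Translation by analogy can be illustrated with a very small sketch: find a stored example that differs from the input in exactly one word, then substitute that word's translation in the stored target sentence. The example base, the word-level correspondences in `WORD_PAIRS`, and the function name `translate_by_analogy` are hypothetical, and real EBMT systems match and recombine fragments in far more flexible ways.

```python
# Hypothetical example base of (English, German) sentence pairs.
EXAMPLES = [
    ("i like apples", "ich mag Äpfel"),
    ("she reads books", "sie liest Bücher"),
]
# Hypothetical word-level correspondences acquired from examples.
WORD_PAIRS = {"apples": "Äpfel", "books": "Bücher", "pears": "Birnen"}

def translate_by_analogy(sentence):
    """Reuse a stored example that matches exactly or differs in one known word."""
    words = sentence.lower().split()
    for source, target in EXAMPLES:
        source_words = source.split()
        if len(source_words) != len(words):
            continue
        diffs = [(old, new) for old, new in zip(source_words, words) if old != new]
        if not diffs:
            return target  # exact match: reuse the stored translation
        if len(diffs) == 1:
            old, new = diffs[0]
            if old in WORD_PAIRS and new in WORD_PAIRS:
                # Synthesis: swap the differing word in the stored translation.
                return target.replace(WORD_PAIRS[old], WORD_PAIRS[new])
    return None  # no usable example found

print(translate_by_analogy("I like pears"))
```

The lookup loop corresponds to example application and the substitution to synthesis. Example acquisition and management are reduced here to two hand-written tables.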



## Neural MT

Model MT using **neural networks**

- Improve generalization

  - Compact representation

- End-to-end training

- Learn word embeddings

  - Continuous representation of words
  - Similar words learn similar representations

- Computationally very expensive

  - Efficient when run on GPUs

- Great improvements for some language pairs 👏
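
The idea that similar words receive similar continuous representations can be illustrated with cosine similarity. The three-dimensional vectors below are hand-picked for this example and are not learned. A real NMT model learns embeddings with hundreds of dimensions during end-to-end training.

```python
import math

# Hypothetical 3-dimensional embeddings, hand-picked for illustration only.
EMBEDDINGS = {
    "cat": [0.9, 0.8, 0.1],
    "dog": [0.8, 0.9, 0.2],
    "car": [0.1, 0.2, 0.9],
}

def cosine_similarity(u, v):
    """Cosine of the angle between two vectors; 1.0 means same direction."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def most_similar(word):
    """Return the other vocabulary word whose embedding is closest to `word`."""
    return max(
        (w for w in EMBEDDINGS if w != word),
        key=lambda w: cosine_similarity(EMBEDDINGS[word], EMBEDDINGS[w]),
    )

print(most_similar("cat"))
print(round(cosine_similarity(EMBEDDINGS["cat"], EMBEDDINGS["car"]), 2))
```

Because related words end up close together in the vector space, what the model learns about one word can carry over to its neighbors, which is one way to read the "improve generalization" point above.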



## Pivot MT

- Use a natural language as the intermediate representation
- Example
  - Instead of: German – French
  - Use:
    - German – English
    - English – French
- 👍 Advantage: More data
- 👎 Difficulty: Additional ambiguity
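
The additional ambiguity can be seen in a small sketch that chains two word-level dictionaries through English. The dictionaries and the function name `pivot_candidates` are made up for this illustration. Real pivot systems chain full translation systems rather than word lists.

```python
# Hypothetical toy dictionaries (illustrative entries only).
DE_EN = {"schloss": ["castle", "lock"]}
EN_FR = {"castle": ["château"], "lock": ["serrure", "écluse"]}

def pivot_candidates(word, first, second):
    """Translate via the pivot language and collect every reachable target word."""
    candidates = []
    for pivot in first.get(word.lower(), []):
        for target in second.get(pivot, []):
            if target not in candidates:
                candidates.append(target)
    return candidates

print(pivot_candidates("Schloss", DE_EN, EN_FR))
```

German "Schloss" has two senses here (castle, door lock), but going through English "lock" also brings in "écluse" (a canal lock), a sense the German word does not have. The pivot step has added an ambiguity that a direct German-to-French system would not face.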





## Reference

- [Comparison of different machine translation approaches](https://en.wikipedia.org/wiki/Comparison_of_different_machine_translation_approaches)
- [Machine Translation](https://en.wikipedia.org/wiki/Machine_translation#Example-based)



