# mBART-50 many to many multilingual machine translation
**`mbart-large-50-many-to-many-mmt`** is a checkpoint of [**mBART-large-50**](https://huggingface.co/facebook/mbart-large-50) fine-tuned for multilingual machine translation. It was introduced in the paper [**Multilingual Translation with Extensible Multilingual Pretraining and Finetuning**](https://arxiv.org/abs/2008.00401).

The model can translate directly between any pair of its **50 languages**. To translate into a target language, the **target language id** must be forced as the **first generated token**. To do this, pass the **`forced_bos_token_id`** parameter to the `generate` method.
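
The forcing step described above can be sketched with the model and tokenizer classes directly. This is a minimal sketch rather than the only way to call the model: the helper names `load_mbart50` and `translate` are hypothetical, and it assumes the `transformers` and `torch` packages are installed and that the checkpoint can be downloaded.

```python
MODEL_NAME = "facebook/mbart-large-50-many-to-many-mmt"


def load_mbart50(model_name=MODEL_NAME):
    """Load the checkpoint and its tokenizer (downloads weights on first use)."""
    # Imported lazily so that defining the helpers does not require transformers.
    from transformers import MBart50TokenizerFast, MBartForConditionalGeneration

    model = MBartForConditionalGeneration.from_pretrained(model_name)
    tokenizer = MBart50TokenizerFast.from_pretrained(model_name)
    return model, tokenizer


def translate(text, src_lang, tgt_lang, model, tokenizer):
    """Translate `text` from `src_lang` to `tgt_lang` (codes such as "en_XX")."""
    # Tell the tokenizer which language code marks the source sentence.
    tokenizer.src_lang = src_lang
    encoded = tokenizer(text, return_tensors="pt")

    # Look up the vocabulary id of the target language code and force it
    # as the first generated token.
    target_id = tokenizer.convert_tokens_to_ids(tgt_lang)
    generated = model.generate(**encoded, forced_bos_token_id=target_id)

    # Drop special tokens (including the language code) from the decoded text.
    return tokenizer.batch_decode(generated, skip_special_tokens=True)[0]
```

Assuming the download succeeds, `model, tokenizer = load_mbart50()` followed by `translate("India is a great country.", "en_XX", "hi_IN", model, tokenizer)` should return the Hindi translation as a string.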

* For open-ended **multilingual generation** (as opposed to translation), you need a **causal LM** trained on many languages (e.g. ***XGLM, BLOOM, mGPT***).
* These models don’t **“auto-translate”**. Instead, they *continue text in the target language if you nudge them.*
* You can steer the output language by:
  - Adding a language instruction to the prompt (e.g., “Write in Hindi: ...”)
  - Using a **translation model**, such as the one described here, if you need an exact translation.
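
The prompt-instruction approach from the list above might look like the following sketch. `build_prompt` and `generate_in_language` are hypothetical helper names, and `bigscience/bloom-560m` is only an assumed example of a small multilingual causal LM. Whether the continuation actually stays in the requested language depends on the model and is not guaranteed.

```python
def build_prompt(language, text):
    """Prefix the text with an instruction naming the desired output language."""
    return f"Write in {language}: {text}"


def generate_in_language(language, text, model_name="bigscience/bloom-560m",
                         max_new_tokens=40):
    """Let a multilingual causal LM continue an instruction-prefixed prompt."""
    # Imported lazily so that build_prompt can be used without transformers.
    from transformers import pipeline

    generator = pipeline("text-generation", model=model_name)
    prompt = build_prompt(language, text)
    outputs = generator(prompt, max_new_tokens=max_new_tokens)
    # The returned text includes the prompt followed by the continuation.
    return outputs[0]["generated_text"]
```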

In [1]:
from transformers import pipeline

# Load the multilingual translation pipeline (mBART)
translator = pipeline("translation", model="facebook/mbart-large-50-many-to-many-mmt")


In [2]:
# English → Hindi
hindi_prompt = "India is a great country and has a rich culture."
hindi_output = translator(hindi_prompt, src_lang="en_XX", tgt_lang="hi_IN")
print("Hindi Output:\n", hindi_output[0]['translation_text'])

# English → Spanish
spanish_prompt = "Artificial intelligence is changing the future of technology."
spanish_output = translator(spanish_prompt, src_lang="en_XX", tgt_lang="es_XX")
print("\nSpanish Output:\n", spanish_output[0]['translation_text'])

# English → French
french_prompt = "The future of technology is full of exciting opportunities."
french_output = translator(french_prompt, src_lang="en_XX", tgt_lang="fr_XX")
print("\nFrench Output:\n", french_output[0]['translation_text'])


Hindi Output:
 भारत एक महान देश है और इसकी समृद्ध संस्कृति है।

Spanish Output:
 La inteligencia artificial está cambiando el futuro de la tecnología.

French Output:
 L'avenir de la technologie est plein d'opportunités intéressantes.
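
The three calls above follow the same pattern, so they can be folded into a loop. This is a minimal sketch that assumes a `translator` pipeline like the one created earlier. `translate_to_many` is a hypothetical helper, not part of `transformers`.

```python
def translate_to_many(translator, text, targets, src_lang="en_XX"):
    """Translate `text` into each target language code.

    `translator` is any callable with the translation pipeline's call
    signature; `targets` maps a display name to an mBART-50 language code.
    """
    results = {}
    for name, code in targets.items():
        output = translator(text, src_lang=src_lang, tgt_lang=code)
        results[name] = output[0]["translation_text"]
    return results
```

For example, `translate_to_many(translator, "India is a great country.", {"Hindi": "hi_IN", "Spanish": "es_XX", "French": "fr_XX"})` returns a dictionary keyed by language name.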
