## Build the translation pipeline using 🤗 Transformers Library


In [4]:
from transformers.utils import logging
logging.set_verbosity_error()

In [5]:
from transformers import pipeline 
import torch

In [6]:
translator = pipeline(task="translation",
                      model="facebook/nllb-200-distilled-600M",
                      torch_dtype=torch.bfloat16) 
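
Passing `torch_dtype=torch.bfloat16` stores each weight in 2 bytes instead of the 4 bytes used by the default `float32`. As a rough back-of-the-envelope sketch, assuming the model has about 600 million parameters (inferred from the "600M" in its name, not measured), the memory taken by the weights roughly halves:

```python
# Rough estimate only: the parameter count is an assumption based on the
# "600M" in the model name; activations and other buffers are ignored.
ASSUMED_PARAMS = 600_000_000
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2}


def weight_memory_gb(n_params: int, dtype: str) -> float:
    """Approximate memory for the weights alone, in gigabytes (10**9 bytes)."""
    return n_params * BYTES_PER_PARAM[dtype] / 1e9


for dtype in BYTES_PER_PARAM:
    print(f"{dtype}: ~{weight_memory_gb(ASSUMED_PARAMS, dtype):.1f} GB")
```

The real footprint depends on the exact parameter count and on runtime overhead, so treat these numbers as an order-of-magnitude guide.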

In [7]:
text = """\
My puppy is adorable, \
Your kitten is cute.
Her panda is friendly.
His llama is thoughtful. \
We all have nice pets!"""

In [8]:
text_translated = translator(text,
                             src_lang="eng_Latn",
                             tgt_lang="fra_Latn")

Some of the language codes that can be passed as `src_lang` and `tgt_lang`:

Afrikaans: afr_Latn
Chinese: zho_Hans
Egyptian Arabic: arz_Arab
French: fra_Latn
German: deu_Latn
Greek: ell_Grek
Hindi: hin_Deva
Indonesian: ind_Latn
Italian: ita_Latn
Japanese: jpn_Jpan
Korean: kor_Hang
Persian: pes_Arab
Portuguese: por_Latn
Russian: rus_Cyrl
Spanish: spa_Latn
Swahili: swh_Latn
Thai: tha_Thai
Turkish: tur_Latn
Vietnamese: vie_Latn
Zulu: zul_Latn
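
To avoid retyping codes by hand, a few of them can be kept in a small lookup table. The sketch below is only illustrative: `LANG_CODES` and `lang_kwargs` are hypothetical names, not part of the Transformers library, and the table holds a handful of the codes listed above plus the English code used in the translation call.

```python
# A few codes copied from the list above, plus the English code used earlier.
LANG_CODES = {
    "English": "eng_Latn",
    "French": "fra_Latn",
    "German": "deu_Latn",
    "Spanish": "spa_Latn",
    "Japanese": "jpn_Jpan",
}


def lang_kwargs(source: str, target: str) -> dict:
    """Hypothetical helper: build the src_lang/tgt_lang keyword arguments."""
    try:
        return {"src_lang": LANG_CODES[source], "tgt_lang": LANG_CODES[target]}
    except KeyError as err:
        raise ValueError(f"No code listed for language {err.args[0]!r}") from None


kwargs = lang_kwargs("English", "German")
print(kwargs)

# With the pipeline from the earlier cell still loaded, this could be used as:
# translator(text, **kwargs)
```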


In [9]:
text_translated

[{'translation_text': 'Mon chiot est adorable, ton chaton est mignon, son panda est ami, sa lamme est attentive, nous avons tous de beaux animaux de compagnie.'}]
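
The result is a list containing one dictionary, with the translated string stored under the `translation_text` key. A minimal sketch of pulling the string out, using the output above pasted in as a literal so that it runs without loading the model:

```python
# Output copied from the cell above; no model is needed to run this sketch.
text_translated = [
    {
        "translation_text": "Mon chiot est adorable, ton chaton est mignon, "
        "son panda est ami, sa lamme est attentive, "
        "nous avons tous de beaux animaux de compagnie."
    }
]

# Take the first (and only) result, then read the translated string.
french = text_translated[0]["translation_text"]
print(french)
```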

## Free up some memory before continuing

To leave enough memory for the rest of the code, delete the translation pipeline and run the garbage collector before continuing.


In [10]:
import gc

In [11]:
del translator

In [12]:
gc.collect()

1846
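
The number printed by `gc.collect()` is the count of unreachable objects the collector found, not a number of bytes. The following sketch shows the same `del` plus `gc.collect()` pattern on a small stand-in object. `FakeModel` is a made-up class for illustration, and the weak reference is only there to observe whether the object has been reclaimed.

```python
import gc
import weakref


class FakeModel:
    """Stand-in for a large object; not a real Transformers class."""

    def __init__(self):
        self.weights = bytearray(1_000_000)
        self.owner = self  # reference cycle: refcounting alone cannot free it


model = FakeModel()
probe = weakref.ref(model)  # lets us check whether the object still exists

del model                   # removes the name; the cycle still holds the object
unreachable = gc.collect()  # returns the number of unreachable objects found

print(probe() is None)      # True once the collector has reclaimed the object
print(unreachable >= 1)
```

An object that is not part of a reference cycle is freed as soon as its last reference disappears, so `del` alone is often enough. Calling `gc.collect()` afterwards also covers objects that are kept alive only by cycles.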

## Build the summarization pipeline using 🤗 Transformers Library


In [14]:
summarizer = pipeline(task="summarization",
                      model="facebook/bart-large-cnn",
                      torch_dtype=torch.bfloat16)

In [15]:
text = """Paris is the capital and most populous city of France, with
          an estimated population of 2,175,601 residents as of 2018,
          in an area of more than 105 square kilometres (41 square
          miles). The City of Paris is the centre and seat of
          government of the region and province of Île-de-France, or
          Paris Region, which has an estimated population of
          12,174,880, or about 18 percent of the population of France
          as of 2017."""
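
Because the string is written across indented lines, it contains newlines and runs of spaces that become part of the model input. The summary later in this section shows the model copes with that, but the whitespace can be collapsed first if cleaner input is preferred. A minimal sketch on a shortened copy of the text, where `clean_text` is a name introduced here:

```python
# Shortened copy of the passage, keeping the original line breaks and indentation.
text = """Paris is the capital and most populous city of France, with
          an estimated population of 2,175,601 residents as of 2018,
          in an area of more than 105 square kilometres (41 square
          miles)."""

# split() with no arguments splits on any run of whitespace, including newlines,
# so joining with a single space collapses the indentation.
clean_text = " ".join(text.split())
print(clean_text)
```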

In [16]:
summary = summarizer(text,
                     min_length=10,
                     max_length=100)

In [17]:
summary

[{'summary_text': 'Paris is the capital and most populous city of France, with an estimated population of 2,175,601 residents as of 2018. The City of Paris is the centre and seat of the government of the region and province of Île-de-France.'}]
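
As with translation, the result is a list with one dictionary, here keyed by `summary_text`. Note that `min_length` and `max_length` are counted in tokens, not in words or characters. The sketch below compares word counts of the input and the summary as a quick sanity check. It uses the strings from the cells above as literals, so it runs without the model, and the word-based ratio is only a rough proxy for the token-based limits.

```python
# Input passage and summary copied from the cells above.
text = (
    "Paris is the capital and most populous city of France, with "
    "an estimated population of 2,175,601 residents as of 2018, "
    "in an area of more than 105 square kilometres (41 square "
    "miles). The City of Paris is the centre and seat of "
    "government of the region and province of Île-de-France, or "
    "Paris Region, which has an estimated population of "
    "12,174,880, or about 18 percent of the population of France "
    "as of 2017."
)
summary = [
    {
        "summary_text": "Paris is the capital and most populous city of France, "
        "with an estimated population of 2,175,601 residents as of 2018. "
        "The City of Paris is the centre and seat of the government of "
        "the region and province of Île-de-France."
    }
]

summary_text = summary[0]["summary_text"]

# Whitespace-separated words: a rough proxy, since the model counts tokens.
n_text_words = len(text.split())
n_summary_words = len(summary_text.split())
ratio = n_summary_words / n_text_words

print(f"input: {n_text_words} words, summary: {n_summary_words} words")
print(f"summary is {ratio:.0%} of the input length")
```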