# Lesson 3: Translation and Summarization

- In the classroom, the libraries are already installed for you.
- To run this code on your own machine, install the following:

```
!pip install transformers
!pip install torch
```

- The following code sets the Transformers log level to errors only, which suppresses warning messages.

In [1]:
from transformers.utils import logging
logging.set_verbosity_error()

### Build the `translation` pipeline using the 🤗 Transformers library

In [2]:
from transformers import pipeline 
import torch

In [3]:
translator = pipeline(task="translation",
                      model="./models/facebook/nllb-200-distilled-600M",
                      torch_dtype=torch.bfloat16) 

The model is NLLB (No Language Left Behind): ['nllb-200-distilled-600M'](https://huggingface.co/facebook/nllb-200-distilled-600M).



In [4]:
text = """\
My puppy is adorable, \
Your kitten is cute.
Her panda is friendly.
His llama is thoughtful. \
We all have nice pets!"""

In [5]:
text_translated = translator(text,
                             src_lang="eng_Latn",
                             tgt_lang="fra_Latn")

To translate between other languages, look up their codes on the page [Languages in FLORES-200](https://github.com/facebookresearch/flores/blob/main/flores200/README.md#languages-in-flores-200).

For example:
- Afrikaans: afr_Latn
- Chinese: zho_Hans
- Egyptian Arabic: arz_Arab
- French: fra_Latn
- German: deu_Latn
- Greek: ell_Grek
- Hindi: hin_Deva
- Indonesian: ind_Latn
- Italian: ita_Latn
- Japanese: jpn_Jpan
- Korean: kor_Hang
- Persian: pes_Arab
- Portuguese: por_Latn
- Russian: rus_Cyrl
- Spanish: spa_Latn
- Swahili: swh_Latn
- Thai: tha_Thai
- Turkish: tur_Latn
- Vietnamese: vie_Latn
- Zulu: zul_Latn
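
The codes above can be kept in a small lookup table so that cells refer to languages by name. This is a minimal sketch: `FLORES_CODES`, `lang_code`, and `translation_kwargs` are hypothetical names introduced here for illustration, not part of the Transformers library, and the table holds only a few of the codes that appear in this lesson.

```python
# Hypothetical lookup table (not part of Transformers), filled with a few
# of the FLORES-200 codes used or listed in this lesson.
FLORES_CODES = {
    "English": "eng_Latn",
    "French": "fra_Latn",
    "German": "deu_Latn",
    "Indonesian": "ind_Latn",
    "Egyptian Arabic": "arz_Arab",
    "Japanese": "jpn_Jpan",
    "Spanish": "spa_Latn",
}


def lang_code(name):
    """Return the FLORES-200 code for a language name in the table."""
    if name not in FLORES_CODES:
        raise ValueError(f"No code listed for {name!r}; see the FLORES-200 page.")
    return FLORES_CODES[name]


def translation_kwargs(source, target):
    """Build the src_lang/tgt_lang keyword arguments used by the translator cells."""
    return {"src_lang": lang_code(source), "tgt_lang": lang_code(target)}


kwargs = translation_kwargs("English", "French")
print(kwargs)
```

With the pipeline from the earlier cell loaded, the call would then look like `translator(text, **kwargs)`.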

In [6]:
text_translated

[{'translation_text': 'Mon chiot est adorable, ton chaton est mignon, son panda est ami, sa lamme est attentive, nous avons tous de beaux animaux de compagnie.'}]
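
For a single input string, the pipeline returns a list holding one dictionary, and the translated string sits under the `translation_text` key. The sketch below pulls the string out of the output shown above. `sample_output` is that output copied by hand and `translation_texts` is a hypothetical helper name, so no model is needed to run it.

```python
# Output of the French translation cell, copied verbatim from above.
sample_output = [{'translation_text': 'Mon chiot est adorable, ton chaton est mignon, son panda est ami, sa lamme est attentive, nous avons tous de beaux animaux de compagnie.'}]


def translation_texts(results):
    """Collect the translated strings from a list of pipeline result dicts."""
    return [item["translation_text"] for item in results]


french = translation_texts(sample_output)[0]
print(french)
```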

In [7]:
text = """\
Saya punya rumah, \
Ukurannya kecil dan sempit.
Warnanya pink cerah.
Interiornya sederhana dan elegan. \
Saya senang tinggal di sini!"""

text_translated = translator(text,
                             src_lang="ind_Latn",
                             tgt_lang="arz_Arab")

text_translated

[{'translation_text': 'عندي بيت صغير ضيق، لونه وردي مشرق، داخله بسيطة ورائعة، أنا سعيد بالبقاء هنا!'}]

## Free up some memory before continuing
- Run the following cells to free memory on the machine so that the rest of the code has enough to run.

In [8]:
import gc

In [9]:
del translator

In [10]:
gc.collect()

25
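
`del translator` removes the name, and `gc.collect()` asks Python's garbage collector to reclaim anything left unreachable. The number it returns (25 above) is the count of unreachable objects it found. The sketch below repeats the same two steps on a stand-in object and uses a weak reference to check that the object is really gone. `DummyModel` and `probe` are hypothetical names for illustration, and the immediate release on `del` assumes CPython's reference counting.

```python
import gc
import weakref


class DummyModel:
    """Hypothetical stand-in for a large pipeline object."""

    def __init__(self):
        # A big list plays the role of model weights.
        self.weights = [0.0] * 1_000_000


model = DummyModel()
probe = weakref.ref(model)  # a weak reference does not keep the object alive

del model      # drop the only strong reference
gc.collect()   # reclaim anything still held in reference cycles

# The weak reference returns None once the object has been freed.
print(probe() is None)
```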

### Build the `summarization` pipeline using the 🤗 Transformers library

In [11]:
summarizer = pipeline(task="summarization",
                      model="./models/facebook/bart-large-cnn",
                      torch_dtype=torch.bfloat16)

Model info: ['bart-large-cnn'](https://huggingface.co/facebook/bart-large-cnn)

In [12]:
text = """Paris is the capital and most populous city of France, with
          an estimated population of 2,175,601 residents as of 2018,
          in an area of more than 105 square kilometres (41 square
          miles). The City of Paris is the centre and seat of
          government of the region and province of Île-de-France, or
          Paris Region, which has an estimated population of
          12,174,880, or about 18 percent of the population of France
          as of 2017."""

In [13]:
summary = summarizer(text,
                     min_length=10,
                     max_length=100)

In [14]:
summary

[{'summary_text': 'Paris is the capital and most populous city of France, with an estimated population of 2,175,601 residents as of 2018. The City of Paris is the centre and seat of the government of the region and province of Île-de-France.'}]
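
Note that `min_length` and `max_length` bound the summary length in model tokens, not in words or characters. As a rough check on how much shorter a summary is, the sketch below compares whitespace-separated word counts for the Paris example. `word_count` and `compression_ratio` are hypothetical helper names, and word counts only approximate the token counts the model actually uses.

```python
# Source text and summary copied from the cells above.
source = """Paris is the capital and most populous city of France, with
an estimated population of 2,175,601 residents as of 2018,
in an area of more than 105 square kilometres (41 square
miles). The City of Paris is the centre and seat of
government of the region and province of Île-de-France, or
Paris Region, which has an estimated population of
12,174,880, or about 18 percent of the population of France
as of 2017."""

summary_text = (
    "Paris is the capital and most populous city of France, with an "
    "estimated population of 2,175,601 residents as of 2018. The City of "
    "Paris is the centre and seat of the government of the region and "
    "province of Île-de-France."
)


def word_count(s):
    """Count whitespace-separated words (a rough proxy for tokens)."""
    return len(s.split())


def compression_ratio(original, summary):
    """Fraction of the original word count that the summary retains."""
    return word_count(summary) / word_count(original)


ratio = compression_ratio(source, summary_text)
print(f"{word_count(summary_text)} of {word_count(source)} words kept ({ratio:.0%})")
```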

### Try it yourself! 
- Try this model with your own texts!

In [15]:
text = """From Wikipedia, the free encyclopedia
"UFC" redirects here. For other uses, see UFC (disambiguation).
UFC Holdings, LLC

Trade name	UFC
Company type	Subsidiary
Industry	
Mixed martial artsStreaming media
Founded	November 12, 1993; 30 years ago
Founders	
Art Davie
Bob Meyrowitz
Ryan Troutsdale
Campbell McLaren
David Isaacs
John Milius
Rorion Gracie[1][2]
Headquarters	Las Vegas, Nevada, United States
Area served	Worldwide
Key people	
Ari Emanuel
(CEO, TKO)
Dana White
(President & CEO)
Products	
Home videoLive eventsMerchandisePublishingStreaming network serviceTV
Services	Licensing
Revenue	Increase US$1.3 billion (2023)
Operating income	Increase US$$142.9 million (2023)
Net income	Increase US$387 million (2022)
Total assets	Increase US$1.1 billion (2022)
Total equity	Increase US$7 billion (2022)
Number of employees	~400 (2016)
Parent	TKO Group Holdings[3][a]
Divisions	UFC Apex
UFC Fight Pass
UFC Performance Institute
Website	www.ufc.com Edit this at Wikidata
The Ultimate Fighting Championship (UFC) is an American mixed martial arts (MMA) promotion company based in Las Vegas, Nevada. It is owned and operated by TKO Group Holdings, a majority owned subsidiary of Endeavor Group Holdings.[5][6][7][8] It is the largest MMA promotion in the world as of 2023.[9] It produces events worldwide that showcase 11 weight divisions (eight men's and three women's) and abides by the Unified Rules of Mixed Martial Arts.[10] As of 2022, it had held over 600 events. Dana White has been its president since 2001 and CEO since 2023. Under White's stewardship, it has grown into a global multi-billion-dollar enterprise.[11]

The UFC was founded by businessman Art Davie and Brazilian martial artist Rorion Gracie,[12] and the first event was held in 1993 at the McNichols Sports Arena in Denver, Colorado.[13] The purpose of the UFC's early competitions was to identify the most effective martial art in a contest with minimal rules and no weight classes between competitors of different fighting disciplines. In subsequent events, more rigorous rules were created and fighters began adopting effective techniques from more than one discipline, which indirectly helped create a separate style of fighting known as present-day mixed martial arts.

The UFC was initially owned by the Semaphore Entertainment Group (SEG) until it had financial issues and it was sold to the brothers Frank and Lorenzo Fertitta in 2001, who formed the company Zuffa to operate the UFC, and placed Dana White as the president of the company.[11][14] In 2016, UFC's parent company, Zuffa, was sold to a group led by Endeavor, then known as William Morris Endeavor (WME–IMG), including Silver Lake Partners, Kohlberg Kravis Roberts and MSD Capital[15] for US$4.025 billion.[16] In 2021, Endeavor bought out Zuffa's other owners at a valuation of $1.7 billion.[8] The organization had 578 fighters contracted as of January 2023.[17]

With a TV deal and expansion in Australia, Asia, Europe,[18][19][20] and new markets within the United States, the UFC has achieved greater mainstream media coverage. It earned US$609 million in 2015,[21] and its next domestic media rights agreement with ESPN was valued at $1.5 billion over a five-year term.[22]

In April 2023, Endeavor Group Holdings announced that UFC would merge with the wrestling promotion WWE to form TKO Group Holdings, a new public company majority-owned by Endeavor, with Vince McMahon serving as an executive chairman of the new entity and White remaining as UFC president. The merger was completed on September 12, 2023.[23]"""

In [16]:
summary = summarizer(text,
                     min_length=10,
                     max_length=140)

In [17]:
summary

[{'summary_text': "The Ultimate Fighting Championship (UFC) is an American mixed martial arts (MMA) promotion company based in Las Vegas, Nevada. It is owned and operated by TKO Group Holdings, a majority owned subsidiary of Endeavor Group Holdings. UFC was founded by businessman Art Davie and Brazilian martial artist Rorion Gracie in 1993. Dana White has been its president since 2001 and CEO since 2023. It produces events worldwide that showcase 11 weight divisions (eight men's and three women's) and abides by the Unified Rules of Mixed Martial Arts."}]