# Transformers, what can they do?

## Introduction

In this section, we look at what **Transformer models** can do and use the Hugging Face (HF from now on) **`pipeline()`** function.

### Transformers are everywhere!

**Transformer models** are used to solve **all kinds** of **NLP** tasks (like the ones in the previous section).

Many companies use **HF** and **Transformer models**, and share their own models back with the community, including:
- Facebook
- Microsoft
- Grammarly

The [**HF Transformers Library**](https://github.com/huggingface/transformers) provides the tools to:
- Create the models
- Use the shared models

The [**HF Model Hub**](https://huggingface.co/models) contains thousands of open-source **pretrained models**.

## Working with pipelines

Install the `Transformers`, `Datasets`, and `Evaluate` libraries to run this notebook.

In [None]:
!pip install datasets evaluate transformers[sentencepiece]

The most **basic object** in the `transformers` library is the `pipeline()` function.
It connects a **model** with its necessary steps:
- preprocessing
- postprocessing

This allows us to:
1. input any text
2. get an intelligible answer

By default, the `pipeline()` function selects a **pretrained model** that has been fine-tuned for the requested task. The model is **downloaded and cached** the first time the pipeline is created.

In [None]:
from transformers import pipeline

classifier = pipeline("sentiment-analysis")
classifier("I've been waiting for a HuggingFace course my whole life.")

We can even pass a **list** of sentences:

In [None]:
classifier(
    ["I've been waiting for a HuggingFace course my whole life.", "I hate this so much!"]
)

There are **3 main steps** involved when you pass **text** to the `pipeline()`:

1. Text is **preprocessed** into a format the model can understand
2. **Preprocessed inputs** are passed to the model
3. **Predictions** made by the model are **post-processed** so we can understand them
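The three steps above can be sketched with a toy example. This is not how the library is implemented internally: the vocabulary, the weights, and the function names (`preprocess`, `model`, `postprocess`, `toy_pipeline`) are all hypothetical and chosen only to make each step visible.

```python
import math

# Toy vocabulary and weights: hypothetical values chosen for illustration only.
VOCAB = {"love": 0, "great": 1, "hate": 2, "awful": 3}
LABELS = ["NEGATIVE", "POSITIVE"]
# One row per label, one column per vocabulary word.
WEIGHTS = [
    [-1.0, -1.0, 2.0, 2.0],  # NEGATIVE
    [2.0, 2.0, -1.0, -1.0],  # POSITIVE
]


def preprocess(text):
    """Step 1: turn raw text into numbers (here, counts of known words)."""
    counts = [0] * len(VOCAB)
    for word in text.lower().split():
        word = word.strip(".,!?")
        if word in VOCAB:
            counts[VOCAB[word]] += 1
    return counts


def model(features):
    """Step 2: compute one raw score (logit) per label."""
    return [sum(w * x for w, x in zip(row, features)) for row in WEIGHTS]


def postprocess(logits):
    """Step 3: convert logits to probabilities and keep the best label."""
    top = max(logits)
    exps = [math.exp(x - top) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    best = max(range(len(probs)), key=probs.__getitem__)
    return {"label": LABELS[best], "score": probs[best]}


def toy_pipeline(text):
    """Chain the three steps, as a pipeline does for us."""
    return postprocess(model(preprocess(text)))


print(toy_pipeline("I hate this so much!"))
```

The returned dictionary loosely mirrors the `label` and `score` fields of the sentiment-analysis output; a real model replaces the word counts with a tokenizer and the weight table with a neural network.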

In [None]:
from transformers import pipeline

classifier = pipeline("zero-shot-classification")
classifier(
    "This is a course about the Transformers library",
    candidate_labels=["education", "politics", "business"],
)

Some of the currently [**available HF pipelines**](https://huggingface.co/docs/transformers/main_classes/pipelines) are:
- feature-extraction
- fill-mask
- ner
- question-answering
- sentiment-analysis
- summarization
- text-generation
- translation
- zero-shot-classification

### Using any model from the **HF Hub** in a `pipeline()`

You can choose a **particular model** from the [**HF Hub**](https://huggingface.co/models) to use in the `pipeline()`:


In [None]:
from transformers import pipeline

generator = pipeline("text-generation", model="distilgpt2")
generator("In this course, we will teach you how to")

## Types of pipelines & examples

### Zero-Shot-Classification

It is called `zero-shot` because we don't need to **fine-tune** the model on our own data to use it.

It allows us to **classify texts** that haven't been **labeled**.

This is a common scenario in the real world.

The `zero-shot-classification` pipeline lets you specify **which labels to use for classification**.

By doing that, you don't have to rely on the **labels of the pretrained model**.

In [None]:
from transformers import pipeline

classifier = pipeline("zero-shot-classification")
classifier(
    "The company was bought by an arabic fund",
    candidate_labels=["mergers and acquisitions", "politics"],
)
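One way to picture how arbitrary labels can be scored without fine-tuning is the following simplified sketch. It assumes an NLI-style model that rates how well each label, rewritten as a sentence, follows from the input text. The helper names (`build_hypotheses`, `rank_labels`) and the logits are hypothetical; only the ranking logic is shown, not a real model.

```python
import math


def build_hypotheses(candidate_labels, template="This example is {}."):
    """Turn each candidate label into a sentence a model could score."""
    return [template.format(label) for label in candidate_labels]


def rank_labels(candidate_labels, entailment_logits):
    """Softmax the per-label logits and sort labels by descending score."""
    top = max(entailment_logits)
    exps = [math.exp(x - top) for x in entailment_logits]
    total = sum(exps)
    scored = [(label, e / total) for label, e in zip(candidate_labels, exps)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


labels = ["education", "politics", "business"]
print(build_hypotheses(labels))

# Hypothetical logits standing in for a real model's output.
made_up_logits = [3.0, 0.5, 1.0]
for label, score in rank_labels(labels, made_up_logits):
    print(f"{label}: {score:.3f}")
```

Because the labels are only plugged into a sentence template, any label set can be supplied at call time, which is what `candidate_labels` does in the cells above.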

### Text Generation

Used for **generating some text**. 

We provide a **prompt** and the model will **auto-complete** by generating the **remaining text**.

It involves **randomness**. Outputs may differ in each execution.

In [None]:
from transformers import pipeline

generator = pipeline("text-generation")
generator("Mergers and acquisitions or M&A is ")
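The randomness comes from sampling: the next word is drawn from a probability distribution rather than always taking the most likely one. The sketch below uses a hypothetical distribution (`NEXT_WORD_PROBS`) and a hypothetical helper (`sample_words`) to show that fixing the random seed makes the draws repeatable.

```python
import random

# Hypothetical next-word probabilities for some prompt (illustrative values).
NEXT_WORD_PROBS = {"a": 0.4, "the": 0.3, "an": 0.2, "one": 0.1}


def sample_next_word(rng):
    """Draw one word at random, weighted by its probability."""
    words = list(NEXT_WORD_PROBS)
    weights = list(NEXT_WORD_PROBS.values())
    return rng.choices(words, weights=weights, k=1)[0]


def sample_words(seed, count=5):
    """Draw several words from a generator seeded with the given value."""
    rng = random.Random(seed)
    return [sample_next_word(rng) for _ in range(count)]


print(sample_words(seed=0))
print(sample_words(seed=0))  # same seed, same draws
print(sample_words(seed=1))  # another seed may give different draws
```

With the real library, the `set_seed` helper exported by `transformers` plays a comparable role when reproducible generations are needed.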

You can control **how many different sequences are generated** by using `num_return_sequences`.

You can control **the total length of the output text** by using `max_length`.

In [None]:
from transformers import pipeline

generator = pipeline("text-generation", model="distilgpt2")
generator(
    "In this course, we will teach you how to",
    max_length=30,
    num_return_sequences=2,
)
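To make the roles of `max_length` and `num_return_sequences` concrete, here is a toy generator. It is a sketch under strong assumptions: `toy_generate` and `TOY_VOCAB` are hypothetical, words are picked uniformly at random instead of being predicted by a model, and length is counted in words, whereas the real pipeline counts tokens.

```python
import random

# Tiny hypothetical vocabulary; a real model scores tens of thousands of tokens.
TOY_VOCAB = ["build", "models", "train", "and", "use", "data", "with", "pipelines"]


def toy_generate(prompt, max_length, num_return_sequences, seed=0):
    """Return several random continuations, each capped at max_length words."""
    rng = random.Random(seed)
    prompt_words = prompt.split()
    results = []
    for _ in range(num_return_sequences):
        words = list(prompt_words)
        # The prompt counts toward the limit, so only the remainder is generated.
        while len(words) < max_length:
            words.append(rng.choice(TOY_VOCAB))
        results.append({"generated_text": " ".join(words)})
    return results


outputs = toy_generate(
    "In this course, we will teach you how to",
    max_length=15,
    num_return_sequences=2,
)
for output in outputs:
    print(output["generated_text"])
```

The point to notice is that the limit covers the prompt plus the continuation, so a long prompt leaves less room for new text.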

### Mask Filling

This task fills in the blanks in a given text.

The model fills in the **special** `<mask>` word, also called the **mask token**.

The `top_k` argument controls how many possibilities are displayed.

Other mask-filling models **might have different mask tokens**.

In [None]:
from transformers import pipeline

unmasker = pipeline("fill-mask")
unmasker("This course will teach you all about <mask> models.", top_k=2)
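The effect of `top_k` and of the mask token can be illustrated with a small stand-in. `toy_fill_mask` is a hypothetical helper and the candidate scores are made up; a real model would compute a score for every token in its vocabulary.

```python
def toy_fill_mask(text, candidate_scores, top_k=5, mask_token="<mask>"):
    """Return the top_k fillings of mask_token, best score first."""
    if mask_token not in text:
        raise ValueError(f"No {mask_token!r} token found in the input text")
    ranked = sorted(candidate_scores.items(), key=lambda item: item[1], reverse=True)
    return [
        {
            "sequence": text.replace(mask_token, word, 1),
            "score": score,
            "token_str": word,
        }
        for word, score in ranked[:top_k]
    ]


# Hypothetical scores standing in for a real model's predictions.
scores = {"mathematical": 0.19, "computational": 0.04, "statistical": 0.03}

results = toy_fill_mask(
    "This course will teach you all about <mask> models.", scores, top_k=2
)
for result in results:
    print(result)

# BERT-style checkpoints use a different mask token, written [MASK].
print(toy_fill_mask("Paris is the [MASK] of France.", {"capital": 0.9}, mask_token="[MASK]"))
```

Passing text with the wrong mask token is a common mistake, which is why the sketch raises an error when the token is absent; checking the model card for the expected token avoids it.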

### Named Entity Recognition (NER)

The model has to find which parts of the **input text** correspond to entities (companies, locations, persons...).

By passing the option `grouped_entities=True`, we tell the pipeline to regroup the parts of the sentence that correspond to the same entity. Recent versions of the library mark this option as deprecated in favor of `aggregation_strategy`.

In the example, Hugging Face will be treated as a **single** organization, even though the name consists of multiple words.

In [None]:
from transformers import pipeline

ner = pipeline("ner", grouped_entities=True)
ner("My name is Sylvain and I work at Hugging Face in Brooklyn.")

You will see that the model has recognized: 
- Sylvain as a **person** `PER`
- Hugging Face as an **organization** `ORG`
- Brooklyn as a **location** `LOC`
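Grouping can be pictured as merging consecutive tokens that carry the same entity label. The sketch below assumes a `B-`/`I-`/`O` tagging scheme and hand-written tags; `group_entities` is a hypothetical helper, not the library's own grouping code.

```python
def group_entities(tagged_tokens):
    """Merge consecutive tokens that belong to the same entity.

    tagged_tokens is a list of (word, tag) pairs using B-/I-/O tags.
    """
    entities = []
    current = None
    for word, tag in tagged_tokens:
        if tag == "O":
            # Outside any entity: close the entity being built, if any.
            current = None
            continue
        prefix, label = tag.split("-", 1)
        if prefix == "I" and current is not None and current["entity_group"] == label:
            # Continuation of the current entity.
            current["word"] += " " + word
        else:
            # Start of a new entity.
            current = {"entity_group": label, "word": word}
            entities.append(current)
    return entities


# Hand-written tags for illustration; a real model predicts them per token.
tagged = [
    ("My", "O"), ("name", "O"), ("is", "O"), ("Sylvain", "B-PER"),
    ("and", "O"), ("I", "O"), ("work", "O"), ("at", "O"),
    ("Hugging", "B-ORG"), ("Face", "I-ORG"), ("in", "O"), ("Brooklyn", "B-LOC"),
]
print(group_entities(tagged))
```

Without the merge step, "Hugging" and "Face" would be reported as two separate `ORG` entries.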

In [None]:
from transformers import pipeline

ner = pipeline("ner", grouped_entities=True, model="MMG/xlm-roberta-large-ner-spanish")
ner("My name is Sylvain and I work at Hugging Face in Brooklyn.")

### Question Answering

The pipeline works by extracting **information** from the **provided context**. 

It does not generate the answer!

In [None]:
from transformers import pipeline

question_answerer = pipeline("question-answering")
question_answerer(
    question="Where do I work?",
    context="My name is Sylvain and I work at Hugging Face in Brooklyn",
)
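Extraction can be sketched as choosing the best start and end positions inside the context. The per-word scores below are hypothetical and `best_span` is an illustrative helper; a real model works on tokens and computes the scores from the question.

```python
def best_span(words, start_scores, end_scores, max_answer_len=5):
    """Pick the span (i, j), with i <= j, that has the highest start + end score."""
    best = None
    for i, start in enumerate(start_scores):
        for j in range(i, min(i + max_answer_len, len(words))):
            score = start + end_scores[j]
            if best is None or score > best[0]:
                best = (score, i, j)
    _, i, j = best
    return " ".join(words[i : j + 1])


context = "My name is Sylvain and I work at Hugging Face in Brooklyn"
words = context.split()

# Hypothetical per-word scores; a real model computes them from the question.
start_scores = [0.0] * len(words)
end_scores = [0.0] * len(words)
start_scores[words.index("Hugging")] = 5.0
end_scores[words.index("Face")] = 5.0

answer = best_span(words, start_scores, end_scores)
print(answer)             # Hugging Face
print(answer in context)  # True
```

Since the answer is always a slice of the context, the pipeline cannot answer a question whose answer is not written there.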

### Summarization

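Summarization reduces a text to a shorter version while keeping the most important points.

As with text generation, we can specify a `max_length` or `min_length` argument for the result.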

In [None]:
from transformers import pipeline

summarizer = pipeline("summarization")
summarizer(
    """
    America has changed dramatically during recent years. Not only has the number of 
    graduates in traditional engineering disciplines such as mechanical, civil, 
    electrical, chemical, and aeronautical engineering declined, but in most of 
    the premier American universities engineering curricula now concentrate on 
    and encourage largely the study of engineering science. As a result, there 
    are declining offerings in engineering subjects dealing with infrastructure, 
    the environment, and related issues, and greater concentration on high 
    technology subjects, largely supporting increasingly complex scientific 
    developments. While the latter is important, it should not be at the expense 
    of more traditional engineering.

    Rapidly developing economies such as China and India, as well as other 
    industrial countries in Europe and Asia, continue to encourage and advance 
    the teaching of engineering. Both China and India, respectively, graduate 
    six and eight times as many traditional engineers as does the United States. 
    Other industrial countries at minimum maintain their output, while America 
    suffers an increasingly serious decline in the number of engineering graduates 
    and a lack of well-educated engineers.
""",
    max_length=30,
    min_length=10,
)

### Translation

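As with summarization, we can specify a `max_length` or `min_length` argument for the result.

The language pair comes from the model we choose; here `Helsinki-NLP/opus-mt-fr-en` translates from French to English.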

In [None]:
from transformers import pipeline

translator = pipeline("translation", model="Helsinki-NLP/opus-mt-fr-en")
translator("Ce cours est produit par Hugging Face.")
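The model name above suggests a naming pattern of source code followed by target code. The helper below is a hypothetical convenience built on that assumption; not every language pair is guaranteed to exist on the Hub, so the resulting name should be checked there before use.

```python
def opus_mt_model_id(source_lang, target_lang):
    """Build a Helsinki-NLP/opus-mt-* model id from two language codes.

    Assumes the "opus-mt-<source>-<target>" pattern seen in the example above.
    """
    return f"Helsinki-NLP/opus-mt-{source_lang}-{target_lang}"


print(opus_mt_model_id("fr", "en"))  # Helsinki-NLP/opus-mt-fr-en
```

The returned string could then be passed as the `model` argument of `pipeline("translation", ...)`, as in the cell above.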