In [1]:
from transformers import pipeline

### The `pipeline` API supports the most common NLP tasks out of the box
* Text classification
* Zero-shot classification
* Text generation
* Text completion (mask filling)
* Token classification
* Question answering
* Summarization
* Translation
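
As a quick orientation, the list above can be lined up with the task identifiers that the cells below pass to `pipeline()`. This is only a lookup sketch: the dictionary name `TASK_IDENTIFIERS` is made up for illustration, and the identifiers are the ones used later in this notebook.

```python
# Task identifiers exactly as they are passed to pipeline() in this notebook.
# Text classification is run through "sentiment-analysis" and
# token classification through "ner".
TASK_IDENTIFIERS = {
    "Text classification": "sentiment-analysis",
    "Zero-shot classification": "zero-shot-classification",
    "Text generation": "text-generation",
    "Text completion (mask filling)": "fill-mask",
    "Token classification": "ner",
    "Question answering": "question-answering",
    "Summarization": "summarization",
    "Translation": "translation",
}

for task, identifier in TASK_IDENTIFIERS.items():
    print(f"{task:32} -> pipeline({identifier!r})")
```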

In [2]:
classifier = pipeline("sentiment-analysis") # simple sentiment-analysis
classifier("Omg. This is terrible one. I hate you!")

No model was supplied, defaulted to distilbert-base-uncased-finetuned-sst-2-english and revision af0f99b (https://huggingface.co/distilbert-base-uncased-finetuned-sst-2-english).
Using a pipeline without specifying a model name and revision in production is not recommended.
All PyTorch model weights were used when initializing TFDistilBertForSequenceClassification.

All the weights of TFDistilBertForSequenceClassification were initialized from the PyTorch model.
If your task is similar to the task the model of the checkpoint was trained on, you can already use TFDistilBertForSequenceClassification for predictions without further training.


[{'label': 'NEGATIVE', 'score': 0.9988190531730652}]
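
The log above warns against relying on the default model in production. One way to act on it is to keep the model name and revision next to the task name. The sketch below only builds the keyword arguments; `PINNED` and `pinned_kwargs` are hypothetical helper names, and the model/revision pairs are copied from the "No model was supplied" messages printed in this notebook.

```python
# Model names and short revisions copied from this notebook's log messages.
# PINNED and pinned_kwargs are illustrative names, not part of transformers.
PINNED = {
    "sentiment-analysis": ("distilbert-base-uncased-finetuned-sst-2-english", "af0f99b"),
    "zero-shot-classification": ("roberta-large-mnli", "130fb28"),
    "fill-mask": ("distilroberta-base", "ec58a5b"),
    "ner": ("dbmdz/bert-large-cased-finetuned-conll03-english", "f2482bf"),
    "question-answering": ("distilbert-base-cased-distilled-squad", "626af31"),
    "summarization": ("t5-small", "d769bba"),
}


def pinned_kwargs(task):
    """Return explicit task/model/revision arguments for a pipeline call."""
    model, revision = PINNED[task]
    return {"task": task, "model": model, "revision": revision}


print(pinned_kwargs("sentiment-analysis"))
```

With `transformers` installed, something like `pipeline(**pinned_kwargs("sentiment-analysis"))` should then load the same checkpoint every time instead of whatever the current default happens to be.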

In [3]:
classifier(['This is a good one!',
            "I don't want to do this shit"])

[{'label': 'POSITIVE', 'score': 0.9998583793640137},
 {'label': 'NEGATIVE', 'score': 0.9988161325454712}]
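
When a list is passed in, the pipeline returns one result per input, in the same order. A minimal post-processing sketch, using the results copied from the output above (the helper `flag_negative` and its `threshold` default are assumptions for illustration):

```python
# Inputs and results copied from the cell above.
texts = ["This is a good one!", "I don't want to do this shit"]
results = [
    {"label": "POSITIVE", "score": 0.9998583793640137},
    {"label": "NEGATIVE", "score": 0.9988161325454712},
]


def flag_negative(texts, results, threshold=0.9):
    """Return the texts labelled NEGATIVE with a score of at least `threshold`."""
    return [
        text
        for text, result in zip(texts, results)
        if result["label"] == "NEGATIVE" and result["score"] >= threshold
    ]


print(flag_negative(texts, results))
```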

In [4]:
classifier = pipeline("zero-shot-classification")
# zero-shot-classification lets you provide your own candidate_labels
classifier(
    "This is a course about the Transformers Library",
    candidate_labels=["education", "politics", "science", "business"],
)

No model was supplied, defaulted to roberta-large-mnli and revision 130fb28 (https://huggingface.co/roberta-large-mnli).
Using a pipeline without specifying a model name and revision in production is not recommended.
All PyTorch model weights were used when initializing TFRobertaForSequenceClassification.

All the weights of TFRobertaForSequenceClassification were initialized from the PyTorch model.
If your task is similar to the task the model of the checkpoint was trained on, you can already use TFRobertaForSequenceClassification for predictions without further training.


{'sequence': 'This is a course about the Transformers Library',
 'labels': ['education', 'science', 'business', 'politics'],
 'scores': [0.9018488526344299,
  0.05133064091205597,
  0.02937738597393036,
  0.017443055287003517]}
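
In the output above, `labels` comes back sorted by descending score, and in this default single-label mode the scores behave like a probability distribution over the candidate labels. A small sketch that pairs them up, using the values copied from the output (variable names are illustrative):

```python
# Result copied from the zero-shot output above.
result = {
    "sequence": "This is a course about the Transformers Library",
    "labels": ["education", "science", "business", "politics"],
    "scores": [
        0.9018488526344299,
        0.05133064091205597,
        0.02937738597393036,
        0.017443055287003517,
    ],
}

# Pair each label with its score; the first pair is the top prediction.
ranked = list(zip(result["labels"], result["scores"]))
top_label, top_score = ranked[0]
total = sum(result["scores"])

print(top_label, round(top_score, 3))
print("sum of scores:", round(total, 6))
```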

In [5]:
# generator = pipeline('text-generation') # Default model
# generator = pipeline('text-generation', model='gpt2', revision='6c0e608') # gpt2 model with specific revision
generator = pipeline('text-generation', model='distilgpt2') # distilgpt2 model
generator("In this course, we will teach you how to")

All PyTorch model weights were used when initializing TFGPT2LMHeadModel.

All the weights of TFGPT2LMHeadModel were initialized from the PyTorch model.
If your task is similar to the task the model of the checkpoint was trained on, you can already use TFGPT2LMHeadModel for predictions without further training.
Setting `pad_token_id` to `eos_token_id`:50256 for open-end generation.


[{'generated_text': 'In this course, we will teach you how to get into the art of working with other artists to make your art.\n\n\nWhy should we not teach about all the techniques taught here?\nSome are just as capable of getting people in the'}]

In [6]:
generator = pipeline('text-generation', model= 'distilgpt2') 
generator("In this course, we will teach you how to",
          max_length=30,
          num_return_sequences=2) # max_length caps the total token count (prompt included); num_return_sequences sets how many texts are returned

All PyTorch model weights were used when initializing TFGPT2LMHeadModel.

All the weights of TFGPT2LMHeadModel were initialized from the PyTorch model.
If your task is similar to the task the model of the checkpoint was trained on, you can already use TFGPT2LMHeadModel for predictions without further training.
Setting `pad_token_id` to `eos_token_id`:50256 for open-end generation.


[{'generated_text': "In this course, we will teach you how to build your skills in Java/OCT for developers at NCS, and we hope you won't"},
 {'generated_text': 'In this course, we will teach you how to write and write software. But if you do, it․s hard because you cannot write the'}]
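
Each `generated_text` repeats the prompt before the newly generated tokens. If only the continuation is wanted, it can be sliced off afterwards. A minimal sketch, with the first generation copied from the output above (the helper name `continuation` is an assumption):

```python
prompt = "In this course, we will teach you how to"

# First generation copied from the output above.
outputs = [
    {
        "generated_text": "In this course, we will teach you how to build your "
        "skills in Java/OCT for developers at NCS, and we hope you won't"
    },
]


def continuation(prompt, generated_text):
    """Return only the text produced after the prompt."""
    if generated_text.startswith(prompt):
        return generated_text[len(prompt):].lstrip()
    # Fall back to the full text if the prompt is not a literal prefix.
    return generated_text


for output in outputs:
    print(continuation(prompt, output["generated_text"]))
```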

In [7]:
unmasker = pipeline("fill-mask")
unmasker("This course will teach you all about <mask> models." , top_k = 2)

No model was supplied, defaulted to distilroberta-base and revision ec58a5b (https://huggingface.co/distilroberta-base).
Using a pipeline without specifying a model name and revision in production is not recommended.
All PyTorch model weights were used when initializing TFRobertaForMaskedLM.

All the weights of TFRobertaForMaskedLM were initialized from the PyTorch model.
If your task is similar to the task the model of the checkpoint was trained on, you can already use TFRobertaForMaskedLM for predictions without further training.


[{'score': 0.19619682431221008,
  'token': 30412,
  'token_str': ' mathematical',
  'sequence': 'This course will teach you all about mathematical models.'},
 {'score': 0.040527019649744034,
  'token': 38163,
  'token_str': ' computational',
  'sequence': 'This course will teach you all about computational models.'}]
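
The `<mask>` placeholder is specific to RoBERTa-style checkpoints such as the default `distilroberta-base`; BERT-style checkpoints use `[MASK]` instead. The sketch below fills a template with the right token. `MASK_TOKENS` and `build_prompt` are illustrative names, and `bert-base-uncased` is included only as a familiar example, it is not used elsewhere in this notebook.

```python
# Mask tokens for two well-known checkpoints (illustrative lookup table).
MASK_TOKENS = {
    "distilroberta-base": "<mask>",
    "bert-base-uncased": "[MASK]",
}


def build_prompt(template, model_name):
    """Insert the model-specific mask token into a '{mask}' template."""
    return template.format(mask=MASK_TOKENS[model_name])


template = "This course will teach you all about {mask} models."
print(build_prompt(template, "distilroberta-base"))
print(build_prompt(template, "bert-base-uncased"))
```

With a loaded pipeline, `unmasker.tokenizer.mask_token` gives the same information without a hand-written table.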

In [8]:
ner = pipeline('ner', aggregation_strategy='simple') # replaces the deprecated grouped_entities=True
ner("My name is Emincan and I work at Hugging Face in Ankara-Cankaya")

No model was supplied, defaulted to dbmdz/bert-large-cased-finetuned-conll03-english and revision f2482bf (https://huggingface.co/dbmdz/bert-large-cased-finetuned-conll03-english).
Using a pipeline without specifying a model name and revision in production is not recommended.
All PyTorch model weights were used when initializing TFBertForTokenClassification.

All the weights of TFBertForTokenClassification were initialized from the PyTorch model.
If your task is similar to the task the model of the checkpoint was trained on, you can already use TFBertForTokenClassification for predictions without further training.


[{'entity_group': 'PER',
  'score': 0.9974027,
  'word': 'Emincan',
  'start': 11,
  'end': 18},
 {'entity_group': 'ORG',
  'score': 0.9865106,
  'word': 'Hugging Face',
  'start': 33,
  'end': 45},
 {'entity_group': 'LOC',
  'score': 0.9980684,
  'word': 'Ankara',
  'start': 49,
  'end': 55},
 {'entity_group': 'LOC',
  'score': 0.9839372,
  'word': 'Cankaya',
  'start': 56,
  'end': 63}]
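
The `start` and `end` fields are character offsets into the input string, so each entity can be recovered by slicing. A short sketch that checks this and groups the words by entity type, with the values copied from the output above (scores omitted for brevity):

```python
from collections import defaultdict

text = "My name is Emincan and I work at Hugging Face in Ankara-Cankaya"

# Entities copied from the output above, without the scores.
entities = [
    {"entity_group": "PER", "word": "Emincan", "start": 11, "end": 18},
    {"entity_group": "ORG", "word": "Hugging Face", "start": 33, "end": 45},
    {"entity_group": "LOC", "word": "Ankara", "start": 49, "end": 55},
    {"entity_group": "LOC", "word": "Cankaya", "start": 56, "end": 63},
]

by_group = defaultdict(list)
for entity in entities:
    # Slice the original text with the reported character offsets.
    span = text[entity["start"]:entity["end"]]
    by_group[entity["entity_group"]].append(span)

print(dict(by_group))
```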

In [9]:
question_answerer = pipeline("question-answering")
question_answerer(
    question = "Where do I work?",
    context = "My name is Emincan and I work at EmKa Academy which is online IT education platform in Turkey"
)

No model was supplied, defaulted to distilbert-base-cased-distilled-squad and revision 626af31 (https://huggingface.co/distilbert-base-cased-distilled-squad).
Using a pipeline without specifying a model name and revision in production is not recommended.
All PyTorch model weights were used when initializing TFDistilBertForQuestionAnswering.

All the weights of TFDistilBertForQuestionAnswering were initialized from the PyTorch model.
If your task is similar to the task the model of the checkpoint was trained on, you can already use TFDistilBertForQuestionAnswering for predictions without further training.


{'score': 0.9767260551452637, 'start': 33, 'end': 45, 'answer': 'EmKa Academy'}
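
This pipeline is extractive: the answer is a span of the context, and `start`/`end` are character offsets into it. A small sketch that marks the span in place, using the values from the output above (the helper `highlight` is an illustrative name):

```python
context = (
    "My name is Emincan and I work at EmKa Academy "
    "which is online IT education platform in Turkey"
)

# Result copied from the output above.
answer = {"score": 0.9767260551452637, "start": 33, "end": 45, "answer": "EmKa Academy"}


def highlight(context, start, end):
    """Wrap the answer span in square brackets inside the context."""
    return context[:start] + "[" + context[start:end] + "]" + context[end:]


highlighted = highlight(context, answer["start"], answer["end"])
print(highlighted)
```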

In [10]:
summarizer = pipeline("summarization")
summarizer("""
           The history of the lands that became the United States began with the arrival of the first people in the Americas around 15,000 BC. Numerous indigenous cultures formed, and many saw transformations in the 16th century away from more densely populated lifestyles and towards reorganized polities elsewhere. 
           The European colonization of the Americas began in the late 15th century, however most colonies in what would later become the United States were settled after 1600. By the 1760s, the thirteen British colonies contained 2.5 million people and were established along the Atlantic Coast east of the Appalachian Mountains. 
           The Southern Colonies built an agricultural system on slave labor, kidnapping people from Africa for this purpose. After defeating France, the British government imposed a series of taxes, including the Stamp Act of 1765, rejecting the colonists' constitutional argument that new taxes needed their approval. 
           Resistance to these taxes, especially the Boston Tea Party in 1773, led to Parliament issuing punitive laws designed to end self-government. Armed conflict began in Massachusetts in 1775.
           """)

No model was supplied, defaulted to t5-small and revision d769bba (https://huggingface.co/t5-small).
Using a pipeline without specifying a model name and revision in production is not recommended.
All PyTorch model weights were used when initializing TFT5ForConditionalGeneration.

All the weights of TFT5ForConditionalGeneration were initialized from the PyTorch model.
If your task is similar to the task the model of the checkpoint was trained on, you can already use TFT5ForConditionalGeneration for predictions without further training.


[{'summary_text': 'the history of the lands that became the united states began with the arrival of the first people in the Americas around 15,000 BC . the early colonization of the americans began in the late 15th century . by the 1760s, the thirteen British colonies contained 2.5 million people and were established along the Atlantic coast east of the Appalachian Mountains .'}]
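
The `t5-small` summary above comes back with lowercase sentence starts and a space before each period. If that matters for display, a light cleanup pass can help. This is a rough sketch with a made-up helper name (`tidy_summary`); it does not restore capitalization inside sentences (for example "united states").

```python
import re


def tidy_summary(text):
    """Remove spaces before sentence punctuation and capitalize sentence starts."""
    # "century ." -> "century." (decimals like "2.5" are left untouched)
    text = re.sub(r"\s+([.!?])", r"\1", text).strip()
    # Split after sentence-ending punctuation that is followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text)
    return " ".join(s[:1].upper() + s[1:] for s in sentences if s)


# Excerpt copied from the summary output above.
raw = (
    "the early colonization of the americans began in the late 15th century . "
    "by the 1760s, the thirteen British colonies contained 2.5 million people "
    "and were established along the Atlantic coast east of the Appalachian Mountains ."
)
print(tidy_summary(raw))
```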

In [12]:
translator = pipeline("translation", model= "Helsinki-NLP/opus-mt-fr-en")
translator("Ce cours est produit par Hugging Face.")

All model checkpoint layers were used when initializing TFMarianMTModel.

All the layers of TFMarianMTModel were initialized from the model checkpoint at Helsinki-NLP/opus-mt-fr-en.
If your task is similar to the task the model of the checkpoint was trained on, you can already use TFMarianMTModel for predictions without further training.


[{'translation_text': 'This course is produced by Hugging Face.'}]
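
The checkpoint name used above encodes the language pair as `opus-mt-{source}-{target}`. A tiny helper can build such names; `opus_mt_model_name` is a hypothetical function, and a checkpoint is not guaranteed to exist for every pair, so the result should be checked on the Hub before use.

```python
def opus_mt_model_name(source_lang, target_lang):
    """Build a Helsinki-NLP OPUS-MT checkpoint name for a language pair."""
    return f"Helsinki-NLP/opus-mt-{source_lang}-{target_lang}"


# French to English, the pair used in the cell above.
print(opus_mt_model_name("fr", "en"))
```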

In [None]:
# Done