In [1]:
import transformers
from transformers import pipeline



In [2]:
# Try creating a simple sentiment-analysis pipeline
try:
    classifier = pipeline("sentiment-analysis")
    result = classifier("Hello, world!")
    print(result)
except Exception as e:
    print(f"Error: {e}")


No model was supplied, defaulted to distilbert/distilbert-base-uncased-finetuned-sst-2-english and revision af0f99b (https://huggingface.co/distilbert/distilbert-base-uncased-finetuned-sst-2-english).
Using a pipeline without specifying a model name and revision in production is not recommended.


[{'label': 'POSITIVE', 'score': 0.9997164607048035}]
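
The log above advises against relying on the default model in production. A minimal sketch of pinning it explicitly, assuming the checkpoint and short revision hash printed in that log are the ones you want; `PINNED` and `build_classifier` are hypothetical helper names, not part of the library:

```python
# Model id and revision copied from the log message above.
# Pinning exactly these values is an assumption; use whichever checkpoint you have validated.
PINNED = {
    "task": "sentiment-analysis",
    "model": "distilbert/distilbert-base-uncased-finetuned-sst-2-english",
    "revision": "af0f99b",
}


def build_classifier():
    """Create the pipeline with an explicit model and revision."""
    # Imported lazily so the configuration can be inspected without loading transformers.
    from transformers import pipeline

    return pipeline(PINNED["task"], model=PINNED["model"], revision=PINNED["revision"])
```

Calling `build_classifier()("Hello, world!")` should then behave like the cell above, without the "No model was supplied" message.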


In [4]:
# Several sentences:

classifier(
    ["I've been waiting for a HuggingFace course my whole life.", "I hate this so much!"]
)

[{'label': 'POSITIVE', 'score': 0.9598049521446228},
 {'label': 'NEGATIVE', 'score': 0.9994558691978455}]
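
When a list is passed, the pipeline returns one result per input, in the same order. A small sketch of pairing sentences with their predictions to keep only confident negatives; the `negatives` helper and the 0.9 threshold are illustrative assumptions:

```python
# Inputs and outputs copied from the cell above.
sentences = [
    "I've been waiting for a HuggingFace course my whole life.",
    "I hate this so much!",
]
results = [
    {"label": "POSITIVE", "score": 0.9598049521446228},
    {"label": "NEGATIVE", "score": 0.9994558691978455},
]


def negatives(sentences, results, threshold=0.9):
    """Return the sentences labelled NEGATIVE with a score of at least `threshold`."""
    return [
        sentence
        for sentence, result in zip(sentences, results)
        if result["label"] == "NEGATIVE" and result["score"] >= threshold
    ]


flagged = negatives(sentences, results)
print(flagged)
```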

In [5]:
classifier = pipeline("zero-shot-classification")
classifier(
    "This is a course about the Transformers library",
    candidate_labels=["education", "politics", "business"],
)

No model was supplied, defaulted to facebook/bart-large-mnli and revision c626438 (https://huggingface.co/facebook/bart-large-mnli).
Using a pipeline without specifying a model name and revision in production is not recommended.


{'sequence': 'This is a course about the Transformers library',
 'labels': ['education', 'business', 'politics'],
 'scores': [0.8445989489555359, 0.11197416484355927, 0.04342687129974365]}

In [13]:
classifier = pipeline("zero-shot-classification")
classifier(
    "The politics course at my university helped me a lot in my business.",
    candidate_labels=["education", "politics", "business"],
)

No model was supplied, defaulted to facebook/bart-large-mnli and revision c626438 (https://huggingface.co/facebook/bart-large-mnli).
Using a pipeline without specifying a model name and revision in production is not recommended.


{'sequence': 'The politics course at my university helped me a lot in my business.',
 'labels': ['politics', 'business', 'education'],
 'scores': [0.7526328563690186, 0.2084205448627472, 0.03894660994410515]}
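
In both zero-shot outputs the labels come back sorted by descending score, and the scores add up to roughly 1, so they compete with each other. A short sketch that checks this on the last output and reads off the top label; the variable names are illustrative:

```python
import math

# Output copied from the cell above.
output = {
    "sequence": "The politics course at my university helped me a lot in my business.",
    "labels": ["politics", "business", "education"],
    "scores": [0.7526328563690186, 0.2084205448627472, 0.03894660994410515],
}

# Map each label to its score for convenient lookup.
ranked = dict(zip(output["labels"], output["scores"]))

# The first label is the highest-scoring one because the list is sorted.
top_label = output["labels"][0]

# With the default settings the scores behave like a probability distribution.
total = sum(output["scores"])
print(top_label, round(total, 6))
```

For a sentence like this one, which arguably touches several topics, passing `multi_label=True` to the classifier call scores each label independently instead; the scores then no longer need to sum to 1.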

In [15]:
generator = pipeline("text-generation")
generator("In this course, we will teach you how to")

No model was supplied, defaulted to openai-community/gpt2 and revision 6c0e608 (https://huggingface.co/openai-community/gpt2).
Using a pipeline without specifying a model name and revision in production is not recommended.
Setting `pad_token_id` to `eos_token_id`:50256 for open-end generation.


[{'generated_text': 'In this course, we will teach you how to create and understand software libraries using tools such as C/C++, and you will learn how to write code with C, like in Rust.\n\nWe will create reusable and compact libraries with custom'}]

In [17]:
generator = pipeline("text-generation", model="distilgpt2")
generator(
    "In this course, we will teach you how to",
    max_length=30,
    num_return_sequences=2,
)

Truncation was not explicitly activated but `max_length` is provided a specific value, please use `truncation=True` to explicitly truncate examples to max length. Defaulting to 'longest_first' truncation strategy. If you encode pairs of sequences (GLUE-style) with the tokenizer you can select this strategy more precisely by providing a specific strategy to `truncation`.
Setting `pad_token_id` to `eos_token_id`:50256 for open-end generation.


[{'generated_text': "In this course, we will teach you how to use a new plugin when you're not using your own plugins.\n\n\n\nTo learn more"},
 {'generated_text': 'In this course, we will teach you how to be a better member of the community in an environment where you can also build relationships and trust in the'}]

In [18]:
unmasker = pipeline("fill-mask")
unmasker("This course will teach you all about <mask> models.", top_k=2)

No model was supplied, defaulted to distilbert/distilroberta-base and revision ec58a5b (https://huggingface.co/distilbert/distilroberta-base).
Using a pipeline without specifying a model name and revision in production is not recommended.
Some weights of the model checkpoint at distilbert/distilroberta-base were not used when initializing RobertaForMaskedLM: ['roberta.pooler.dense.bias', 'roberta.pooler.dense.weight']
- This IS expected if you are initializing RobertaForMaskedLM from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
- This IS NOT expected if you are initializing RobertaForMaskedLM from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).


[{'score': 0.196197971701622,
  'token': 30412,
  'token_str': ' mathematical',
  'sequence': 'This course will teach you all about mathematical models.'},
 {'score': 0.0405271016061306,
  'token': 38163,
  'token_str': ' computational',
  'sequence': 'This course will teach you all about computational models.'}]
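
The `<mask>` placeholder is specific to the RoBERTa-style checkpoint used above; BERT-style checkpoints expect `[MASK]` instead. A minimal sketch of building the prompt from whichever mask token the model uses, where `make_prompt` is a hypothetical helper:

```python
def make_prompt(mask_token):
    """Build the example sentence around the given mask token."""
    return f"This course will teach you all about {mask_token} models."


# Mask tokens for the two common conventions.
roberta_prompt = make_prompt("<mask>")
bert_prompt = make_prompt("[MASK]")
print(roberta_prompt)
print(bert_prompt)
```

With a loaded pipeline, the token can be read from the tokenizer rather than hard-coded, for example `unmasker(make_prompt(unmasker.tokenizer.mask_token), top_k=2)`.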

In [21]:
ner = pipeline("ner", aggregation_strategy="simple")
ner("In this episode, we had the opportunity to chat with Andrea Quirós, founder of Loopa, a startup that measures carbon footprint to inform consumers when purchasing furniture. Andrea is a furniture designer and holds a master's degree in engineering and design from MIT. We discussed her company, where data plays a fundamental role in success. She openly spoke about her journey to learn and incorporate a data-driven vision into Loopa.")

No model was supplied, defaulted to dbmdz/bert-large-cased-finetuned-conll03-english and revision f2482bf (https://huggingface.co/dbmdz/bert-large-cased-finetuned-conll03-english).
Using a pipeline without specifying a model name and revision in production is not recommended.
Some weights of the model checkpoint at dbmdz/bert-large-cased-finetuned-conll03-english were not used when initializing BertForTokenClassification: ['bert.pooler.dense.bias', 'bert.pooler.dense.weight']
- This IS expected if you are initializing BertForTokenClassification from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
- This IS NOT expected if you are initializing BertForTokenClassification from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).


[{'entity_group': 'PER',
  'score': 0.9415091,
  'word': 'Francesco Giorgetti',
  'start': 33,
  'end': 52},
 {'entity_group': 'LOC',
  'score': 0.4346513,
  'word': 'Sierra',
  'start': 92,
  'end': 98},
 {'entity_group': 'ORG',
  'score': 0.7233525,
  'word': '##Col',
  'start': 98,
  'end': 101},
 {'entity_group': 'LOC',
  'score': 0.99221087,
  'word': 'Bogotá',
  'start': 109,
  'end': 115},
 {'entity_group': 'PER',
  'score': 0.9980458,
  'word': 'Francesco',
  'start': 447,
  'end': 456}]

In [22]:
question_answerer = pipeline("question-answering")
question_answerer(
    question="Where do I work?",
    context="My name is Sylvain and I work at Hugging Face in Brooklyn",
)

No model was supplied, defaulted to distilbert/distilbert-base-cased-distilled-squad and revision 626af31 (https://huggingface.co/distilbert/distilbert-base-cased-distilled-squad).
Using a pipeline without specifying a model name and revision in production is not recommended.


{'score': 0.6949766278266907, 'start': 33, 'end': 45, 'answer': 'Hugging Face'}
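
The `start` and `end` fields are character offsets into the context, so the answer is extracted from the text rather than generated. A quick check using the values from the output above:

```python
# Context and output copied from the cell above.
context = "My name is Sylvain and I work at Hugging Face in Brooklyn"
answer = {
    "score": 0.6949766278266907,
    "start": 33,
    "end": 45,
    "answer": "Hugging Face",
}

# Slicing the context with the returned offsets recovers the answer text.
span = context[answer["start"]:answer["end"]]
print(span)
```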

In [23]:
translator = pipeline("translation", model="Helsinki-NLP/opus-mt-en-es")
translator("My name is Sylvain and I work at Hugging Face in Brooklyn.")



[{'translation_text': 'Me llamo Sylvain y trabajo en Hugging Face en Brooklyn.'}]