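You create a pipeline with a single call, passing the name of the task:
my_pipeline = pipeline("The task I want to do")
and then call it on your input:
result = my_pipeline(my_input)
That's it: the pipeline takes care of preprocessing, model inference, and postprocessing.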
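To make the "create it, then call it" pattern concrete, here is a toy sketch of what a pipeline object does conceptually. This is not the transformers implementation: `ToyPipeline`, `toy_model`, and the word list are hypothetical names invented for illustration, and the "model" is just a keyword counter.

```python
from typing import Callable, Dict, List


class ToyPipeline:
    """Chains three steps, the way a real pipeline chains tokenizer, model, and output formatting."""

    def __init__(self, preprocess: Callable, model: Callable, postprocess: Callable):
        self.preprocess = preprocess
        self.model = model
        self.postprocess = postprocess

    def __call__(self, text: str) -> List[Dict]:
        # raw input -> model input -> raw model output -> user-friendly result
        return self.postprocess(self.model(self.preprocess(text)))


POSITIVE_WORDS = {"love", "great", "good"}  # hypothetical stand-in for learned weights


def preprocess(text: str) -> List[str]:
    # Stand-in for tokenization: lowercase and split on whitespace
    return text.lower().split()


def toy_model(tokens: List[str]) -> float:
    # Stand-in for inference: fraction of tokens that are "positive" words
    hits = sum(token.strip(".,!?") in POSITIVE_WORDS for token in tokens)
    return hits / max(len(tokens), 1)


def postprocess(score: float) -> List[Dict]:
    # Stand-in for turning raw model output into a labelled result
    return [{"label": "POSITIVE" if score > 0 else "NEGATIVE", "score": score}]


toy_classifier = ToyPipeline(preprocess, toy_model, postprocess)
toy_result = toy_classifier("I love this library!")
print(toy_result)
```

The point is only the shape of the API: one object built once, then called like a function on each input.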

In [1]:
!pip install transformers datasets diffusers

Collecting diffusers
  Downloading diffusers-0.33.1-py3-none-any.whl.metadata (19 kB)
Downloading diffusers-0.33.1-py3-none-any.whl (3.6 MB)
Successfully installed diffusers-0.33.1


In [14]:
# Import the libraries used throughout this notebook
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM, pipeline
from datasets import load_dataset
import soundfile as sf
from diffusers import StableDiffusionPipeline, DiffusionPipeline
from IPython.display import display, Audio

In [None]:
# sentiment analysis
classifier = pipeline("sentiment-analysis")
results = classifier("I love using Hugging Face's transformers library!")
print(results)

No model was supplied, defaulted to distilbert/distilbert-base-uncased-finetuned-sst-2-english and revision 714eb0f (https://huggingface.co/distilbert/distilbert-base-uncased-finetuned-sst-2-english).
Using a pipeline without specifying a model name and revision in production is not recommended.
Device set to use mps:0


[{'label': 'NEGATIVE', 'score': 0.9985520243644714}]
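A note on the `score` field: as far as I understand, for single-label classification the pipeline applies a softmax to the model's raw logits and reports the probability of the winning label. The sketch below reproduces that step in plain Python. The logits are made-up numbers chosen for illustration, not values taken from the model.

```python
import math
from typing import List


def softmax(logits: List[float]) -> List[float]:
    # Subtract the max before exponentiating for numerical stability
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


labels = ["NEGATIVE", "POSITIVE"]  # label order used by the SST-2 checkpoint
example_logits = [-3.0, 3.5]       # hypothetical raw model outputs

probs = softmax(example_logits)
best = max(range(len(probs)), key=probs.__getitem__)
prediction = {"label": labels[best], "score": probs[best]}
print(prediction)
```

A score close to 1.0 therefore means the two logits were far apart, not that the model is guaranteed to be right.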


In [8]:
# Named Entity Recognition (NER)
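# aggregation_strategy="simple" merges sub-word tokens into whole entities
# (it replaces the deprecated grouped_entities=True flag)
ner = pipeline("ner", aggregation_strategy="simple")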
results = ner("Barack Obama was the 44th President of the United States.")
print(results)

No model was supplied, defaulted to dbmdz/bert-large-cased-finetuned-conll03-english and revision 4c53496 (https://huggingface.co/dbmdz/bert-large-cased-finetuned-conll03-english).
Using a pipeline without specifying a model name and revision in production is not recommended.
Some weights of the model checkpoint at dbmdz/bert-large-cased-finetuned-conll03-english were not used when initializing BertForTokenClassification: ['bert.pooler.dense.bias', 'bert.pooler.dense.weight']
- This IS expected if you are initializing BertForTokenClassification from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
- This IS NOT expected if you are initializing BertForTokenClassification from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).
Device set to use mps:0


[{'entity_group': 'PER', 'score': 0.99913895, 'word': 'Barack Obama', 'start': 0, 'end': 12}, {'entity_group': 'LOC', 'score': 0.9952142, 'word': 'United States', 'start': 43, 'end': 56}]
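The `start` and `end` fields are character offsets into the input string, so you can slice the original text to recover each entity exactly as written. A small sketch using the output above; `extract_spans` and its `min_score` threshold are hypothetical helpers, not part of the library.

```python
from typing import Dict, List, Tuple

text = "Barack Obama was the 44th President of the United States."

# Copied from the NER output above
ner_results = [
    {"entity_group": "PER", "score": 0.99913895, "word": "Barack Obama", "start": 0, "end": 12},
    {"entity_group": "LOC", "score": 0.9952142, "word": "United States", "start": 43, "end": 56},
]


def extract_spans(source: str, entities: List[Dict], min_score: float = 0.9) -> List[Tuple[str, str]]:
    # Keep confident entities and slice the source text with their offsets
    return [
        (entity["entity_group"], source[entity["start"]:entity["end"]])
        for entity in entities
        if entity["score"] >= min_score
    ]


spans = extract_spans(text, ner_results)
print(spans)
```

Slicing by offset is safer than relying on `word`, which can differ from the source text in casing or spacing depending on the tokenizer.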


In [10]:
# Text Summarization
summarizer = pipeline("summarization")
# Sample text: tips for improving your productivity
text = """
Productivity is a measure of how efficiently you can complete tasks and achieve goals. Improving your productivity can lead to better time management, increased efficiency, and a greater sense of accomplishment. Here are some tips to help you improve your productivity:
1. Set clear goals: Define what you want to achieve and break it down into smaller, manageable tasks.
2. Prioritize tasks: Identify the most important tasks and focus on them first.
3. Eliminate distractions: Minimize interruptions and create a conducive work environment.
4. Use time management techniques: Techniques like the Pomodoro Technique can help you stay focused and manage your time effectively.
5. Take breaks: Regular breaks can help you recharge and maintain focus.
6. Stay organized: Keep your workspace tidy and use tools like to-do lists or project management software to stay on track.
7. Practice self-care: Ensure you get enough sleep, exercise, and maintain a healthy diet to keep your mind sharp.
8. Continuously learn: Stay updated with new skills and knowledge to enhance your productivity.
"""
summary = summarizer(text, max_length=70, min_length=30, do_sample=False)
print(summary)

# Text Generation
generator = pipeline("text-generation", model="gpt2")
prompt = "Once upon a time in a land far, far away"
num_return_sequences = 3
# max_length counts the prompt tokens as well as the generated ones
results = generator(prompt, max_length=50, num_return_sequences=num_return_sequences)
for i, result in enumerate(results):
    print(f"Generated text {i+1}: {result['generated_text']}")

No model was supplied, defaulted to sshleifer/distilbart-cnn-12-6 and revision a4f8f3e (https://huggingface.co/sshleifer/distilbart-cnn-12-6).
Using a pipeline without specifying a model name and revision in production is not recommended.
Device set to use mps:0


[{'summary_text': ' Productivity is a measure of how efficiently you can complete tasks and achieve goals . Improving your productivity can lead to better time management, increased efficiency and a greater sense of accomplishment .'}]


Device set to use mps:0
Truncation was not explicitly activated but `max_length` is provided a specific value, please use `truncation=True` to explicitly truncate examples to max length. Defaulting to 'longest_first' truncation strategy. If you encode pairs of sequences (GLUE-style) with the tokenizer you can select this strategy more precisely by providing a specific strategy to `truncation`.
Setting `pad_token_id` to `eos_token_id`:50256 for open-end generation.


Generated text 1: Once upon a time in a land far, far away, in darkness with all the stars in its sky; they heard the strange noises that were passing, the footsteps that came, the movements of the stars that were passing through those distant halls and halls
Generated text 2: Once upon a time in a land far, far away, where magic had not ceased to be in force, in any place of life, it was said that an elephant was standing at the top of the hill, who stood motionless. It was
Generated text 3: Once upon a time in a land far, far away from our ancestors, these creatures are known as The Ones of Nature. The One who first created us was not a spirit, but an entity. It is all, all in all; our own


In [12]:
# Translation
translator = pipeline("translation_en_to_fr")
text = "Hello, how are you? I would like to learn more about you."
translation = translator(text)
print(f"Translation: {translation[0]['translation_text']}")

No model was supplied, defaulted to google-t5/t5-base and revision a9723ea (https://huggingface.co/google-t5/t5-base).
Using a pipeline without specifying a model name and revision in production is not recommended.
Device set to use mps:0


Translation: Bonjour, j'aimerais en savoir plus sur vous.
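The warning above says that relying on the default model is not recommended in production. One way to address it is to pass the model name and revision explicitly, so the pipeline keeps using the same checkpoint even if the library's default changes. A minimal sketch, assuming the short revision hash printed in the log is accepted as a `revision` value; `build_translator` is a hypothetical helper name.

```python
TASK = "translation_en_to_fr"
MODEL_ID = "google-t5/t5-base"  # default model reported in the log above
REVISION = "a9723ea"            # revision reported in the log above


def build_translator():
    # Imported lazily so that defining this helper does not load any model
    from transformers import pipeline

    return pipeline(TASK, model=MODEL_ID, revision=REVISION)
```

Calling `translator = build_translator()` and then `translator(text)` should behave like the cell above, minus the "No model was supplied" warning.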


In [None]:
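# Image generation
# Pick the best available device; half precision needs a GPU backend
if torch.cuda.is_available():
    device = "cuda"
elif torch.backends.mps.is_available():
    device = "mps"
else:
    device = "cpu"
dtype = torch.float32 if device == "cpu" else torch.float16
# Load the Stable Diffusion pipeline and move it to the device
model_id = "CompVis/stable-diffusion-v1-4"
pipe = StableDiffusionPipeline.from_pretrained(model_id, torch_dtype=dtype)
pipe = pipe.to(device)
prompt = "A fantasy landscape with mountains and a river"
image = pipe(prompt).images[0]
# Save the image
image.save("fantasy_landscape.png")
# Display the image
display(image)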

In [31]:
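# Audio generation
# The model id is "microsoft/speecht5_tts"; a misspelled id such as
# "microsoft/speech5_tts" raises an OSError saying it is not a valid model identifier
synthesizer = pipeline("text-to-speech", "microsoft/speecht5_tts")
# SpeechT5 needs a speaker embedding (x-vector) to choose the voice.
# The dataset and index below follow the example on the model card.
embeddings_dataset = load_dataset("Matthijs/cmu-arctic-xvectors", split="validation")
speaker_embedding = torch.tensor(embeddings_dataset[7306]["xvector"]).unsqueeze(0)
text = "Hello, how are you? I would like to learn more about you."
# Generate audio; the pipeline returns a dict with "audio" and "sampling_rate"
audio = synthesizer(text, forward_params={"speaker_embeddings": speaker_embedding})
# Save the audio at the sampling rate reported by the pipeline
sf.write("output.wav", audio["audio"], samplerate=audio["sampling_rate"])
# Display the audio
display(Audio("output.wav"))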