In [5]:
%pip install transformers

Note: you may need to restart the kernel to use updated packages.


In [6]:
from transformers import pipeline

In [7]:
summarizer = pipeline("summarization")

No model was supplied, defaulted to t5-small (https://huggingface.co/t5-small)
All model checkpoint layers were used when initializing TFT5ForConditionalGeneration.

All the layers of TFT5ForConditionalGeneration were initialized from the model checkpoint at t5-small.
If your task is similar to the task the model of the checkpoint was trained on, you can already use TFT5ForConditionalGeneration for predictions without further training.


In [8]:
doc_muscle_description = """The shoulder muscles surround the shoulder joint. In bodybuilding, a distinction is usually made between three parts of the shoulder: the anterior shoulder, the lateral shoulder 
and the posterior shoulder. The shoulder muscles move and stabilize the shoulders not only in one direction, but in all directions. If you want to train your shoulder optimally, 
you should include exercises for all three parts of the shoulder in your training plan. Most basic upper body exercises train at least one part of the shoulder; however, the lateral 
shoulder in particular can also be trained through isolation exercises."""

doc_exercise_description = """The pre-bent reverse fly is a rather less frequently observed exercise on the training floor. Although the exercise is excellent for the posterior shoulder and neck, we would like to point out the no less effective alternatives. Especially widespread are the reverse flys on the cable pulley and the reverse flys on the butterfly machine. Below this article you will find more variations with a picture.
The Reverse Flys are mainly aimed at strengthening the posterior part of the deltoid muscle (musculus deltoideus pars spinalis) and the middle part of the hood muscle (musculus trapezius pars transversa)."""

doc_execution_description = """Hold a dumbbell in each hand and stand about shoulder width apart. Bend your knees slightly and bend your upright upper body forward until it is parallel to the floor. Make sure that your entire back remains extended and especially that your lower back forms a slight hollow cross position. This is highly advisable to avoid back and disc injuries. Your arms with the dumbbells are down and almost fully extended.

Now exhale and simultaneously move the dumbbells upward with your arms almost extended on both sides of your body. When you reach the end point of this movement phase at head height, you are training almost exclusively the back shoulder. If you bring the shoulder blades together at the end of the movement by bringing the arms up as far as the anatomy allows, the hood muscle will be used more.

Then inhale again and simultaneously bring both dumbbells down slowly and in a controlled manner to the starting position."""

article = doc_muscle_description

In [9]:
summary = summarizer(article, max_length=90, min_length=20, do_sample=False)

In [10]:
print(summary)

[{'summary_text': 'the shoulder muscles surround the shoulder joint . the anterior shoulder, the lateral shoulder and the posterior shoulder move and stabilize the shoulders .'}]
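The pipeline returns a list with one dict per input, and the summary sits under the 'summary_text' key. As the output above shows, t5-small produces lowercase text with a space before each period. A minimal sketch for extracting and tidying that string follows; `tidy_summary` is a hypothetical helper (not part of transformers), and its capitalization rule is a naive assumption that will not handle abbreviations or proper nouns.

```python
import re

# Result copied from the output above; in the notebook this is `summary`.
summary = [{'summary_text': 'the shoulder muscles surround the shoulder joint . the anterior shoulder, the lateral shoulder and the posterior shoulder move and stabilize the shoulders .'}]

def tidy_summary(result):
    """Hypothetical helper: extract the summary text and fix spacing and capitalization."""
    text = result[0]["summary_text"].strip()
    # Remove the whitespace left before punctuation marks.
    text = re.sub(r"\s+([.,!?])", r"\1", text)
    # Naively capitalize the first letter of each sentence.
    return re.sub(
        r"(^|[.!?]\s+)([a-z])",
        lambda m: m.group(1) + m.group(2).upper(),
        text,
    )

print(tidy_summary(summary))
```

This only changes the presentation of the string; the summary content itself is untouched.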


In [11]:
# These FSMT classes require PyTorch, so `torch` must be installed for this cell to run.
from transformers import FSMTForConditionalGeneration, FSMTTokenizer

# German-to-English translation checkpoint; it expects German input text.
mname = "facebook/wmt19-de-en"
tokenizer = FSMTTokenizer.from_pretrained(mname)
model = FSMTForConditionalGeneration.from_pretrained(mname)

# Tokenize the article, generate the translation, and decode the output token IDs.
input_ids = tokenizer.encode(article, return_tensors="pt")
outputs = model.generate(input_ids)
decoded = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(decoded)


ImportError: 
FSMTForConditionalGeneration requires the PyTorch library but it was not found in your environment. Checkout the instructions on the
installation page: https://pytorch.org/get-started/locally/ and follow the ones that match your environment.
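The error is consistent with the earlier output: the summarizer loaded TFT5ForConditionalGeneration, which suggests this environment has TensorFlow but not PyTorch, while the FSMT cell asks for PyTorch tensors (return_tensors="pt"). A small sketch for checking this before running the cell, using only the standard library; `is_installed` is a hypothetical helper name, and the install command in the message is one common option rather than the only one.

```python
import importlib.util

def is_installed(package_name):
    """Hypothetical helper: True if the top-level package can be found for import."""
    return importlib.util.find_spec(package_name) is not None

if not is_installed("torch"):
    # Assumption: a plain pip install fits this environment; see the PyTorch
    # installation page for platform-specific commands.
    print("PyTorch is missing; install it (for example `pip install torch`) and restart the kernel.")
```

After installing, the kernel needs a restart so that transformers picks up the new backend.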


In [None]:
summary_en_1 = summarizer(decoded, max_length=50, min_length=10, do_sample=False)

In [None]:
print(summary_en_1)

[{'summary_text': ' In bodybuilding, three parts of the shoulder are usually distinguished: the front shoulder, the side shoulder, and the rear shoulder . Shoulder muscles move and stabilize the shoulders not only in one, but in all directions .'}]


In [None]:
summary_en_2 = summarizer(article, max_length=50, min_length=10, do_sample=False)

In [None]:
print(summary_en_2)

[{'summary_text': ' In bodybuilding, a distinction is usually made between the anterior shoulder, the lateral shoulder and the posterior shoulder . Most basic exercises for the upper body train at least one part of the shoulder . The lateral shoulder in particular can also be trained'}]
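The last summary stops mid-sentence ("can also be trained"), most likely because generation reached max_length=50. Raising max_length is one option; another is to trim the dangling fragment afterwards. The sketch below does the latter with a hypothetical helper, `drop_incomplete_sentence`, which assumes sentences end in '.', '!' or '?'.

```python
# Result copied from the output above; in the notebook this is `summary_en_2`.
summary_en_2 = [{'summary_text': ' In bodybuilding, a distinction is usually made between the anterior shoulder, the lateral shoulder and the posterior shoulder . Most basic exercises for the upper body train at least one part of the shoulder . The lateral shoulder in particular can also be trained'}]

def drop_incomplete_sentence(text):
    """Hypothetical helper: keep the text up to the last sentence-ending mark."""
    text = text.strip()
    # Position of the last '.', '!' or '?'; -1 if none of them occurs.
    last_end = max(text.rfind(mark) for mark in ".!?")
    # Leave the text unchanged when it contains no complete sentence.
    return text[: last_end + 1] if last_end != -1 else text

print(drop_incomplete_sentence(summary_en_2[0]["summary_text"]))
```

Trimming loses the point about the lateral shoulder, so a higher max_length may be preferable when that detail matters.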
