## Pipeline implementation steps
##### a) Reading the text files
##### b) Choosing the pipeline
##### c) Reconstructing the text

---------------------------------------------------------------------------------------------

### a) Reading the text files

In [1]:
try: 
    with open('../text_files/text1.txt', 'r', encoding='utf-8') as file:
        text1 = file.read()
    print("File content successfully loaded into text1.")
except FileNotFoundError:
    print("Error: The file 'text1.txt' was not found. Please make sure the file exists and the path is correct.")
except Exception as e:
    print(f"An unexpected error occurred: {e}")

File content successfully loaded into text1.


In [2]:
try:
    with open('../text_files/text2.txt', 'r', encoding='utf-8') as file:
        text2 = file.read()
    print("File content successfully loaded into text2.")
except FileNotFoundError:
    print("Error: The file 'text2.txt' was not found. Please make sure the file exists and the path is correct.")
except Exception as e:
    print(f"An unexpected error occurred: {e}")

File content successfully loaded into text2.
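
The two cells above differ only in the file name. One possible way to avoid the repetition is a small helper, sketched here. `load_text` is a hypothetical name that does not appear in the original notebook, and returning `None` for a missing file is an assumption about how the caller would want to handle that case.

```python
from pathlib import Path
from typing import Optional


def load_text(path: str) -> Optional[str]:
    """Return the UTF-8 contents of `path`, or None if the file does not exist."""
    try:
        return Path(path).read_text(encoding="utf-8")
    except FileNotFoundError:
        print(f"Error: the file '{path}' was not found.")
        return None
```

With such a helper, the two cells would reduce to `text1 = load_text('../text_files/text1.txt')` and the equivalent call for `text2.txt`.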


### b) Choosing the pipeline

In [3]:
from transformers import pipeline

# Pipelines are keyed by a descriptive name; this notebook uses a single entry.
model = {"Coherence & Flow (distilbart)": pipeline("text2text-generation", model="sshleifer/distilbart-cnn-12-6")}

Device set to use cpu
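
Because `model` is a dictionary keyed by a descriptive name, it could hold several candidate pipelines. Building each one eagerly downloads its weights up front, so one option is to create them on first use. The sketch below is an assumption about how that might look: `PIPELINE_SPECS`, `get_pipeline` and the `builder` parameter are hypothetical names, and only the task and checkpoint from the cell above are listed.

```python
from typing import Any, Callable, Dict, Tuple

# Display name -> (task, checkpoint). The only entry is the one used in this notebook.
PIPELINE_SPECS: Dict[str, Tuple[str, str]] = {
    "Coherence & Flow (distilbart)": ("text2text-generation", "sshleifer/distilbart-cnn-12-6"),
}

# Pipelines that have already been built, keyed by display name.
_loaded: Dict[str, Any] = {}


def _build_with_transformers(task: str, checkpoint: str) -> Any:
    # Imported lazily so that defining the registry does not load the library.
    from transformers import pipeline
    return pipeline(task, model=checkpoint)


def get_pipeline(name: str, builder: Callable[[str, str], Any] = _build_with_transformers) -> Any:
    """Build the named pipeline on first request and reuse it afterwards."""
    if name not in _loaded:
        task, checkpoint = PIPELINE_SPECS[name]
        _loaded[name] = builder(task, checkpoint)
    return _loaded[name]
```

The `builder` argument exists mainly so the lookup logic can be exercised with a stand-in, without downloading a model.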


### c) Reconstructing the text
- Split the texts into sentences

In [4]:
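import nltk
from nltk.tokenize import sent_tokenize

# sent_tokenize relies on the NLTK Punkt data. If it is missing, download it once
# with nltk.download("punkt") (recent NLTK releases may ask for "punkt_tab" instead).
sentences_text1 = sent_tokenize(text1)
sentences_text2 = sent_tokenize(text2)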
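
The source files wrap long sentences across lines, so the strings returned by `sent_tokenize` can contain embedded newlines and indentation (visible in the `Original:` lines printed by the reconstruction cells). A minimal sketch of collapsing that whitespace before a sentence is passed to the model follows. `normalize_whitespace` is a hypothetical helper that the original notebook does not use.

```python
def normalize_whitespace(sentence: str) -> str:
    """Collapse runs of spaces, tabs and newlines into single spaces."""
    # str.split() with no arguments splits on any whitespace and drops empty pieces.
    return " ".join(sentence.split())


# A fragment shaped like the wrapped sentences in text2.
wrapped = "it not included the full feedback from reviewer or\n        maybe editor?"
print(normalize_whitespace(wrapped))
```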

- Loop over the sentences and reconstruct each one

In [5]:
reconstructed_text1 = ""
# Look up the pipeline once instead of on every iteration.
model_pipeline = model["Coherence & Flow (distilbart)"]
for sentence in sentences_text1:
    # Deterministic decoding (no sampling), with the output capped at 256 tokens.
    result = model_pipeline(sentence, max_length=256, do_sample=False)[0]["generated_text"]
    print(f"Original: {sentence}")
    print(f"→ Transformed: {result}\n")
    reconstructed_text1 += result

Original: Today is our dragon boat festival, in our Chinese culture, to celebrate it with all safe and great in
our lives.
→ Transformed:  Today is our dragon boat festival, in our Chinese culture, to celebrate it with all safe and great in our lives . Celebrations include dragon boat festivals, dragon boat celebrations, and dragon boat races, in Chinese culture . We are happy to celebrate the festival with safe, great and great celebrations .

Original: Hope you too, to enjoy it as my deepest wishes.
→ Transformed:  "Hope you too, to enjoy it as my deepest wishes. Hope you too," she said. "Enjoy it as I do. My deepest wishes." She added that she hopes you too enjoy it, as well as the rest of the world . Enjoy it, she said, and it's a good time for you to enjoy your life .

Original: Thank your message to show our words to the doctor, as his next contract checking, to all of us.
→ Transformed:  Thank your message to show our words to the doctor, as his next contract checking, to all of

In [6]:
reconstructed_text2 = ""
# Look up the pipeline once instead of on every iteration.
model_pipeline = model["Coherence & Flow (distilbart)"]
for sentence in sentences_text2:
    # Deterministic decoding (no sampling), with the output capped at 256 tokens.
    result = model_pipeline(sentence, max_length=256, do_sample=False)[0]["generated_text"]
    print(f"Original: {sentence}")
    print(f"→ Transformed: {result}\n")
    reconstructed_text2 += result

Original: During our final discuss, I told him about the new submission — the one we were waiting since
        last autumn, but the updates was confusing as it not included the full feedback from reviewer or
        maybe editor?
→ Transformed:  The new submission has been submitted since last autumn, but the updates was confusing as it not included the full feedback from reviewer or reviewer or editor . During our final discuss, I told him about the new submission — the one we were waiting since autumn, he said . But the updates were confusing as they not included . It was confusing to include feedback from reviewers or editor or reviewer .

Original: Anyway, I believe the team, although bit delay and less communication at recent days, they really
        tried best for paper and cooperation.
→ Transformed:  "I believe the team, although bit delay and less communication, they really tried best for paper and cooperation. Anyway, I believe . I believe    team, although . bit delay  and
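
The loops for `text1` and `text2` are identical apart from the variable names, and in the outputs above the transformed text is often longer than the original sentence and repeats parts of it. The sketch below is one possible refactor, not the notebook's method: `reconstruct` and `rewrite` are hypothetical names, the results are stripped and joined with single spaces (unlike the plain concatenation in the cells above), and the word-count comparison is only a rough heuristic for flagging outputs that grew.

```python
from typing import Callable, Iterable, List, Tuple


def reconstruct(sentences: Iterable[str], rewrite: Callable[[str], str]) -> Tuple[str, List[str]]:
    """Rewrite each sentence; return the joined text and the inputs whose output grew."""
    parts: List[str] = []
    grew: List[str] = []
    for sentence in sentences:
        result = rewrite(sentence).strip()
        parts.append(result)
        # Rough heuristic: more words out than in suggests the model added content.
        if len(result.split()) > len(sentence.split()):
            grew.append(sentence)
    return " ".join(parts), grew


# Stand-in rewriter for illustration; in the notebook this would wrap the pipeline call.
def echo_twice(sentence: str) -> str:
    return f"{sentence} {sentence}"


text, flagged = reconstruct(["Hope you too.", "Thank you."], echo_twice)
print(text)
print(flagged)
```

In the notebook, `rewrite` could be `lambda s: model_pipeline(s, max_length=256, do_sample=False)[0]["generated_text"]`, which is the same call the loops already make.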

### Saving the reconstructed texts

In [7]:
try:
    with open("../text_files/third_pipeline_text1.txt", 'w', encoding='utf-8') as outfile:
        outfile.write(reconstructed_text1)
    print("Successfully saved reconstructed text")
except Exception as e:
    print(f"Error saving file: {e}")

Successfully saved reconstructed text 


In [8]:
try:
    with open("../text_files/third_pipeline_text2.txt", 'w', encoding='utf-8') as outfile:
        outfile.write(reconstructed_text2)
    print("Successfully saved reconstructed text")
except Exception as e:
    print(f"Error saving file: {e}")

Successfully saved reconstructed text 
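
Both save cells follow the same pattern, so they could share a helper as well. This is a sketch under assumptions: `save_text` is a hypothetical name, and creating the parent directory is an addition that the original cells do not make.

```python
from pathlib import Path


def save_text(path: str, text: str) -> bool:
    """Write `text` to `path` as UTF-8 and report whether it succeeded."""
    try:
        target = Path(path)
        # Create the output directory if it does not exist yet.
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_text(text, encoding="utf-8")
        return True
    except OSError as e:
        print(f"Error saving file: {e}")
        return False
```

A call such as `save_text("../text_files/third_pipeline_text1.txt", reconstructed_text1)` would then replace the body of each cell.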
