Setting Up Your Environment
First, make sure Python is installed on your system. Then install the required libraries by running the following command in your terminal or command prompt. The transformers pipelines also need a deep learning backend such as PyTorch, so torch is included here:
pip install nltk transformers torch pandas
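
To confirm that the installation worked, a quick check along the following lines may help. This is a minimal sketch that uses only the standard library; the helper name missing_packages is hypothetical, and the package list simply mirrors the libraries named in this section.

```python
import importlib.util

# Package names taken from the libraries named in this section.
REQUIRED = ["nltk", "transformers", "pandas"]

def missing_packages(names):
    """Return the subset of names that cannot be found on the import path."""
    return [name for name in names if importlib.util.find_spec(name) is None]

missing = missing_packages(REQUIRED)
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required packages are importable.")
```

The check only looks packages up without importing them, so it runs quickly even for large libraries.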

Basic Text Processing with NLTK
Here's how you can start with basic text processing tasks such as tokenization (breaking text into words or sentences) using NLTK:

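import nltk
# Download the Punkt tokenizer data used by sent_tokenize and word_tokenize
nltk.download('punkt')
# Recent NLTK releases (3.9 and later) look for 'punkt_tab' instead
nltk.download('punkt_tab')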

from nltk.tokenize import word_tokenize, sent_tokenize

text = "Here is an example sentence. And another one!"

# Tokenize into sentences
sentences = sent_tokenize(text)
print("Sentences:", sentences)

# Tokenize into words
words = word_tokenize(text)
print("Words:", words)
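
A common next step after tokenization is to drop punctuation tokens and lowercase the rest. The sketch below uses a hypothetical helper, normalize_tokens, which is not part of NLTK. It works on a plain list of tokens such as the one word_tokenize returns, and the hard-coded list stands in for that output so the snippet runs without NLTK's data files.

```python
def normalize_tokens(tokens):
    """Lowercase tokens and keep only those made entirely of letters or digits."""
    return [tok.lower() for tok in tokens if tok.isalnum()]

# Stand-in for the word_tokenize output of the example text above.
tokens = ["Here", "is", "an", "example", "sentence", ".", "And", "another", "one", "!"]

print(normalize_tokens(tokens))
```

Note that the isalnum() filter also discards tokens that mix letters and punctuation, such as contraction pieces like "n't", so it is a rough filter rather than a complete cleaning step.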

Summarization with Transformers
For summarization, we'll use a pre-trained model from the transformers library through its pipeline function. Because no model name is given, the library downloads a default summarization model on first use:

from transformers import pipeline

# Load the summarization pipeline
summarizer = pipeline("summarization")

text = """
Looma Education provides a comprehensive digital library accessible through various means, including a dedicated Looma box, servers in schools, and online through AWS. The system is designed to enrich the educational experience by integrating approved textbooks with a vast array of digital resources, making learning more engaging and effective.
"""

# Perform summarization
summary = summarizer(text, max_length=50, min_length=25, do_sample=False)

print("Summary:", summary[0]['summary_text'])
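
Summarization models accept only a limited amount of input, so longer documents are usually split into pieces that are summarized one at a time. The following is a minimal sketch of that idea under stated assumptions: chunk_text and its max_words parameter are hypothetical names, and counting whitespace-separated words is only a rough stand-in for the model's real token limit.

```python
def chunk_text(text, max_words=200):
    """Split text into consecutive chunks of at most max_words words."""
    if max_words < 1:
        raise ValueError("max_words must be at least 1")
    words = text.split()
    return [
        " ".join(words[i:i + max_words])
        for i in range(0, len(words), max_words)
    ]

# Small made-up sample so the chunk boundaries are easy to see.
sample = "one two three four five six seven"
for chunk in chunk_text(sample, max_words=3):
    print(chunk)
```

Each chunk could then be passed to the summarizer in the same way as the single text above, and the partial summaries joined into one result.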

Topic Extraction (Keyword Extraction) with Transformers
The Transformers library doesn't provide a dedicated "topic extraction" pipeline, but you can use named entity recognition (NER) or keyphrase extraction as a proxy for identifying the important topics in a text. For plain keyword extraction you may want to explore other libraries or methods; here is a basic NER example with Transformers:

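from transformers import pipeline

# Load the NER pipeline; aggregation_strategy="simple" merges subword pieces
# into whole entities instead of returning one result per token
ner_pipeline = pipeline("ner", aggregation_strategy="simple")

text = "Looma Education is enhancing learning in Nepal by integrating digital resources with traditional textbooks."

# Perform NER
ner_results = ner_pipeline(text)

# Print each entity with its type (for example ORG or LOC)
for entity in ner_results:
    print(f"Entity: {entity['word']}, Type: {entity['entity_group']}")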
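
For the simpler keyword extraction mentioned above, one possible approach is to count word frequencies after removing common words. The sketch below is an illustration only: top_keywords and the tiny STOPWORDS set are hypothetical, the sample text is made up, and raw frequency is just one of many ways to rank keywords.

```python
import re
from collections import Counter

# A tiny illustrative stopword list; a real list would be much longer.
STOPWORDS = {"a", "an", "and", "by", "in", "is", "of", "the", "to", "with"}

def top_keywords(text, n=5):
    """Return the n most frequent non-stopword words in text, most frequent first."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(w for w in words if w not in STOPWORDS)
    return [word for word, _ in counts.most_common(n)]

# Made-up sample text for demonstration.
sample_text = (
    "Digital resources support learning. "
    "Digital textbooks make learning engaging, "
    "and digital libraries are growing."
)
print(top_keywords(sample_text, n=2))
```

Unlike NER, this approach needs no model download, but it only surfaces repeated words and has no notion of entity types.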