# How to Use Text Rank

[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/alvaro-francisco-gil/text-rank/blob/main/examples/using_text_rank.ipynb)


## Install the Library

In [None]:
!pip install git+https://github.com/alvaro-francisco-gil/text-rank.git

## Basic Usage

In [None]:
from text_rank import TextRankKeywordExtractor

# Create an extractor; window_size sets the co-occurrence window used to link words
extractor = TextRankKeywordExtractor(window_size=5)

In [2]:
text = """
Natural language processing (NLP) is a subfield of linguistics, computer science, and artificial intelligence 
concerned with the interactions between computers and human language. It is used to apply algorithms to identify 
and extract the natural language rules such that unstructured language data is converted into a form that computers 
can understand.
"""

keywords = extractor.extract_keywords(text, top_n=10)
for word, score in keywords:
    print(f"{word}: {score:.4f}")


Extracted keywords: [('machine', 0.15525755334511443), ('learning', 0.15525755334511443), ('processing', 0.13480829115294352), ('fields', 0.13480829115294352), ('computer', 0.11485540497728197), ('language', 0.11485540497728196), ('science', 0.09507875052466004), ('natural', 0.09507875052466004)]
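To give a feel for what a TextRank-style extractor does with `window_size`, here is a minimal, dependency-free sketch of the general technique: link words that share a sliding window, then score them with PageRank. It is not the library's implementation. The helper names `build_graph` and `rank`, the tiny stop-word list, and the damping factor of 0.85 are all assumptions made for illustration.

```python
import re
from collections import defaultdict

# Tiny illustrative stop-word list (an assumption, not the library's list)
STOP_WORDS = {"a", "an", "and", "the", "of", "is", "it", "to", "with",
              "that", "such", "into", "can", "between", "are", "in"}


def build_graph(text, window_size=5):
    """Link words that share a sliding window of `window_size` tokens."""
    tokens = [t for t in re.findall(r"[a-z]+", text.lower())
              if t not in STOP_WORDS]
    graph = defaultdict(set)
    for i, word in enumerate(tokens):
        # The window covers the word itself plus the next window_size - 1 tokens
        for other in tokens[i + 1:i + window_size]:
            if other != word:
                graph[word].add(other)
                graph[other].add(word)
    return graph


def rank(graph, damping=0.85, iterations=50):
    """Score nodes with PageRank (power iteration) on an undirected graph."""
    scores = {node: 1.0 / len(graph) for node in graph}
    for _ in range(iterations):
        scores = {
            node: (1 - damping) / len(graph)
            + damping * sum(scores[nb] / len(graph[nb]) for nb in graph[node])
            for node in graph
        }
    # Highest-scoring words first, as (word, score) pairs
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)
```

Calling `rank(build_graph(text, window_size=5))[:10]` returns `(word, score)` pairs in the same shape as the loop above consumes. A larger window connects more distant words, which tends to flatten the score differences.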


## Analyzing Text Files

In [None]:
from text_rank.utils import analyze_text_file, analyze_multiple_files

# Analyze a single file (all keywords)
keywords = analyze_text_file('your_text_file.txt')

# Analyze a single file (top 10 keywords)
top_keywords = analyze_text_file('your_text_file.txt', top_n=10)

# Analyze multiple files
file_paths = ['file1.txt', 'file2.txt', 'file3.txt']
results = analyze_multiple_files(file_paths)  # Returns all keywords for each file

## Exporting the Co-occurrence Graph

In [None]:
# Build and export the graph
graph = extractor.build_cooccurrence_graph(text)
extractor.export_pajek(graph, 'output_graph.net')
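
For readers unfamiliar with the Pajek `.net` format, the sketch below writes a small weighted graph by hand. Exactly what `export_pajek` emits is not shown here, so this illustrates only the general file layout: a `*Vertices` section with 1-based ids and quoted labels, followed by an `*Edges` section. The function `write_pajek` and the `edges` dictionary shape are hypothetical.

```python
def write_pajek(edges, path):
    """Write undirected weighted edges {(u, v): weight} as a Pajek .net file."""
    # Collect every node that appears in an edge and give it a 1-based id
    labels = sorted({node for edge in edges for node in edge})
    index = {label: i for i, label in enumerate(labels, start=1)}
    with open(path, "w", encoding="utf-8") as f:
        f.write(f"*Vertices {len(labels)}\n")
        for label in labels:
            f.write(f'{index[label]} "{label}"\n')
        f.write("*Edges\n")
        for (u, v), weight in edges.items():
            f.write(f"{index[u]} {index[v]} {weight}\n")


# Hypothetical co-occurrence counts, for illustration only
write_pajek({("language", "natural"): 2, ("language", "processing"): 1},
            "example_graph.net")
```

The resulting file can be opened in Pajek or in other tools that read the `.net` format, such as Gephi.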