Skip to content

andysingal/br-laws-clustering-nlp

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Clustering & NLP

Brazilian Laws analysis with TF-IDF and K-Means

This repository contains a few NLP and Clustering analysis of a dataset containing ~6400 Brazilian Ordinary Laws. The Source Code is in a Jupyter Notebook file.

Also, read the Medium Article related with this repository.

Main contents:

  • A PT-BR dataset ready-to-use in folder 📂 data

  • Feature Extraction with TF-IDF and Clustering with K-Means

  • TF-IDF visualizations to better data understanding

  • Creation of informative/visual plots like this:

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 80.4%
  • HTML 19.6%