Skip to content

Latest commit

 

History

History
14 lines (9 loc) · 336 Bytes

README.md

File metadata and controls

14 lines (9 loc) · 336 Bytes

Automation in Text Data with Python

This repository stores my exploration on automation related to text data with Python.

Keywords: Natural Language Processing, Web Scraping, PDF Parsing

Scope

  • Text Data Collection
    • Web Scraping
    • API
  • Pre-precessing methods
  • Latent Dirichlet Allocation (LDA) for Topic Modeling