Skip to content

anglilian/oligarchy-project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Oligarchy Project

Image Preprocessing

  • Binarise images
  • Crop images into single columned text
  • Correct Skew

Extract Text

  • OCR with AWS Textract (yielded better results than Tesseract).
  • Correct OCR mistakes with SpellChecker
  • Split text into sections based on each person's name
  • Categorise information based on keywords and patterns

Build Network

  • Group similar locations or institutions
  • Map connections between nodes based on number of shared affiliations
  • Draw network using Gephi

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published