  • Gathers the wiki pages from the database dump
  • Extracts relevant pages
  • Inspects sections for links (currently with the manual method; see older commits for the other two)
    • marko: parses the text as Markdown
    • wikimarkup: converts the text to HTML, then uses beautifulsoup to extract the text
    • manual: rough (but fast and robust) text processing
  • Postprocesses links
  • Caches information
  • Generates an interactive graph using pyvis (`pyvis.network`)
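
The "manual" link inspection and the postprocessing step could look roughly like the sketch below. It is not the repository's actual code: it assumes the pages use MediaWiki-style `[[Target]]` / `[[Target|label]]` links, and the names `extract_links`, `postprocess`, `build_edges` and the sample `pages` dictionary are hypothetical.

```python
import re

# Assumption: links use MediaWiki-style [[Target]] or [[Target|label]] syntax.
WIKI_LINK = re.compile(r"\[\[([^\[\]|]+)(?:\|[^\[\]]*)?\]\]")


def extract_links(text):
    """Return the raw link targets found in one piece of wiki text."""
    return [m.group(1) for m in WIKI_LINK.finditer(text)]


def postprocess(targets, known_pages):
    """Strip anchors, normalize underscores, drop duplicates and unknown pages."""
    cleaned = []
    for target in targets:
        title = target.split("#", 1)[0].strip().replace("_", " ")
        if title and title in known_pages and title not in cleaned:
            cleaned.append(title)
    return cleaned


def build_edges(pages):
    """Turn a {title: text} mapping into a list of (source, target) edges."""
    edges = []
    for title, text in pages.items():
        for target in postprocess(extract_links(text), pages):
            if target != title:  # skip self-links
                edges.append((title, target))
    return edges


# Hypothetical sample data standing in for the extracted pages.
pages = {
    "Alpha": "See [[Beta]] and [[Gamma|the gamma page]], also [[Beta#History]].",
    "Beta": "Back to [[Alpha]]. Unknown: [[Delta]].",
    "Gamma": "No links here.",
}
edges = build_edges(pages)
print(edges)
```

Each title and edge could then be handed to a pyvis `Network` through its `add_node` and `add_edge` methods to produce the interactive graph.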

Reference