Skip to content

jpotts18/ldsconf

Repository files navigation

ldsconf

You may need to create a data folder with the following structure

data
├── conferences
│   ├── html
│   └── txt
├── talks
│   ├── html
│   └── txt
└── words
    └── csv

Download Conference Home Pages

# downloads into data/conferences/html/{Year}-{Month}
$ python conference-downloader.py

Parse Conference Pages to extract Talk

# extracts links into data/conferences/txt/{Year}-{Month}
$ python conference-parser.py

Download Talks

# downloads into data/talks/html/{Year}-{Month}/{file_name}
$ python talk-downloader.py

Parse Talks

# extracts into data/talks/txt/{Year}-{Month}/{file_name}
$ python talk-parser.py
# writes

Run AFINN word polarity on every talk

$ python sentiment.py
# outputs sentiment.csv

About

A Scraper and Textmining tool for fun

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages