python-data-analysis

Python scripts for data analysis, mostly work in progress.

Prerequisites

Python 3

Dependencies

Install dependencies:

pip3 install -r requirements.txt

Run

Change into the nwbib directory:

cd nwbib

Load sample NWBib data from the Lobid API:

python3 nwbib_subjects_load.py
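
As a rough illustration of what loading sample data from the Lobid API can involve, here is a minimal sketch using only the standard library. It is not the code of nwbib_subjects_load.py. The endpoint, the NWBib collection query, the `size`/`from` paging parameters, and the `member`/`title` response keys are assumptions about the API, and the function names are hypothetical.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed endpoint and NWBib query; the real script may use other values.
LOBID_SEARCH_URL = "https://lobid.org/resources/search"
NWBIB_QUERY = 'inCollection.id:"http://lobid.org/resources/HT014176012#!"'


def build_search_url(query=NWBIB_QUERY, size=100, start=0):
    """Build a search URL that asks for one page of JSON results."""
    params = {"q": query, "format": "json", "size": size, "from": start}
    return LOBID_SEARCH_URL + "?" + urlencode(params)


def extract_titles(response):
    """Collect the titles of all records in a parsed JSON response.

    Assumes records sit in a top-level "member" list and carry a "title" key;
    records without a title are skipped.
    """
    return [rec["title"] for rec in response.get("member", []) if "title" in rec]


def fetch_page(size=100, start=0):
    """Download and parse one page of results (needs network access)."""
    with urlopen(build_search_url(size=size, start=start)) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Under these assumptions, `extract_titles(fetch_page(size=10))` would return the titles of the first ten records. Keeping URL building and response parsing separate from the download makes both easy to check without network access.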

Run the classification experiment:

python3 nwbib_subjects_process.py

Run bulk classification (first run takes some time):

python3 nwbib_subjects_bulk.py

Run a pipeline with cross-validation and hyperparameter optimization:

python3 nwbib_subjects_pipeline.py
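
For readers unfamiliar with the idea, a pipeline with cross-validation and hyperparameter optimization could look roughly like the sketch below. It assumes scikit-learn is installed and uses a made-up toy corpus. The choice of vectorizer, classifier, and parameter grid is illustrative and not taken from nwbib_subjects_pipeline.py.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline

# Toy stand-in for titles and subject labels; this is not NWBib data.
texts = [
    "history of the cathedral in Cologne",
    "medieval history of the Rhineland",
    "town history and old archives",
    "industrial history of the Ruhr area",
    "birds and forests of the Eifel",
    "river landscape and nature reserve",
    "forest plants and wildlife guide",
    "nature trails along the river",
]
labels = ["history"] * 4 + ["nature"] * 4

# Chain feature extraction and classification so that both are refit
# on the training part of every cross-validation fold.
pipeline = Pipeline([
    ("tfidf", TfidfVectorizer()),
    ("clf", LogisticRegression()),
])

# Hyperparameters to try; keys follow the "<step>__<parameter>" convention.
param_grid = {
    "tfidf__ngram_range": [(1, 1), (1, 2)],
    "clf__C": [0.1, 1.0, 10.0],
}

# Exhaustive grid search, scoring each combination with 2-fold cross-validation.
search = GridSearchCV(pipeline, param_grid, cv=2)
search.fit(texts, labels)

print(search.best_params_, search.best_score_)
```

After fitting, `search` is refit on all data with the best parameter combination, so `search.predict([...])` can be used directly on new texts.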

Run experiments based on paragraph vectors:

python3 nwbib_doc2vec.py

License

Eclipse Public License 2.0