Skip to content

crsmithdev/canon

master
Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 
 
 
 
 

canon

Exploring the Pāli Canon with Machine Learning

Installation

Use of Anaconda is recommended.

  • Create an Anaconda environment: conda create --name canon python=3
  • Activate the environment: conda activate canon
  • Install Anaconda packages: conda install nltk tensorflow scikit-learn matplotlib
  • Install non-Anaconda packages: pip install bs4

Text Processing

Run python process.py to download the ATI archive (if needed) and process text into data/sentences.py

Analysis

Run python analyze.py to train and evaluate a word vector model on the processed sentences. This will produce some examples for evaluation and save a tsne.png file that contains the t-SNE plot for the results. Subsequent runs will use saved model data, unless the data/model directory is deleted.

About

Exploring the Pāli Canon with Machine Learning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages