dialect_detector

This dialect detector predicts which dialect of Spanish a sentence is from. For the moment, it only distinguishes between Mexican Spanish and other varieties. I trained this model using text data from two sources (childes.talkbank.cmu and radioambulante.org).

To use the dialect detector:

Launch the project in Binder (may take several minutes):
Open the jupyter notebook 'dialect_predictor.ipyn'
Run all cells in the "Intro" section.
In the section "Type sentence, get dialect," type in a sentence and run the cell to get a prediction.

To see the code interactively:

Launch and open the jupyter project in Binder (may take several minutes):
Open the jupyter notebook 'capstone.ipyn'
You can either (i) run all cells (this includes the time-consuming process of scraping and cleaning the training data), or (ii) skip to the section "Data wrangling" > "Full dataset" > "Read in full dataset from disk" (this skips data scraping but still includes model fitting), or (iii) skip to the section "Feature engineering" > "Read in training data from disk"

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.ipynb_checkpoints		.ipynb_checkpoints
Capstone.ipynb		Capstone.ipynb
README.md		README.md
childes.csv		childes.csv
classifier.pkd		classifier.pkd
dialect_detector.ipynb		dialect_detector.ipynb
finaltest.csv		finaltest.csv
radioambulante.csv		radioambulante.csv
raw_data.csv		raw_data.csv
requirements.txt		requirements.txt
test.csv		test.csv
train.csv		train.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.ipynb_checkpoints

.ipynb_checkpoints

Capstone.ipynb

Capstone.ipynb

README.md

README.md

childes.csv

childes.csv

classifier.pkd

classifier.pkd

dialect_detector.ipynb

dialect_detector.ipynb

finaltest.csv

finaltest.csv

radioambulante.csv

radioambulante.csv

raw_data.csv

raw_data.csv

requirements.txt

requirements.txt

test.csv

test.csv

train.csv

train.csv

Repository files navigation

dialect_detector

About

Releases

Packages

Languages

HannahForsythe/dialect_detector

Folders and files

Latest commit

History

Repository files navigation

dialect_detector

About

Resources

Stars

Watchers

Forks

Languages