Comparison of self-supervised deep learning architectures for surname generation.

jbpacker/surnamerator

Surnamerator

Surnamerator is a surname generator and a testbed for comparing network architectures, inspired by Andrej Karpathy's "Neural Networks: Zero to Hero" YouTube series.

A live demo is available at the surnamerator Hugging Face Space.

Getting started

To run the existing models, open app.ipynb in Jupyter (or any notebook-capable IDE) and run it. To train a new GPT model, run lit_surnames.ipynb.

File Structure

  • app.ipynb - Example code for the Hugging Face Gradio app.
  • lit_surnames.ipynb - Most up-to-date training notebook for the transformer architecture.
  • surnamerator.py - Library for network models and utils.
  • first_names.txt, surnames.txt - Datasets.
  • data/ - Dataset creation.
  • prototypes/ - Network architecture prototypes.
  • models/ - Saved weights and configs.

Dataset

The surnames.txt dataset is created in ./data. first_names.txt was downloaded from the makemore repository. Example surname data:

butterly
gawne
mouch
...

The raw surname data came from fivethirtyeight, data.world, and census.gov.
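
As a rough illustration of how a file like surnames.txt can be turned into model input, here is a minimal character-level sketch in the style of makemore. It is not taken from surnamerator.py: the helper names (load_names, build_vocab, encode, decode), the "." boundary token, and the one-lowercase-name-per-line layout are all assumptions made for this example.

```python
from pathlib import Path


def load_names(path):
    """Read one name per line (assumed layout), dropping blank lines."""
    text = Path(path).read_text(encoding="utf-8")
    return [line.strip().lower() for line in text.splitlines() if line.strip()]


def build_vocab(names, boundary="."):
    """Map each character to an integer id; id 0 is the boundary token."""
    # Hypothetical convention: "." marks both the start and end of a name.
    chars = sorted(set("".join(names)) - {boundary})
    stoi = {boundary: 0}
    stoi.update({ch: i + 1 for i, ch in enumerate(chars)})
    itos = {i: ch for ch, i in stoi.items()}
    return stoi, itos


def encode(name, stoi, boundary="."):
    """Wrap a name in boundary tokens and convert it to a list of ids."""
    return [stoi[ch] for ch in boundary + name + boundary]


def decode(ids, itos, boundary="."):
    """Convert ids back to text and strip the boundary tokens."""
    return "".join(itos[i] for i in ids).strip(boundary)


# Inline sample taken from the example data above; with the real file,
# this would be: names = load_names("surnames.txt")
names = ["butterly", "gawne", "mouch"]
stoi, itos = build_vocab(names)
ids = encode("gawne", stoi)
print(ids)
print(decode(ids, itos))
```

Reserving id 0 for the boundary token lets a single symbol signal both "start generating" and "stop generating", which keeps the vocabulary small for short strings like surnames.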
