Release v0.2.0 — Edison Singing · Scottcjn/vintage-voice

VintageVoice v0.2.0 — Edison Singing

F5-TTS fine-tune on Edison wax-cylinder recordings (1900–1925).

Dataset: 569 rows, ~2.26 hours, English Edison cylinders
Training: 6,800 updates, 50 epochs, 2× V100 32 GB, 5h 13m wall time
Base: SWivid/F5-TTS

Where the weights live

🤗 HF branch (this release): AutomatedJanitor/vintage-voice@v0.2.0-edison-singing
🤗 HF main (v0.1.0 transatlantic): AutomatedJanitor/vintage-voice (unchanged)

What this is

A singing model, not a speech model. Mid-training audit confirmed ~60–70% of the Edison cylinder corpus is sung material (vaudeville, parlor songs, opera, lieder). Rather than fight the data, we shipped what the data is: a model that teaches any clean reference voice to perform in 1910s-era theatrical cadence over band-limited 300–3000 Hz wax-cylinder acoustic.

What it is not

For modern-sounding spoken Sophia, use base F5-TTS. v0.3.0 Edison Spoken is on the roadmap but blocked on a different upstream archive (UCSB Spoken Word, LoC sermons/lectures) — the Edison cylinders don't contain enough modern-sounding spoken material.

Quick start

pip install huggingface_hub
huggingface-cli download AutomatedJanitor/vintage-voice \
  --revision v0.2.0-edison-singing \
  --local-dir vintage-voice-edison-singing

License

Code (this repo): MIT
Weights: CC-BY-NC-4.0 (inherited from F5-TTS base)
Training data: Public domain (Internet Archive)

Built on sophiacore-A620I-AX (192.168.0.136). Audit pipeline + dataset prep at /mnt/18tb/vintage_voice_edison_dataset/ on-prem.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.2.0 — Edison Singing

Choose a tag to compare

Sorry, something went wrong.