Update README.md

declare-lab · May 30, 2023 · f586b78 · f586b78
1 parent 1836fa4
commit f586b78
Showing 1 changed file with 2 additions and 2 deletions.
diff --git a/README.md b/README.md
@@ -9,7 +9,7 @@
 
 📣 We are excited to share that Oracle Cloud has sponsored the project Tango.
 
-# Tango Model Family
+## Tango Model Family
 
 | Model Name                 | Model Path                                       |
 |----------------------------|-------------------------------------------------|
@@ -18,7 +18,7 @@
 Tango-Full-FT-Audio-Music-Caps | [https://huggingface.co/declare-lab/tango-full-ft-audio-music-caps](https://huggingface.co/declare-lab/tango-full-ft-audio-music-caps) |
 | Tango-Full | [https://huggingface.co/declare-lab/tango-full](https://huggingface.co/declare-lab/tango-full) |
 
-# Description
+## Description
 
 **TANGO** is a latent diffusion model (LDM) for text-to-audio (TTA) generation. **TANGO** can generate realistic audios including human sounds, animal sounds, natural and artificial sounds and sound effects from textual prompts. We use the frozen instruction-tuned LLM Flan-T5 as the text encoder and train a UNet based diffusion model for audio generation. We perform comparably to current state-of-the-art models for TTA across both objective and subjective metrics, despite training the LDM on a 63 times smaller dataset. We release our model, training, inference code, and pre-trained checkpoints for the research community.