Skip to content

Commit

Permalink
Merge pull request #31 from jhudsl/add-image-to-readme
Browse files Browse the repository at this point in the history
Add image to readme
  • Loading branch information
howardbaek committed Jun 1, 2023
2 parents 36938aa + ce4126c commit f372764
Show file tree
Hide file tree
Showing 3 changed files with 30 additions and 31 deletions.
11 changes: 8 additions & 3 deletions README.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -18,6 +18,11 @@ knitr::opts_chunk$set(
<!-- badges: start -->
<!-- badges: end -->


```{r, echo=FALSE, out.width="35%", out.height="30%", fig.cap="IBM 7094, the First Computer to Sing"}
knitr::include_graphics("man/figures/ibm7094.jpg")
```

The goal of text2speech is to harmonize various text-to-speech engines, including Amazon Polly, Coqui TTS, Google Cloud Text-to-Speech API, and Microsoft Cognitive Services Text to Speech REST API.

With the exception of Coqui TTS, all these engines are accessible as R packages:
Expand Down Expand Up @@ -57,7 +62,7 @@ tts_auth("coqui")
# Google Cloud Text-to-Speech API
tts_auth("google")
# Microsoft Cognitive Services Text to Speech REST API
tts_auth("microsoft")
# tts_auth("microsoft")
```


Expand All @@ -79,8 +84,8 @@ voices_google <- tts_google_voices()
head(voices_google)
# Microsoft Cognitive Services Text to Speech REST API
voices_microsoft <- tts_microsoft_voices()
head(voices_microsoft)
# voices_microsoft <- tts_microsoft_voices()
# head(voices_microsoft)
```

## Convert text to speech
Expand Down
50 changes: 22 additions & 28 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,15 @@
<!-- badges: start -->
<!-- badges: end -->

<div class="figure">

<img src="man/figures/ibm7094.jpg" alt="IBM 7094, the First Computer to Sing" width="35%" height="30%" />
<p class="caption">
IBM 7094, the First Computer to Sing
</p>

</div>

The goal of text2speech is to harmonize various text-to-speech engines,
including Amazon Polly, Coqui TTS, Google Cloud Text-to-Speech API, and
Microsoft Cognitive Services Text to Speech REST API.
Expand Down Expand Up @@ -57,8 +66,7 @@ tts_auth("coqui")
tts_auth("google")
#> [1] TRUE
# Microsoft Cognitive Services Text to Speech REST API
tts_auth("microsoft")
#> [1] TRUE
# tts_auth("microsoft")
```

## Voices
Expand All @@ -81,15 +89,15 @@ head(voices_amazon)
voices_coqui <- tts_coqui_voices()
#> ℹ Test out different voices on the CoquiTTS Demo (<https://huggingface.co/spaces/coqui/CoquiTTS>)
head(voices_coqui)
#> # A tibble: 6 × 3
#> language dataset model_name
#> <chr> <chr> <chr>
#> 1 multilingual multi-dataset your_tts [already downloaded]
#> 2 bg cv vits
#> 3 cs cv vits
#> 4 da cv vits
#> 5 et cv vits
#> 6 ga cv vits
#> # A tibble: 6 × 4
#> language dataset model_name service
#> <chr> <chr> <chr> <chr>
#> 1 multilingual multi-dataset your_tts [already downloaded] coqui
#> 2 bg cv vits coqui
#> 3 cs cv vits coqui
#> 4 da cv vits coqui
#> 5 et cv vits coqui
#> 6 ga cv vits coqui

# Google Cloud Text-to-Speech API
voices_google <- tts_google_voices()
Expand All @@ -100,25 +108,11 @@ head(voices_google)
#> 3 ar-XA-Wavenet-D Arabic ar-XA FEMALE google
#> 4 ar-XA-Wavenet-C Arabic ar-XA MALE google
#> 5 ar-XA-Wavenet-A Arabic ar-XA FEMALE google
#> 6 ar-XA-Standard-C Arabic ar-XA MALE google
#> 6 ar-XA-Standard-B Arabic ar-XA MALE google

# Microsoft Cognitive Services Text to Speech REST API
voices_microsoft <- tts_microsoft_voices()
head(voices_microsoft)
#> voice
#> 1 Microsoft Server Speech Text to Speech Voice (af-ZA, AdriNeural)
#> 2 Microsoft Server Speech Text to Speech Voice (af-ZA, WillemNeural)
#> 3 Microsoft Server Speech Text to Speech Voice (am-ET, AmehaNeural)
#> 4 Microsoft Server Speech Text to Speech Voice (am-ET, MekdesNeural)
#> 5 Microsoft Server Speech Text to Speech Voice (ar-AE, FatimaNeural)
#> 6 Microsoft Server Speech Text to Speech Voice (ar-AE, HamdanNeural)
#> language language_code gender service
#> 1 Afrikaans (South Africa) af-ZA Female microsoft
#> 2 Afrikaans (South Africa) af-ZA Male microsoft
#> 3 Amharic (Ethiopia) am-ET Male microsoft
#> 4 Amharic (Ethiopia) am-ET Female microsoft
#> 5 Arabic (United Arab Emirates) ar-AE Female microsoft
#> 6 Arabic (United Arab Emirates) ar-AE Male microsoft
# voices_microsoft <- tts_microsoft_voices()
# head(voices_microsoft)
```

## Convert text to speech
Expand Down
Binary file added man/figures/ibm7094.jpg
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit f372764

Please sign in to comment.