Say See frontend

Say See is an (outdated) mini app developed at the end of 2022. It was made for my relatives' kids. It leverages multiple third-party AI services.

The main feature is that you speak to it in any language using your device's microphone, and it generates an image based on what you said. The image can be shared to WhatsApp with the click of a button.
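One common way to implement a share button like this is a WhatsApp "click to chat" link. The sketch below is only an illustration, not necessarily how Say See does it: the function name `buildWhatsAppShareLink` is hypothetical, and it assumes the generated image is reachable at a public URL that can be sent as message text.

```typescript
// Hypothetical helper: builds a WhatsApp "click to chat" link that
// pre-fills a message containing an optional caption and the image URL.
// Assumption: the image is hosted at a publicly reachable URL.
function buildWhatsAppShareLink(imageUrl: string, caption: string = ""): string {
  const text = caption ? `${caption} ${imageUrl}` : imageUrl;
  // The message text must be URL-encoded before it goes into the query string.
  return `https://wa.me/?text=${encodeURIComponent(text)}`;
}
```

Opening the returned link (for example from a button's click handler) lets the user pick a chat and send the pre-filled message.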

The app uses the Whisper automatic speech recognition model, accessed through Replicate's API, to transcribe the recorded audio to text. After transcription, it calls the DeepL API to translate the text into English. Finally, it calls OpenAI's DALL-E 2 API, which generates an image based on the transcribed and translated text. The image is then shown in the UI.
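The flow described above can be sketched as a three-stage pipeline. This is a minimal illustration under stated assumptions rather than the app's actual code: the names `Stages`, `SaySeeResult`, and `sayToSee` are hypothetical, and each stage is passed in as a function so the sketch does not guess at any vendor's request format.

```typescript
// Hypothetical stage signatures; the real app calls Replicate (Whisper),
// DeepL, and OpenAI (DALL-E 2), whose request details are not shown here.
interface Stages {
  transcribe: (audio: Uint8Array) => Promise<string>;     // speech to text
  translateToEnglish: (text: string) => Promise<string>;  // any language to English
  generateImage: (prompt: string) => Promise<string>;     // prompt to image URL
}

interface SaySeeResult {
  transcript: string; // text as spoken, in the original language
  prompt: string;     // English text sent to the image generator
  imageUrl: string;   // location of the generated image
}

// Runs the stages in order: transcribe, translate, generate.
async function sayToSee(audio: Uint8Array, stages: Stages): Promise<SaySeeResult> {
  const transcript = (await stages.transcribe(audio)).trim();
  if (transcript.length === 0) {
    // Avoid spending translation and image calls on silence.
    throw new Error("Nothing was transcribed");
  }
  const prompt = await stages.translateToEnglish(transcript);
  const imageUrl = await stages.generateImage(prompt);
  return { transcript, prompt, imageUrl };
}
```

Injecting the stages keeps the ordering logic easy to test with fakes and makes it straightforward to swap any one service for another.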

The app was deployed to Heroku.

Link to Say See backend repository
