Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extracting text from a picture/foto, detecting the language of the text and have SpeechService to pronounce the original text plus translations. #9

Open
wants to merge 12 commits into
base: master
Choose a base branch
from

Conversation

kwyjibo089
Copy link

I thought what I wanted in a travel app would be that I could walk around in a foreign country and I could point my smartphone at any kind of writings or signs and have the app do the translation for me.

What I did was basically making a copy of the AddPicture page and extended the existing ComputerVisionService with the functionality of extracting text out of pictures/fotos.
Screenshot_20200425-124940

When you hit the translate button you'll be redirected to the TranslateResultPage which I extended with some functionality. Whit the help of Azure's SpeechService you can hear the text in the detected language plus some other selected languages.
Screenshot_20200425-124957

This could also be helpful in a restaurant looking at the menu or when you are learning a new language. You will hear how to pronounce the text in the detected language and also of the translations.

I added some more languages just for the fun of it.
Screenshot_20200425-125007~01
Screenshot_20200425-125013~01
Screenshot_20200425-125018~01

I'd still have a lot of ideas on how to extend this app even more. I especially like the idea of adding a real time translator. But also I struggled a lot in the UI/xaml part of it as I am a backend guy. I think I need to invest a bit into some Xamarin training first.
Having said that I loved the idea of this challenge with the existing app we could extend. I definitely could check how something worked first before adding something new.

The documentation on Azure's Cognitive Services is excellent and the code examples are very helpful. One thing to add where I struggled was when I tried using the SpeechService SDK in Xamarin and couldn't get it to work. After looking at the TranslationService I figured out I need to do it over the REST API which eventually worked out fine.
So I think it's sometimes not clear of when you can use the SDK or when you have to use the REST API.

As I am preparing for the AI-100 Exam this challenge came at the right time! Because what's better than real hands on when you are learning about a new topic?!

@codemillmatt
Copy link
Contributor

Thanks for completing the challenge! Fill out this form so we can send you the swag! https://aka.ms/cog-svcs-swag

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants