PixReader

Running

To run this project you will need to create a .env file with the definitions for 'GOOGLE_APPLICATION_CREDENTIALS' and 'MICROSOFT_CV_SUBSCRIPTION_KEY'. These can be obtained by setting up a Google Cloud project and a Microsoft Azure project.

Addtionally in order for the key strokes to register, the program must be run as root.

Inspiration

I wanted to create an assistive technology that was more robust than what's currently available. Web developers don't always provide alt-text for their images and I wanted to create an alternative to depending on alt-text.

What it does

PixReader is an assistive screen reader that reads text and auto-generated captions for images with the help of computer vision!

How I built it

This was built with Microsoft Cognitive Services (Describe Image), Google Cloud Text-to-Speech, and all strung together with python3.

Challenges I ran into

I initially tried to build the screen reader portion of the project with Google's Web Speech API, but there's minimal documentation for it and the only resource I could find was a blog post translated from Polish. Additionally, after implementing Microsoft Cognitive Services for the image caption generation, I tried to use Google's "im2text" model because I was impressed by its superior results. Sadly, all the pre-trained models I found online were sub-par and training it myself, even with my GPU, would take weeks. Not a luxury provided by TAMUhack.

Accomplishments that I'm proud of

I was in an unfortunate seating arrangement where I needed to stay where I was to use the ethernet cable and didn't really have room for teammates to join me, so I'm proud that I finished something so lofty both by myself and so quickly!

What I learned

I learned how to use Google Cloud Text-to-Speech and Microsoft Cognitive Services!

What's next for PixReader

Hopefully an improved model for the image caption generation!

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
README.md		README.md
reader.py		reader.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PixReader

Running

Inspiration

What it does

How I built it

Challenges I ran into

Accomplishments that I'm proud of

What I learned

What's next for PixReader

About

Releases

Packages

Languages

efortanely/PixReader

Folders and files

Latest commit

History

Repository files navigation

PixReader

Running

Inspiration

What it does

How I built it

Challenges I ran into

Accomplishments that I'm proud of

What I learned

What's next for PixReader

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages