Skip to content

A smart assistant for the visually impaired, to make their lives easier

Notifications You must be signed in to change notification settings

shreayan98c/LookoutX

Repository files navigation

LookoutX

License

A smart assistant for the visually impaired, to make their lives easier.

Goal is to guide the visually impaired user via audio by answering their questions and queries and providing them with the response.

Demo

LookoutX: Smart Assistant for the Visually Impaired

YouTube Link: https://youtu.be/YNT_FSTY52A

Technologies used:

  1. DroidCam for streaming video from phone to the system
  2. OpenCV for capturing and processing real-time video
  3. OpenAI Whisper for converting speech to text
  4. OpenFlamingo (LLaMA 7B + CLIP ViT/L-14) Vision-Language Model

Architecture

Architecture

Installation

These instructions assume a working installation of Anaconda.

git clone git@github.com:shreayan98c/LookoutX.git
cd LookoutX
conda env create -f environment.yml

Depending on your desired configuration, you may need to install the PyTorch package separately. This can be done following the instructions on the PyTorch website, in an empty conda environment. Then you can install the remaining packages with:

conda activate lookoutx
pip install -r requirements.txt
pip install git+https://github.com/zphang/transformers.git@68d640f7c368bcaaaecfc678f11908ebbd3d6176
pip install -e .

This is only necessary if the installation from environment.yml fails.

Usage

python main.py train

License

This project is licensed under the terms of the MIT license.

About

A smart assistant for the visually impaired, to make their lives easier

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages