V.I.C.S (Visual Impairment Companion System)

Powered by Nvidia

VICS is an end-to-end learning based visual impairment device that strives to make the world more accessible to the visually impaired. VICS is currently made up of three separate systems that work to bring the world to the user via a suite of information rallying devices including vibration motors and speech enabled assistance.

Running on native machine

Hardware Configuration

sudo /opt/nvidia/jetson-io/jetson-io.py (Configure mic hardware first then configure i2s4 pins)
sudo grep dap4 /sys/kernel/debug/tegra_pinctrl_reg
alsactl init tegrasndt210ref
amixer -c tegrasndt210ref cset name="MVC1 Mux" I2S4
amixer -c tegrasndt210ref cset name="ADMAIF1 Mux" MVC1
amixer -c tegrasndt210ref cset name="MVC1 Vol" 13500
amixer -c tegrasndt210ref cset name='I2S4 codec bit format' 32
amixer -c tegrasndt210ref sget "MVC1 Channels"

Pytorch prerequisites

sudo apt-get install -y --no-install-recommends python3-pip python3-dev libopenblas-dev libopenmpi2 openmpi-bin openmpi-common gfortran && sudo rm -rf /var/lib/apt/lists/*
pip3 install setuptools Cython wheel
pip3 install numpy --verbose

Pytorch for Jetson nano

Instructions from https://forums.developer.nvidia.com/t/pytorch-for-jetson-version-1-7-0-now-available/72048

Install PyTorch pip wheel v1.8.0

wget https://nvidia.box.com/shared/static/p57jwntv436lfrd78inwl7iml6p13fzh.whl -O torch-1.8.0-cp36-cp36m-linux_aarch64.whl
sudo apt-get install python3-pip libopenblas-base libopenmpi-dev 
pip3 install Cython
pip3 install numpy torch-1.8.0-cp36-cp36m-linux_aarch64.whl

Install torchvision

sudo apt-get install libjpeg-dev zlib1g-dev libpython3-dev libavcodec-dev libavformat-dev libswscale-dev
git clone --branch v0.9.0 https://github.com/pytorch/vision torchvision   # see below for version of torchvision to download
cd torchvision
export BUILD_VERSION=0.9.0
sudo python3 setup.py install    
cd ../

Install torchaudio

sudo apt-get update 
sudo apt-get install sox libsox-dev libsox-fmt-all
git clone -b v0.8.0 https://github.com/pytorch/audio torchaudio
cd torchaudio
export BUILD_VERSION=0.8.0
sudo python3 setup.py install

pip install packages

sudo apt-get install portaudio19-dev
pip3 install --user --upgrade -r requirements.txt

Google Cloud Text-to-Speech API key

wget https://dl.google.com/dl/cloudsdk/channels/rapid/downloads/google-cloud-sdk-329.0.0-linux-x86_64.tar.gz
tar -xvzf google-cloud-sdk-329.0.0-linux-x86_64.tar.gz
./google-cloud-sdk/install.sh
(Select mystical-ace-305717 project)
./google-cloud-sdk/bin/gcloud init
source ~/.bashrc
gcloud auth list
gcloud config list project
gcloud services enable texttospeech.googleapis.com
export PROJECT_ID=$(gcloud config get-value core/project)
echo "export PROJECT_ID=$(gcloud config get-value core/project)" > ~/.bashrc
gcloud iam service-accounts keys create ~/key.json --iam-account my-tts-sa@${PROJECT_ID}.iam.gserviceaccount.com
export GOOGLE_APPLICATION_CREDENTIALS=~/key.json
echo export GOOGLE_APPLICATION_CREDENTIALS=~/key.json > ~/.bashrc

Google Tesseract-OCR Engine

sudo apt get install tesseract-ocr
sudo apt-get install libjpeg8 libjpeg62-dev libfreetype6 libfreetype6-dev
wget https://github.com/ZER-0-NE/EAST-Detector-for-text-detection-using-OpenCV/blob/master/frozen_east_text_detection.pb

YOLOV3 weights file

Download the weights file in vics/speech_and_comms/scene_describe_system/weights

wget https://pjreddie.com/media/files/yolov3-tiny.weights

TO-DO

NOTE: All tasks involving the GPS Guidance System should be held off until everything else has been completed. This is only an extension of the project.

General Hardware

Jetson Nano Developer Kit 4-GB
Design rechargeable battery / buy a rechargeable battery

1. Direction Planning System

Software

Implement AlexNet architecture for Direction Planning (direction_plan/neuralnet/model.py)
Write training process for the Direction Planning System (direction_plan/neuralnet/train.py)
Interface vibration motor for halt signalling (direction_plan/devices/vibrator_motor.py)
Finish up mpu6050 code to enable calculation of Yaw angle (direction_plan/devices/mpu6050.py)
Figure out halt signal calculations (direction_plan/collect_data.py)
Build direction_plan interfacing engine (direction_plan/engine.py)
Collect data (data such as stopping at cross walks and roads be very helpful)

Hardware

MPU9250 9-axis IMU sensor
Raspberry Pi Camerasudo apt-get install libjpeg8 libjpeg62-dev libfreetype6 libfreetype6-dev
2x Vibration motors
Get and setup Macro/Wide lens for Pi Camera

2. Speech and Communications Systems (Voice Assistant and Scene Description)

Sofware

Interface with microphone hardware
Interface with speaker hardware
Write object detection model
Build WakeWord model for AI voice assistant (wake word: "Hey VICS")
Build NLP model for taking in audio input and return a classification for the command issued
Using the detected objects find way to create voice responses with the detected objects
Write data collection scripts

Hardware

SPH0645LM4H MEMS Microphone
MAX98357A Amflifier

Product Design

Software

Design CAD for Enclosure using FreeCAD

Hardware

Create chest mount straps
3d print Enclosure

Name		Name	Last commit message	Last commit date
Latest commit History 269 Commits
command_processor		command_processor
describe		describe
devs_and_utils		devs_and_utils
guide		guide
hardware		hardware
read		read
wakeword		wakeword
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
engine.py		engine.py
report_plan.md		report_plan.md
requirements.txt		requirements.txt
vics_logo.gif		vics_logo.gif

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

V.I.C.S (Visual Impairment Companion System)

Running on native machine

Hardware Configuration

Pytorch prerequisites

Pytorch for Jetson nano

pip install packages

Google Cloud Text-to-Speech API key

Google Tesseract-OCR Engine

YOLOV3 weights file

TO-DO

General Hardware

1. Direction Planning System

Software

Hardware

2. Speech and Communications Systems (Voice Assistant and Scene Description)

Sofware

Hardware

Product Design

Software

Hardware

About

Releases

Packages

Languages

License

grok0n/vics

Folders and files

Latest commit

History

Repository files navigation

V.I.C.S (Visual Impairment Companion System)

Running on native machine

Hardware Configuration

Pytorch prerequisites

Pytorch for Jetson nano

pip install packages

Google Cloud Text-to-Speech API key

Google Tesseract-OCR Engine

YOLOV3 weights file

TO-DO

General Hardware

1. Direction Planning System

Software

Hardware

2. Speech and Communications Systems (Voice Assistant and Scene Description)

Sofware

Hardware

Product Design

Software

Hardware

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages