Virtex is an easy-to-use API for pose recognition, built on top of ildoonet's tf-pose-estimation, which lets you estimate and classify poses. You can train on a batch of images, or train poses in real time using the webcam data generator. The classifier uses an XGBoost classifier to train and make predictions.
You need the following dependencies:
- python 3
- tensorflow 1.4.1 or above
- opencv3, protobuf, python3-tk
- slidingwindow
- xgboost
Clone the repo and install the third-party libraries.
$ git clone https://github.com/theneuralbeing/virtex.git
$ cd virtex
$ pip3 install -r requirements.txt
Build the C++ library for post-processing. See: https://github.com/theneuralbeing/virtex/tree/master/tf_pose/pafprocess
$ cd tf_pose/pafprocess
$ swig -python -c++ pafprocess.i && python3 setup.py build_ext --inplace
Alternatively, you can install this repo as a shared package using pip.
$ git clone https://github.com/theneuralbeing/virtex.git
$ cd virtex
$ python setup.py install # Or, `pip install -e .`
The available models are:
- cmu (trained in 656x368)
- mobilenet_thin (trained in 432x368)
- mobilenet_v2_large (trained in 432x368)
- mobilenet_v2_small (trained in 432x368)
The CMU model graphs are too large for git, so they are hosted externally. Download them if you want to use the original CMU model; a download script is provided in the models folder.
$ cd models/graph/cmu
$ bash download.sh
I have written a blog post on Medium detailing the usage of Virtex for real-time pose recognition. Make sure you check it out. The Jupyter notebook associated with the blog can be found here.
Virtex comes with an easy-to-use API which does all the heavy lifting of generating, processing, training, and inference. Pose recognition with Virtex is done in four simple steps:
- Generation
- PreProcessing
- Training
- Inference
Before you start recognising poses, the keypoint data of the poses needs to be processed into a CSV file. The generate_data() function saves each pose's keypoint data as a separate CSV file ready to be processed.
Usage:
from api import generate_data
generate_data(PARAMETERS)
Docstring:
Saves the data of a single pose as a separate csv file which can be later used for preprocessing
PARAMETERS:
mode :- keypoint data can be generated in two ways: 'realtime' mode and 'image' mode
realtime - captures poses in realtime through a webcam
image - captures poses from images in a folder
pose :- name of the pose you are generating data for
folder :- path to the folder if you are using image mode
save_to :- path to the folder to save your csv files
save_as :- name of the csv file you want to save it with
nb_frames :- number of frames or entries you want to capture in realtime mode
model :- only for realtime mode, accepts mobilenet_thin / mobilenet_v2_small / mobilenet_v2_large / cmu. Default is mobilenet_thin
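For context, here is a minimal sketch of how per-frame keypoint data can be flattened into CSV rows. This is an illustration, not Virtex's actual implementation, and the exact column layout is an assumption; tf-pose-estimation estimates 18 COCO-style body keypoints per person, so each frame becomes 36 coordinate values plus the pose label.

```python
import csv
import io

def keypoints_to_row(keypoints, pose):
    """Flatten a list of (x, y) keypoints into one flat row ending with the pose label."""
    row = []
    for x, y in keypoints:
        row.extend([x, y])
    row.append(pose)
    return row

# Two dummy "frames" of 18 COCO-style keypoints each (constant values for brevity)
frames = [[(0.0, 0.0)] * 18, [(0.1, 0.2)] * 18]

buf = io.StringIO()
writer = csv.writer(buf)
for frame in frames:
    writer.writerow(keypoints_to_row(frame, "tpose"))

print(buf.getvalue().splitlines()[0].count(",") + 1)  # 37 columns: 18 * 2 + 1 label
```

In realtime mode, each webcam frame would contribute one such row until nb_frames rows are collected.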
The generate_data() function creates a separate CSV file for each pose. To get the final dataset, you need to combine the CSV files into a single dataframe. This can be achieved using the combine_poses(*args) function from utils.
Usage:
from utils import combine_poses
dataframe = combine_poses(PARAMETERS)
Docstring:
RETURNS the dataframe containing all the poses' data
PARAMETERS:
*args :- The file paths of all the pose csv files.
Example Usage:
data = combine_poses('./data/pose-1.csv','./data/pose-2.csv',.....,'./data/pose-n.csv')
You can pass as many CSV file paths as you need.
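Under the hood, combining poses amounts to concatenating the per-pose tables into one dataset. A minimal sketch with pandas (operating on in-memory DataFrames rather than file paths, to stay self-contained; the real combine_poses reads the CSV files first):

```python
import pandas as pd

def combine_poses_sketch(*frames):
    """Concatenate per-pose DataFrames into one dataset, re-indexing the rows."""
    return pd.concat(frames, ignore_index=True)

# Toy per-pose tables: a couple of coordinate columns plus the pose label
pose1 = pd.DataFrame({"x0": [0.1, 0.2], "y0": [0.3, 0.4], "pose": ["tpose", "tpose"]})
pose2 = pd.DataFrame({"x0": [0.5], "y0": [0.6], "pose": ["squat"]})

data = combine_poses_sketch(pose1, pose2)
print(len(data))  # 3 rows, one per captured frame
```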
The dataframe returned by combine_poses() is now ready to be fed for preprocessing using the preprocess_data(dataframe) function, which splits the data into an array of x features and y labels, encodes the labels, and returns a dictionary containing the mapping from encoded label to string label (e.g. {0: 'pose-1', 1: 'pose-2'}).
Usage:
from api import preprocess_data
x_train, y_train, classes = preprocess_data(PARAMETERS)
Docstring:
RETURNS two arrays, x and y from the dataset and a dictionary, classes which is a mapping
between the encoded integers and the pose name strings
PARAMETERS:
dataframe :- the dataframe containing the keypoint data of all the poses
Additionally, you can save the dataframe to CSV as usual using pandas (e.g. dataframe.to_csv('data.csv', index=False)).
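The preprocessing step above can be sketched as follows. This is an illustrative re-implementation, not Virtex's code, and the column name "pose" and the sorted label ordering are assumptions:

```python
import pandas as pd

def preprocess_sketch(dataframe):
    """Split a combined dataframe into feature array x, encoded label array y,
    and a classes dict mapping encoded integers back to pose names."""
    x = dataframe.drop(columns=["pose"]).to_numpy()
    names = sorted(dataframe["pose"].unique())
    classes = dict(enumerate(names))                    # int -> pose name
    to_int = {name: i for i, name in classes.items()}   # pose name -> int
    y = dataframe["pose"].map(to_int).to_numpy()
    return x, y, classes

df = pd.DataFrame({"x0": [0.1, 0.5], "y0": [0.3, 0.6], "pose": ["tpose", "squat"]})
x, y, classes = preprocess_sketch(df)
print(classes)  # {0: 'squat', 1: 'tpose'}
```

Keeping the classes dict around matters: at inference time it is the only way to turn the classifier's integer predictions back into human-readable pose names.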
The training process is as simple as a single line of code. Currently, the API uses an XGBoost classifier with default hyperparameters, but more models like SVMs, Bayesian models, and neural nets will be added in the future. The train_classifier() function returns the trained model and also saves it by default.
Usage:
from api import train_classifier
classifier = train_classifier(PARAMETERS)
Docstring:
RETURNS trained model
PARAMETERS:
x :- the input array of features
y :- the target array of pose labels
save_as :- the name of the model with which you want to save it
save_to :- directory where you want to save the model to
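A sketch of what train_classifier does: fit a gradient-boosted model on the preprocessed arrays and persist it. To keep the example runnable everywhere, sklearn's GradientBoostingClassifier stands in for XGBClassifier, pickle stands in for whatever serialization Virtex actually uses, and the synthetic data is made up:

```python
import os
import pickle
import tempfile

import numpy as np
from sklearn.ensemble import GradientBoostingClassifier  # stand-in for XGBClassifier

def train_classifier_sketch(x, y, save_as="pose_model", save_to=tempfile.gettempdir()):
    """Fit a gradient-boosted classifier on keypoint features and save it to disk."""
    model = GradientBoostingClassifier()
    model.fit(x, y)
    with open(os.path.join(save_to, save_as + ".pkl"), "wb") as f:
        pickle.dump(model, f)
    return model

# Tiny synthetic "keypoint" dataset: two well-separated poses, four samples each
rng = np.random.default_rng(0)
x = np.vstack([rng.normal(0.0, 0.05, (4, 6)), rng.normal(1.0, 0.05, (4, 6))])
y = np.array([0] * 4 + [1] * 4)

clf = train_classifier_sketch(x, y)
print(clf.score(x, y))  # fits this trivial data perfectly: 1.0
```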
The accuracy of the trained model can be obtained with the cross_val() function, which implements k-fold cross-validation.
Usage:
from api import cross_val
acc_mean, acc_std = cross_val(PARAMETERS)
Docstring:
RETURNS two float values of mean accuracy and standard deviation
PARAMETERS:
model :- the trained classifier model
x :- the input array of features
y :- the target array of pose labels
The predict() function makes it easy to run inference in real time as well as on a batch of images. If you are inferencing on a batch of images, the results are saved for you in a CSV file containing each image name and its predicted pose.
Usage:
from api import predict
predict(PARAMETERS)
Docstring:
Predicts the poses in realtime or on a batch of images
PARAMETERS:
mode :- poses can be predicted in two ways: 'realtime' mode and 'image' mode
realtime - infers poses in realtime through a webcam
image - infers poses from images in a folder
classifier :- path to the saved model
cls_dict :- path to the dictionary object containing the mapping
save_to :- path to the folder to save your results (only for image mode)
save_as :- name of the csv file you want to save the results with (only for image mode)
model :- choose from mobilenet_thin / mobilenet_v2_small / mobilenet_v2_large / cmu. Default is mobilenet_thin
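The final step predict performs per frame (or per image) boils down to: flatten the estimated keypoints, classify them, and map the integer prediction back through the classes dict. A hedged sketch, reusing the stand-in classifier and made-up toy data from the earlier examples:

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier  # stand-in for XGBClassifier

# Train a toy model, as in the earlier training sketch
rng = np.random.default_rng(0)
x = np.vstack([rng.normal(0.0, 0.05, (4, 6)), rng.normal(1.0, 0.05, (4, 6))])
y = np.array([0] * 4 + [1] * 4)
model = GradientBoostingClassifier().fit(x, y)

# The cls_dict produced at preprocessing time (hypothetical pose names)
classes = {0: "squat", 1: "tpose"}

def predict_pose_sketch(model, classes, keypoints):
    """Classify one flattened keypoint vector and map it back to a pose name."""
    encoded = int(model.predict(keypoints.reshape(1, -1))[0])
    return classes[encoded]

print(predict_pose_sketch(model, classes, np.full(6, 1.0)))  # 'tpose'
```

In realtime mode this lookup runs once per webcam frame; in image mode, once per image, with the (image name, pose name) pairs written out as the results CSV.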
The API runs the CLI-based scripts behind the scenes, so you can in fact generate, train, and predict directly using them. The following scripts are available with a CLI:
- run_datagen_directory.py
- run_datagen_webcam.py
- run_train_classifier.py
- run_predict_directory.py
- run_predict_webcam.py
- More Classifiers (SVM, DNN)
- Inference Logging and continuous training to decrease prediction fluctuations during inference and increase stability and overall performance
- Support for Continuous Actions using LSTMs
- Hand Gesture Recognition
- Tracking specific Tagged Persons using Face Recognition
- 3D Pose
Feel free to contribute to this project and help improve it. Raise issues if you find any bugs or have suggestions, and open pull requests to help develop it further and fix bugs. You can also contact me by email at bhavesh.laddagiri1@gmail.com, or on Instagram or Facebook.
- Pose Estimation Code: https://github.com/ildoonet/tf-pose-estimation/