mlfactory is a simple, modular wrapper library for computer vision neural networks (PyTorch and TensorFlow backends). It abstracts several different network architectures and techniques behind a seamless, easy-to-use training interface that takes only a few lines of code.

Following the same modular philosophy, you can also define your own neural network in PyTorch/TensorFlow and pass it to our submodules when you would rather not write the data loaders or the training loop yourself — or vice versa.
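To make the "bring your own network" idea concrete, here is the kind of plain PyTorch model you could hand off to the library's training submodules. This is an illustrative sketch only — the network below is hypothetical and the exact mlfactory handoff call is not shown in this README, so only the PyTorch part is demonstrated:

```python
import torch
import torch.nn as nn

# A plain PyTorch classifier -- an example of the kind of custom network
# you could pass to mlfactory's dataloader/training submodules instead of
# writing your own training loop.
class SmallCNN(nn.Module):
    def __init__(self, num_classes=10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),  # global pooling -> (N, 32, 1, 1)
        )
        self.head = nn.Linear(32, num_classes)

    def forward(self, x):
        return self.head(self.features(x).flatten(1))

model = SmallCNN()
logits = model(torch.randn(4, 3, 64, 64))
print(logits.shape)  # torch.Size([4, 10])
```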
```shell
pip install mlfactory
```
- High-definition mapping with a monocular camera (using monocular depth estimation and the SuperGlue feature extractor)
- Simple and fast visual odometry directly from MOV files, with the pose trajectory output in Open3D
- Easy monocular depth estimation (NYUv2 dataloader)
- Finetune DeepLabv3 for general binary segmentation using fewer than 100 samples
- Polygon annotation tool for creating polygon segmentation masks directly in Colab
- Easy use of SuperGlue neural-network-based image feature matching
- Easy use of holistically-nested edge detection for generic high-level edge detection
- Record gameplay from the screen for behavior-cloning training, and play model policies back using pyautogui. See https://github.com/Homagn/mlfactory/blob/master/datacreators/utils/gameplay_recorder.py
- Dataloader class for images and labels using JSON files (data needs to be saved in folders, each containing images and JSON labels). See https://github.com/Homagn/mlfactory/blob/master/dataloaders/imgjsonloader.py
- Train high-quality, lightweight variational autoencoders on larger-dimension images and custom data (uses subpixel convolution for upsampling). See https://github.com/Homagn/mlfactory/blob/master/examples/variational_encoders/main.py
- Latent GPT models for behavior cloning: uses encoded representations from a VAE to train a GPT model. See https://github.com/Homagn/mlfactory/blob/master/examples/behavior_cloning/train_latent_gpt.py
- Minimal Vision Transformer example for classification. See https://github.com/Homagn/mlfactory/blob/master/models/pytorch/ViT.py
- TimeSformer model (pure attention-based model for video classification). See https://github.com/Homagn/mlfactory/blob/master/models/pytorch/timesformer.py
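The subpixel-convolution upsampling mentioned in the VAE example can be sketched in plain PyTorch with `nn.PixelShuffle`: a convolution produces `r**2` times the target channels, which are then rearranged into an `r`-times larger feature map. This is an illustrative decoder stage under that assumption, not the library's exact implementation:

```python
import torch
import torch.nn as nn

# Subpixel ("pixel shuffle") upsampling block: the conv emits out_ch * r^2
# channels, and PixelShuffle rearranges them into an r-times larger map.
class SubpixelUp(nn.Module):
    def __init__(self, in_ch, out_ch, r=2):
        super().__init__()
        self.conv = nn.Conv2d(in_ch, out_ch * r * r, 3, padding=1)
        self.shuffle = nn.PixelShuffle(r)

    def forward(self, x):
        return self.shuffle(self.conv(x))

up = SubpixelUp(64, 32, r=2)
y = up(torch.randn(1, 64, 16, 16))
print(y.shape)  # torch.Size([1, 32, 32, 32])
```

Compared with transposed convolution, this avoids checkerboard artifacts at upsampling boundaries, which is why it is a common choice for VAE decoders and super-resolution networks.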
- Integrate applications/multiview_sba_recon with the pip project by creating a demo in the examples/ folder
- Integrate the photometric bundle adjustment part of applications/multiview_sba_recon into the deep_modular_reconstruction module
- After integrating applications/multiview_sba_recon, use its estimated poses as input for constructing NeRFs (PyTorch)
- Integrate examples/variational_encoders/main_b.py with the pip project and remove its results folder, which is taking up space
- Integrate examples/behavior_cloning/train_latent_gpt.py and test_latent_gpt.py with the pip project
- Behavior transformers using GPT - https://github.com/notmahi/bet and https://github.com/notmahi/miniBET
- Base transformer model definition + SegFormer model definitions (PyTorch)
- Diffusion models from scratch (PyTorch)
- MIRNet in Keras for low-light image enhancement - https://keras.io/examples/vision/mirnet/
- SRGAN for image super-resolution - https://github.com/sgrvinod/a-PyTorch-Tutorial-to-Super-Resolution
- Differentiable point-cloud generation models - https://github.com/puhsu/point-clouds
- Image animation generation models - https://github.com/snap-research/articulated-animation (code cloned in /ml/misctools/articulated-animation/; run the working demo: demo_outofbox.py)
- FCOSNet in PyTorch - https://github.com/VectXmy/FCOS.Pytorch
- Dataloader for Ouster lidar datasets
- COCO bounding-box dataloader for Colab
- TUM RGB-D dataloader
- ResNet finetuning module for 2D image regression (pose estimation example)
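The latent-GPT behavior cloning listed in the features above follows a simple pattern: frames are encoded into discrete latent tokens, and a causal transformer predicts the next token autoregressively. The model below is a minimal pure-PyTorch sketch of that pattern (all sizes and names are hypothetical, not the repository's implementation):

```python
import torch
import torch.nn as nn

# Minimal causal (GPT-style) transformer over latent tokens: given a
# sequence of VAE-encoded tokens, predict the next token at each step.
class TinyLatentGPT(nn.Module):
    def __init__(self, d_model=64, n_tokens=256, seq_len=32):
        super().__init__()
        self.tok = nn.Embedding(n_tokens, d_model)
        self.pos = nn.Embedding(seq_len, d_model)
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(d_model, n_tokens)

    def forward(self, idx):
        T = idx.shape[1]
        # Upper-triangular mask so each position attends only to the past.
        mask = nn.Transformer.generate_square_subsequent_mask(T)
        x = self.tok(idx) + self.pos(torch.arange(T))
        x = self.blocks(x, mask=mask)
        return self.head(x)  # logits over next latent token

gpt = TinyLatentGPT()
logits = gpt(torch.randint(0, 256, (2, 16)))
print(logits.shape)  # torch.Size([2, 16, 256])
```

Training then reduces to cross-entropy between these logits and the token sequence shifted by one position, exactly as in language modeling.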