Video Prediction and Mask Segmentation

Overview

This project focuses on the challenging tasks of video prediction and mask segmentation. It employs ConvLSTM (Convolutional Long Short-Term Memory) networks for predicting future frames in a video sequence and segments objects by generating masks. This approach can be particularly useful in applications such as video surveillance, autonomous driving, and dynamic scene understanding.

Motivation

This was the final project required in course DS-GA 1008 Deep Learning (with Yann LeCun and Alfredo Canziani). The current plan is to build on top of this to tackle this problem statement with newer approaches in the domain.

Results

Segmenter + ConvLSTM

Segmenter + MSPred

Features

Video Prediction: Uses ConvLSTM to predict future frames based on past sequences.
Mask Segmentation: Segments objects in video frames to understand scene dynamics better.
Customizable Configurations: Offers configuration options for prediction and segmentation tasks.
Dataset Sorting in Prediction: Implements sorting of video folders in PredictionDataset for streamlined data processing.

Project Structure

configs/: Configuration files for prediction and segmentation models.
predictor/: Implementation of the ConvLSTM predictor model.
segmenter/: Implementation of the segmentation model.
utils/: Utility scripts for dataset handling and other common functions.
predict_hidden.py: Script for running predictions with hidden configurations.
requirements.txt: Lists all the dependencies required to run the project.
train_predictor.py: Script for training the ConvLSTM predictor model.
train_segmenter.py: Script for training the segmentation model.

Getting Started

Prerequisites

Ensure you have Python 3.x installed on your system. You can install all the dependencies using:

pip install -r requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Video Prediction and Mask Segmentation

Overview

Motivation

Results

Segmenter + ConvLSTM

Segmenter + MSPred

Features

Project Structure

Getting Started

Prerequisites

About

Releases

Packages

Contributors 3

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
configs		configs
images		images
predictor		predictor
segmenter		segmenter
utils		utils
README.md		README.md
Video_Prediction_and_Segmentation_Report.pdf		Video_Prediction_and_Segmentation_Report.pdf
__init__.py		__init__.py
predict_hidden.py		predict_hidden.py
requirements.txt		requirements.txt
test_dataloader.py		test_dataloader.py
train_predictor.py		train_predictor.py
train_segmenter.py		train_segmenter.py

raishish/video-prediction-segmentation

Folders and files

Latest commit

History

Repository files navigation

Video Prediction and Mask Segmentation

Overview

Motivation

Results

Segmenter + ConvLSTM

Segmenter + MSPred

Features

Project Structure

Getting Started

Prerequisites

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages