- Clone and enter the repository:
git clone git@github.com:AIS-Bonn/OCVP-object-centric-video-prediction.git
cd OCVP-object-centric-video-prediction
- Install all required packages by installing the
conda
environment file included in the repository:
conda env create -f environment.yml
conda activate OCVP
- Download the Obj3D and MOVi-A datasets, and place them under the
datasets
directory. The folder structure should be like:
OCVP
├── datasets/
| ├── Obj3D/
| └── MOViA/
- Obj3D: Donwload and extract this dataset by running the following bash script:
chmod +x download_obj3d.sh
./download_obj3d.sh
- MOViA: Download the MOVi-A dataset to your local disk from the Google Cloud Storage, and preprocess the TFRecord files to extract the video frames and other required metadata by running the following commands:
gsutil -m cp -r gs://kubric-public/tfds/movi_a/128x128/ .
mkdir movi_a
mv 128x128/ movi_a/128x128/
python src/extract_movi_dataset.py
- Download and extract the pretrained models, including checkpoints for the SAVi decomposition and prediction modules:
chmod +x download_pretrained.sh
./download_pretrained.sh