CS5824

Course project for VT CS5824 Spring '23

Setup

Clone the repository: git clone https://github.com/r-ramaraja/CS5824.git
Data collection: Run the data-collection Jupyter notebook to download the image data from Google Open Images and sort the images into their respective categories.
Data preprocessing: Run the data-preprocessing Jupyter notebook to preprocess the data and prepare the images and their associated masks for inpainting model inference.

Clone the LaMa repository: git clone https://github.com/advimman/lama.git
Set up the environment with the required dependencies mentioned here
Change .png to .jpg in the config file given here
Run the inference as per the steps given here for each category. Example: python predict.py --config configs/prediction/default.yaml --input_dir /home/ram/CS5824/images_and_masks/validation/food --output_dir /home/ram/CS5824/output/lama/validation/food
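Since the inference step above has to be repeated for each category, a small helper can generate the commands. The following is only a sketch: it reuses the flags and paths from the example command as written, and the category list is a placeholder (only "food" appears in this README).

```python
from pathlib import PurePosixPath


def build_lama_commands(categories, images_root, output_root,
                        config="configs/prediction/default.yaml"):
    """Return one predict.py argument list per category."""
    commands = []
    for category in categories:
        commands.append([
            "python", "predict.py",
            "--config", config,
            "--input_dir", str(PurePosixPath(images_root) / category),
            "--output_dir", str(PurePosixPath(output_root) / category),
        ])
    return commands


# Placeholder list: extend it with the categories produced by data collection.
CATEGORIES = ["food"]

lama_commands = build_lama_commands(
    CATEGORIES,
    "/home/ram/CS5824/images_and_masks/validation",
    "/home/ram/CS5824/output/lama/validation",
)

# Print the commands so they can be reviewed before running them.
for cmd in lama_commands:
    print(" ".join(cmd))
```

Each list can be passed to `subprocess.run(cmd, check=True)` from inside the cloned LaMa repository once the printed commands look right.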

Clone the Deepfill v2 repository: git clone https://github.com/JiahuiYu/generative_inpainting.git
Set up the environment with the required dependencies mentioned here
Run the inference as per the steps given here for each category. Example: python test.py --image /home/ram/CS5824/images_and_masks/validation/food/000000000139.jpg --mask /home/ram/CS5824/images_and_masks/validation/food/000000000139_mask001.png --output /home/ram/CS5824/output/deepfill/validation/food/000000000139_mask001.png --checkpoint pretrained/states_tf_places2.pth
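Unlike the LaMa command, the Deepfill v2 example above takes a single image and a single mask, so it has to be called once per mask file. The sketch below pairs masks with images and builds those calls. It assumes every mask is named `<image stem>_maskNNN.png` next to `<image stem>.jpg`, which is inferred from the one example path and may not hold for every file. The demo runs against throwaway files in a temporary directory.

```python
import tempfile
from pathlib import Path


def build_deepfill_commands(input_dir, output_dir,
                            checkpoint="pretrained/states_tf_places2.pth"):
    """Pair each '<stem>_maskNNN.png' with '<stem>.jpg' and build a test.py call."""
    input_dir, output_dir = Path(input_dir), Path(output_dir)
    commands = []
    for mask in sorted(input_dir.glob("*_mask*.png")):
        stem = mask.name.rsplit("_mask", 1)[0]
        image = input_dir / f"{stem}.jpg"
        if not image.exists():
            continue  # skip masks that have no matching image
        commands.append([
            "python", "test.py",
            "--image", str(image),
            "--mask", str(mask),
            # As in the example, the output file reuses the mask's file name.
            "--output", str(output_dir / mask.name),
            "--checkpoint", checkpoint,
        ])
    return commands


# Demo on empty placeholder files; real use would point at a category folder.
with tempfile.TemporaryDirectory() as tmp:
    src = Path(tmp)
    for name in ["000000000139.jpg", "000000000139_mask001.png",
                 "000000000139_mask002.png", "orphan_mask001.png"]:
        (src / name).touch()
    demo_commands = build_deepfill_commands(src, "out")
    for cmd in demo_commands:
        print(" ".join(cmd))
```

The mask without a matching image is skipped silently here; logging skipped masks instead may be preferable when checking the preprocessing output.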

Go to the cloned LaMa repository
Change .png to .jpg in the config file given here
Run the evaluation script given here for the inputs and outputs to get the LPIPS, FID, and SSIM metrics. Example: python bin/evaluate_predicts.py configs/eval2_gpu.yaml /home/ram/CS5824/images_and_masks/validation/food /home/ram/CS5824/output/lama/validation/food /home/ram/CS5824/output/lama/food.csv
Repeat the above steps with the same LaMa evaluation script for the Deepfill v2 outputs.
The results from the evaluation done for the report can be found here
Run the results_visualization Jupyter notebook to visualize the results of the inpainting models.
Note that the metrics may differ slightly from those in the report, both because of differences in GPU configuration and because the report used a downsized version of the dataset (1500 images per category) for the evaluation.
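The evaluation command shown above can be generated for both models and every category in one pass. This is a sketch under assumptions: the LaMa paths mirror the example command, while the Deepfill v2 CSV location (`output/deepfill/<category>.csv`) is not stated in this README and simply follows the same layout. The category list is a placeholder.

```python
from pathlib import PurePosixPath


def build_eval_commands(models, categories, images_root, output_root,
                        config="configs/eval2_gpu.yaml", split="validation"):
    """Return one evaluate_predicts.py argument list per (model, category)."""
    commands = []
    for model in models:
        for category in categories:
            inputs = PurePosixPath(images_root) / split / category
            predictions = PurePosixPath(output_root) / model / split / category
            # Assumed layout: one CSV per category under each model's folder.
            report = PurePosixPath(output_root) / model / f"{category}.csv"
            commands.append([
                "python", "bin/evaluate_predicts.py", config,
                str(inputs), str(predictions), str(report),
            ])
    return commands


eval_commands = build_eval_commands(
    models=["lama", "deepfill"],
    categories=["food"],  # placeholder: only "food" appears in the README
    images_root="/home/ram/CS5824/images_and_masks",
    output_root="/home/ram/CS5824/output",
)

for cmd in eval_commands:
    print(" ".join(cmd))
```

The commands are meant to be run from the cloned LaMa repository, since the script and config paths are relative to it.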
