Skip to content

OPVTON is the GAN model I trained that takes a clothing item, a human and generates an image with the clothing item on the human even when there's occlusion in the torso area

Notifications You must be signed in to change notification settings

sethupavan12/OPVTON-Virtual-Try-on-Dissertation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Thesis

This repo contains the RAW code I trained and tested the GAN model that I submitted for my bachelor thesis. Now, more details about the project

Abstract

Recent Image-based Virtual Try On (VTON) methods have gained widespread attention. The task involves generating a person image with an in-shop clothing item. Prior works have successfully addressed this problem including preserving clothing characteristics like logos, text, and patterns. However, occlusion remains a problem to be solved as existing methods produce artifacts when occlusion is present in the person image. This dissertation addresses the problem of occlusion in virtual try-on through proposed Occlusion-Preserving Virtual Try On Network (OP-VTON). At the core of OP-VTON pipeline are a geometric matching module that aligns the clothing item to the pose of the person in the input image and a powerful image generator that takes aligned clothing item along with other guiding information to generate a convincing try-on result. OP-VTON is evaluated qualitatively and quantitatively on VITON dataset and then compared with state-of-the-art works from the virtual try-on domain. The qualitative evaluation revealed that OP-VTON outperforms state-of-the-art models in occlusion scenarios, while the quantitative evaluation demonstrated that OP-VTON outperforms state-of-the-art models in terms of FID and SSIM scores. Additionally, this dissertation also trains a latent diffusion model on VITON and DeepFashion datasets through use of a recent pioneering work called Control Net to enable novel text-guided virtual try-on image generation capabilities without losing the input image identity.

Architecture

image

Qualitative Comparision of OPVTON and other SOTA models

image

Quantitative Comparision of OPVTON and other SOTA models

image

Maintaining

I want to make this repo as clean as possible. Given that this is a bit of a niche field, I have other ideas but sadly no time to make it happen. Will clean the code base and add sensible instructions later.

note: I leftout the controlnet code part from this repo

About

OPVTON is the GAN model I trained that takes a clothing item, a human and generates an image with the clothing item on the human even when there's occlusion in the torso area

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published