style-transfer

The purpose of this project is to combine a content image with a style image. The images used for this project come from various popular paintings and photographs taken from Wikipedia. Specifically, the content images include a photograph of a dancing ballerina, the famous Girl with a Pearl Earring, and the Mona Lisa. At the moment, the style images include paintings by world-renowned artists such as Van Gogh and Picasso.

Currently, the neural style transfer model built for combining styles with content images doesn't run in real time; a single run takes about 20 seconds on average. To avoid wait times for the user, the model has already been run on these combinations of images. In the future, I plan to add a feature that lets users provide and stylize their own content image (and possibly their own style image).

What is Neural Style Transfer?

Before diving into any descriptions of neural style transfer, let's first define style transfer. The original style transfer approach generates a new, stylized image x given a content image p and a style image a. The feature maps of x, p, and a in layer l of a CNN are denoted by F, P, and S, respectively. To help illustrate these variables, N denotes the number of feature maps in layer l. Neural style transfer iteratively generates x by optimizing a content loss and a style loss.
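
As a minimal sketch of the content loss in this notation (plain NumPy for illustration, not this repo's actual model code), with each of the N feature maps of layer l flattened into a row of M spatial activations:

```python
import numpy as np

def content_loss(F, P):
    """Squared-error content loss at layer l.

    F: feature maps of the generated image x, shape (N, M)
    P: feature maps of the content image p, shape (N, M)
    where N is the number of feature maps and M is the number of
    spatial positions per map (height * width, flattened).
    """
    return 0.5 * np.sum((F - P) ** 2)
```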

Why Does Style Transfer Use Gram Matrices?

In Demystifying Neural Style Transfer (Li et al., 2017), it has been shown that matching the Gram matrices of feature maps is equivalent to minimizing the Maximum Mean Discrepancy (MMD) with a second-order polynomial kernel. Thus, the paper argues that the essence of neural style transfer is to generate a new image from white noise by matching its neural activations with the content image and its Gram matrices with the style image.
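
This equivalence is easy to check numerically. In the sketch below (an illustrative NumPy snippet, not code from this repo), the squared Frobenius distance between the two Gram matrices equals M² times the biased MMD² estimate under the kernel k(x, y) = (xᵀy)², where the M columns of F and S are treated as samples:

```python
import numpy as np

rng = np.random.default_rng(0)
N, M = 8, 50                     # N feature maps, M spatial positions
F = rng.standard_normal((N, M))  # stand-in features of the generated image
S = rng.standard_normal((N, M))  # stand-in features of the style image

# Squared Frobenius distance between Gram matrices (the style-loss term)
gram_dist = np.sum((F @ F.T - S @ S.T) ** 2)

# Biased MMD^2 with the second-order polynomial kernel k(x, y) = (x.T y)^2
K_ff = (F.T @ F) ** 2
K_ss = (S.T @ S) ** 2
K_fs = (F.T @ S) ** 2
mmd2 = K_ff.mean() + K_ss.mean() - 2 * K_fs.mean()

assert np.isclose(gram_dist, M**2 * mmd2)  # identical up to rounding
```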

The original algorithm for neural style transfer minimized a cost function equal to the sum of the content loss and the style loss. Here, the content loss measures the difference in content between the content image and the generated image, and the style loss measures the difference in style between the style image and the generated image.
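
In other words, the objective is just a weighted sum of the two losses (a sketch; alpha and beta are the usual content/style trade-off hyperparameters, with values assumed here for illustration):

```python
def total_loss(content_loss_value, style_loss_value, alpha=1.0, beta=1e3):
    """Objective minimized by gradient descent on the generated image's
    pixels; alpha/beta trade off content fidelity against style fidelity."""
    return alpha * content_loss_value + beta * style_loss_value
```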

The style loss function uses the Gram matrix. Specifically, the style loss is the normalized, squared difference between the Gram matrix of the style image and the Gram matrix of the generated image. The Gram matrix captures which features tend to activate together across an image (the correlations between feature maps), but it discards the specific presence or location of those features within the image.
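
Concretely, the Gram matrix and the per-layer style loss can be sketched as follows (NumPy again, using the F/S notation from above; not this repo's implementation):

```python
import numpy as np

def gram_matrix(F):
    """Gram matrix of feature maps F with shape (N, M): entry (i, j) is
    the correlation between feature maps i and j summed over all M
    spatial positions, so location information is discarded."""
    return F @ F.T

def style_loss(F, S):
    """Normalized, squared difference between the Gram matrix of the
    generated image's features F and the style image's features S."""
    N, M = F.shape
    diff = gram_matrix(F) - gram_matrix(S)
    return np.sum(diff ** 2) / (4 * N**2 * M**2)
```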

References

Gatys, L. A., Ecker, A. S., & Bethge, M. (2016). Image Style Transfer Using Convolutional Neural Networks. CVPR 2016.
Li, Y., Wang, N., Liu, J., & Hou, X. (2017). Demystifying Neural Style Transfer. IJCAI 2017.
