Thanks! #21

Open
afiaka87 opened this issue Feb 11, 2022 · 3 comments

Comments

@afiaka87

Thank you for releasing the filtered model. I realize a lot of people have complained about this, but it's a dicey alignment issue unfortunately. That you did the extra work to release something was very cool.

Feel free to close 👍

@aufbakanleitung

So what exactly is it filtered on? I tried a few prompts like "Elon Musk as a saint on Mars in the style of Salvador Dalí", but it just created a picture of an elk against, I guess, a somewhat Martian-colored background.
image

I assume this is because of the filtering: it doesn't know who Elon Musk is. So I'm wondering, what is left, apart from corgis?

@woctezuma

what is it filtered on exactly?

@Arcitec

Arcitec commented Feb 19, 2022

Alright, to sum it up:

  • The paper's model, "GLIDE", is not released.
  • The released model is called "GLIDE (filtered)".
  • The paper's model has roughly 12x more parameters (more intelligence/detail). The small public model has 300 million parameters, compared to the unreleased model's 3.5 billion.
  • The public model can do 256x256. The paper's model can do 256x256 and 512x512.
  • The public model was trained on data filtered to remove all humans, violent objects (weapons etc.), and hate symbols (swastikas etc.). It doesn't know what those look like, so it can't draw them.
  • Here's a comparison between the paper's model (top row of image) and the released model (last row of image): Colab notebook #2 (comment)
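
As a quick sanity check on the size gap in the list above, here is a minimal sketch that computes the ratio between the two parameter counts. The counts are the figures quoted in this comment, not values measured from any checkpoint, and `param_ratio` is just an illustrative helper name.

```python
# Parameter counts as quoted in this thread (assumed, not measured here).
FILTERED_PARAMS = 300_000_000    # released "GLIDE (filtered)" model
FULL_PARAMS = 3_500_000_000      # unreleased model described in the paper


def param_ratio(large: int, small: int) -> float:
    """Return how many times larger `large` is than `small`."""
    return large / small


ratio = param_ratio(FULL_PARAMS, FILTERED_PARAMS)
print(f"The paper's model is about {ratio:.1f}x larger than the released one")
```

So "10x" is a fair order-of-magnitude summary, though the quoted figures work out to a bit more than that.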

So the public model can basically make pretty bad output and no humans.

I guess they're never going to release the detailed model since they'll probably turn it into some product they sell to companies later.

Maybe it'll be like GPT-Neo later, where some open source folks take the paper and re-implement the full model and re-train it from scratch on open source data.
