Feature Request: Support for Meta Chameleon 7B and 34B #7995
Prerequisites

Feature Description
Here are some relevant links:

Edit: According to Meta researcher Armen Aghajanyan, the architecture is actually similar:

Motivation
This would be a great addition to llama.cpp!
The image features look interesting, but it can also simply do Text -> Text and a lot of other combinations:
chameleon.mp4

Comments
Yes! Please do! Also because, as of now, there is no way to run it on CPU only.
yum yum yum :p
Since it uses an architecture similar to Medusa, is self-speculative decoding likely to be supportable at the same time on the inference side? It sounded like it could run without that, but it'd be neat to have that available too.
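For reference, here is a toy sketch of the draft-then-verify loop behind speculative decoding, where Medusa-style heads on the same model would play the role of the cheap drafter. Both functions are hypothetical stand-ins, not anything from the Chameleon or llama.cpp code:

```python
# Toy draft-then-verify loop (greedy verification). In self-speculative
# decoding, Medusa-style heads on the same model act as the drafter.
# draft_tokens and target_token are hypothetical stand-ins.
def draft_tokens(ctx: list[int], k: int) -> list[int]:
    # Cheap proposer: in reality, extra decoding heads or a small model.
    return [(sum(ctx) + i) % 50 for i in range(1, k + 1)]

def target_token(ctx: list[int]) -> int:
    # Full model's greedy next token (stand-in arithmetic).
    return (sum(ctx) * 7 + 1) % 50

def speculative_step(ctx: list[int], k: int = 4) -> list[int]:
    accepted: list[int] = []
    for tok in draft_tokens(ctx, k):
        expected = target_token(ctx + accepted)
        accepted.append(expected)   # the target's token is always kept
        if tok != expected:         # first mismatch ends the step
            break
    return accepted

# A real implementation verifies all k drafted tokens in one batched
# forward pass of the target model; this loop is only illustrative.
print(speculative_step([3, 1, 4]))
```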
Let me know if we can answer any questions about the architecture, inference, etc. Our reference implementation in https://github.com/facebookresearch/chameleon should be clear. Differences from the Llama architecture are minor.
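One difference the Chameleon paper does call out is query-key normalization (qk-norm). A minimal PyTorch sketch of the idea, with illustrative shapes and an assumed per-head LayerNorm; this is not the reference code:

```python
# Minimal sketch of qk-norm: LayerNorm applied per attention head to
# queries and keys before computing attention scores. Shapes and names
# are illustrative, not taken from the reference implementation.
import torch
import torch.nn as nn

batch, seq, n_heads, head_dim = 1, 8, 4, 64

q_norm = nn.LayerNorm(head_dim)  # learned normalization for queries
k_norm = nn.LayerNorm(head_dim)  # learned normalization for keys

q = torch.randn(batch, seq, n_heads, head_dim)
k = torch.randn(batch, seq, n_heads, head_dim)

# Normalize q and k along the head dimension before the dot product,
# which helps stabilize training of large models.
q, k = q_norm(q), k_norm(k)
scores = torch.einsum("bqhd,bkhd->bhqk", q, k) / head_dim**0.5
```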
Considering the VQGAN is public, it should be possible for the image side to be supported as well.
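Since Chameleon represents images as discrete VQGAN codes in the same token stream as text, much of the image path reduces to mapping codebook indices into a reserved slice of the vocabulary. A toy sketch of that mapping; the sizes and offset scheme below are assumptions for illustration, not Chameleon's actual configuration:

```python
# Toy illustration of interleaving VQGAN image codes with text tokens.
# Vocabulary size and codebook size are assumed, not Chameleon's.
TEXT_VOCAB_SIZE = 65_536   # assumed text vocabulary size
CODEBOOK_SIZE = 8_192      # assumed VQGAN codebook size

def image_codes_to_token_ids(codes: list[int]) -> list[int]:
    """Shift codebook indices past the text vocabulary so text and
    image tokens share a single embedding table."""
    assert all(0 <= c < CODEBOOK_SIZE for c in codes)
    return [TEXT_VOCAB_SIZE + c for c in codes]

def token_ids_to_image_codes(token_ids: list[int]) -> list[int]:
    """Inverse mapping: turn generated image tokens back into VQGAN
    codes for the image decoder."""
    return [t - TEXT_VOCAB_SIZE for t in token_ids]

codes = [17, 4095, 8191]                 # toy codes for one image
print(image_codes_to_token_ids(codes))   # [65553, 69631, 73727]
```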
+1000! I'd love to run Chameleon with llama.cpp!
In the interim: is anybody aware of any quantized weights for this model currently? Would love to try the 34B version on my 2x 3090s. Found this issue because I know llama.cpp is a common way to shrink these models down.
This issue was closed because it has been inactive for 14 days since being marked as stale.
Still not supported.
Hi guys, it got merged: #8543
The PR is only for text, not images, right?