Release full TextEncoder

Right now Lens ships with `LensGptOssEncoder` at <https://huggingface.co/microsoft/Lens/tree/main/text_encoder> which is pre-quantized using `mxfp4` method - which is great for *Hopper* architecture, but not-so-great for any other GPU due to lack of native FP4 support.

Ask is to release `text_encoder` in non-quantized `BFloat16` so it can be quantized as needed by different apps.

For example, [SD.Next](https://github.com/vladmandic/sdnext) prefers its own [SDNQ](https://vladmandic.github.io/sdnext-docs/SDNQ-Quantization/) method which among others, has efficient kernels for UINT dtypes and is fully cross-platform compatible (nVidia, AMD, IPEX, etc.)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release full TextEncoder #6

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Release full TextEncoder #6

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions