It would be awesome to have a collection of standard inference methods, e.g. greedy, temperature, top-p, and top-k sampling, and eventually tree / beam search and batched inference once we support them on the backend.
For now, perhaps, it would be best to implement them as standalone functions / functors that take the full model as input and do the inference. Eventually, we'll figure out the best way of integrating them together.
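A standalone functor along these lines might look like the following minimal sketch. The `model` interface here (a callable from a token sequence to next-token logits) is an assumption for illustration, not the project's actual API; `greedy_decode` and `toy_model` are hypothetical names.

```python
import numpy as np

def greedy_decode(model, prefix, max_new_tokens, eos_id=None):
    """Standalone inference function: takes the full model as input.

    `model` is assumed to be any callable mapping a list of token ids
    to a vector of next-token logits (hypothetical interface).
    """
    tokens = list(prefix)
    for _ in range(max_new_tokens):
        logits = model(tokens)
        next_id = int(np.argmax(logits))  # greedy: pick the most likely token
        tokens.append(next_id)
        if eos_id is not None and next_id == eos_id:
            break
    return tokens

# Toy stand-in model: always prefers token (last_token + 1) mod vocab.
def toy_model(tokens, vocab=5):
    logits = np.zeros(vocab)
    logits[(tokens[-1] + 1) % vocab] = 1.0
    return logits

print(greedy_decode(toy_model, [0], 3))  # -> [0, 1, 2, 3]
```

Because the sampler only sees a callable, the same function would work unchanged with a prompt-tuned or LoRA-wrapped model, which is the appeal of keeping these methods standalone for now.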
Roadmap (tentative)
sampling: greedy, top-k, nucleus, etc. -- with obligatory support for prefixes
inference with prompt-tuned model
beam search (requires changes on backend)
... and then, in no particular order,
inference with LoRA / AdaMix
user-defined constraints, other crazy stuff
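For reference, the temperature / top-k / nucleus items above compose naturally into a single logits-filtering step. This is a generic sketch of the standard technique, not this project's implementation; `sample_logits` is a hypothetical helper name.

```python
import numpy as np

def sample_logits(logits, temperature=1.0, top_k=0, top_p=1.0, rng=None):
    """Apply temperature scaling, then optional top-k and nucleus (top-p)
    filtering, then sample one token id from the remaining distribution."""
    rng = rng or np.random.default_rng()
    logits = np.asarray(logits, dtype=np.float64) / max(temperature, 1e-8)
    probs = np.exp(logits - logits.max())  # stable softmax
    probs /= probs.sum()
    if top_k > 0:
        # Zero out everything below the k-th largest probability.
        kth = np.sort(probs)[-top_k]
        probs = np.where(probs >= kth, probs, 0.0)
    if top_p < 1.0:
        # Keep the smallest set of tokens whose cumulative mass is <= top_p,
        # always retaining at least the single most likely token.
        order = np.argsort(probs)[::-1]
        keep = order[np.cumsum(probs[order]) <= top_p]
        if keep.size == 0:
            keep = order[:1]
        mask = np.zeros_like(probs)
        mask[keep] = probs[keep]
        probs = mask
    probs /= probs.sum()
    return int(rng.choice(len(probs), p=probs))
```

With `top_k=1` this degenerates to greedy decoding; with `temperature=1.0, top_k=0, top_p=1.0` it is plain ancestral sampling, so one helper covers the first roadmap item.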