I wrote a very manual LLM inference engine for the initial version of KernelBench.
Two things we should do to integrate it better:
- Use litellm and a .env file so we can support a variety of future models without writing a new backend for each one (sketch below).
- For pass@k and test-time compute settings, use the batch call API to fire requests simultaneously rather than spawning a thread per call (second sketch below).
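A minimal sketch of the first point, assuming litellm and python-dotenv; the `query_llm` helper and the default model name are placeholders, not the existing KernelBench interface:

```python
import litellm
from dotenv import load_dotenv

# Pull provider API keys (OPENAI_API_KEY, ANTHROPIC_API_KEY, ...) from .env
load_dotenv()

def query_llm(prompt: str, model: str = "gpt-4o", temperature: float = 0.0) -> str:
    """Single entry point for any provider litellm supports."""
    response = litellm.completion(
        model=model,  # e.g. "claude-3-5-sonnet-20240620" or "gemini/gemini-1.5-pro"
        messages=[{"role": "user", "content": prompt}],
        temperature=temperature,
    )
    return response.choices[0].message.content
```

Swapping providers then only means changing the `model` string, with keys resolved from the environment.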
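For the second point, a hedged sketch of the pass@k fan-out using litellm's `batch_completion`, which takes one message list per request and handles the parallelism for us; `sample_k`, `k`, and the temperature here are illustrative:

```python
import litellm

def sample_k(prompt: str, k: int = 8, model: str = "gpt-4o") -> list[str]:
    """Draw k independent samples for one problem in a single batch call."""
    batch = [[{"role": "user", "content": prompt}] for _ in range(k)]
    responses = litellm.batch_completion(
        model=model,
        messages=batch,   # one messages list per parallel request
        temperature=0.8,  # nonzero so the k samples actually differ
    )
    return [r.choices[0].message.content for r in responses]
```

If we prefer explicit async instead, `litellm.acompletion` with `asyncio.gather` would give the same fan-out without us managing threads ourselves.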
Will implement with the team: @pythonomar22 @nathanjpaek @AffectionateCurry.