
[Feature] Asynchronous Batch API support #9102

@rayanehmi

Description

What feature would you like to see?

My use case involves using a GEPA-optimized DSPy module for tabular prediction. Since I'm independent (and not rich), I'd like to be able to use asynchronous batch endpoints (e.g., the OpenAI Batch API) natively from DSPy to reduce costs on large datasets.

LiteLLM already supports Batch API endpoints (link to docs). I think it would be nice to have something like a module.abatch() method directly in DSPy to make use of those batch endpoints.
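
To make the proposal concrete, here is a rough sketch of how the API could look from the caller's side. Everything below is hypothetical: abatch does not exist in DSPy today, and its name, signature, and semantics are all up for discussion.

```python
import asyncio
import dspy

# Hypothetical usage of the proposed API -- `abatch` does not exist in DSPy yet.
# Assumes an LM has already been configured via dspy.configure(lm=...).
predict = dspy.Predict("features -> label")

async def main():
    rows = ["age=34, income=52k", "age=61, income=18k"]  # toy tabular rows
    inputs = [{"features": row} for row in rows]

    # Proposed semantics: serialize every underlying LM call into a single
    # provider-side batch job (e.g. the OpenAI Batch API via LiteLLM), poll
    # until the job finishes, and return predictions in input order.
    results = await predict.abatch(inputs)
    for result in results:
        print(result.label)

asyncio.run(main())
```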

Pros:

  • Reduced costs for tasks such as pure inference, evaluation, and optimization.
  • Bypasses standard rate limits with some providers.

Cons:

  • Variable completion time (15s to a few minutes in my tests), so the client has to poll; see the sketch after this list.
  • Tricky to support for complex / multi-call modules.
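
Because completion time is unpredictable, the implementation would need a polling loop around the provider's batch-job lifecycle. A minimal sketch using LiteLLM's batch primitives is below; the call names (acreate_file, acreate_batch, aretrieve_batch, afile_content) come from LiteLLM's batch docs, but treat the exact signatures and return shapes here as assumptions, not a verified implementation.

```python
import asyncio
import litellm

async def run_batch(jsonl_path: str) -> str:
    """Submit a prepared JSONL batch file and poll until it finishes."""
    # Upload the batch input file to the provider.
    file_obj = await litellm.acreate_file(
        file=open(jsonl_path, "rb"),
        purpose="batch",
        custom_llm_provider="openai",
    )
    # Create the batch job against the chat completions endpoint.
    batch = await litellm.acreate_batch(
        completion_window="24h",
        endpoint="/v1/chat/completions",
        input_file_id=file_obj.id,
        custom_llm_provider="openai",
    )
    # Completion time is unpredictable (seconds to hours), so poll with backoff.
    delay = 15
    while True:
        batch = await litellm.aretrieve_batch(
            batch_id=batch.id, custom_llm_provider="openai"
        )
        if batch.status == "completed":
            break
        if batch.status in ("failed", "expired", "cancelled"):
            raise RuntimeError(f"Batch ended with status: {batch.status}")
        await asyncio.sleep(delay)
        delay = min(delay * 2, 300)
    # Download the raw JSONL results; the caller maps lines back to requests
    # via each line's custom_id.
    content = await litellm.afile_content(
        file_id=batch.output_file_id, custom_llm_provider="openai"
    )
    return content.text
```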

Would you like to contribute?

  • Yes, I'd like to help implement this.

Additional Context

I have a bloated-but-working implementation (from 5.1-Codex) on my fork, along with a demo notebook. As of this writing, it only works for single-call modules.
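
Single-call modules are the tractable case because each module invocation maps to exactly one chat-completion request, i.e. one line of the batch input JSONL. A toy sketch of that mapping (the JSONL schema is the documented OpenAI Batch API input format; the prompt content and model name are placeholders):

```python
import json

def to_batch_line(custom_id: str, messages: list[dict]) -> str:
    # One request per line, in the OpenAI Batch API input format.
    # `custom_id` is how results get re-associated with inputs afterwards.
    return json.dumps({
        "custom_id": custom_id,
        "method": "POST",
        "url": "/v1/chat/completions",
        "body": {"model": "gpt-4o-mini", "messages": messages},
    })

with open("batch_input.jsonl", "w") as f:
    for i, row in enumerate(["age=34, income=52k", "age=61, income=18k"]):
        messages = [{"role": "user", "content": f"Predict the label for: {row}"}]
        f.write(to_batch_line(f"example-{i}", messages) + "\n")
```

Multi-call modules are harder because later calls depend on earlier outputs, which would require chaining multiple batch rounds.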

I'd absolutely love to contribute to a proper implementation with some guidance!

Metadata

Labels: enhancement (New feature or request)