
Conversation

cdoern (Contributor) commented Aug 14, 2025

What does this PR do?

Add complete batches API implementation with protocol, providers, and tests:

Core Infrastructure:

  • Add batches API protocol using OpenAI Batch types directly (see the sketch after this list)
  • Add Api.batches enum value and protocol mapping in resolver
  • Add OpenAI "batch" file purpose support
  • Include proper error handling (ConflictError, ResourceNotFoundError)
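
To make the shape of the API concrete, here is a minimal sketch of what such a protocol could look like, assuming the `Batch` type exported by the `openai` Python package; the method names and parameters are illustrative, not the exact signatures added by this PR.

```python
# Illustrative sketch only: method names and parameters are assumptions,
# not the exact protocol surface added by this PR.
from typing import Protocol

from openai.types import Batch  # the OpenAI Batch type, used directly


class Batches(Protocol):
    async def create_batch(
        self,
        input_file_id: str,
        endpoint: str,
        completion_window: str,
        metadata: dict[str, str] | None = None,
    ) -> Batch: ...

    async def retrieve_batch(self, batch_id: str) -> Batch: ...

    async def cancel_batch(self, batch_id: str) -> Batch: ...

    async def list_batches(
        self, after: str | None = None, limit: int = 20
    ) -> list[Batch]: ...
```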

Reference Provider:

  • Add ReferenceBatchesImpl with full CRUD operations (create, retrieve, cancel, list)
  • Implement background batch processing with configurable concurrency (see the concurrency sketch below)
  • Add SQLite KVStore backend for persistence
  • Support /v1/chat/completions endpoint with request validation
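
As a rough illustration of how per-batch concurrency could be bounded during background processing (the class, method names, and request shape below are hypothetical, not the provider's actual code):

```python
# Hypothetical sketch of concurrency-limited background processing;
# names and structure are illustrative, not the provider's actual code.
import asyncio


class BatchWorker:
    def __init__(self, max_concurrent_requests_per_batch: int = 8) -> None:
        # One semaphore per batch caps how many requests are in flight at once.
        self._semaphore = asyncio.Semaphore(max_concurrent_requests_per_batch)

    async def _process_request(self, request: dict) -> dict:
        # Placeholder for the real /v1/chat/completions call.
        await asyncio.sleep(0)
        return {"custom_id": request.get("custom_id"), "response": None}

    async def process_batch(self, requests: list[dict]) -> list[dict]:
        async def bounded(req: dict) -> dict:
            async with self._semaphore:
                return await self._process_request(req)

        # Requests run concurrently, but never more than the configured limit.
        return list(await asyncio.gather(*(bounded(r) for r in requests)))
```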

Comprehensive Test Suite:

  • Add unit tests for provider implementation with validation (an example appears below)
  • Add integration tests for end-to-end batch processing workflows
  • Add error handling tests for validation, malformed inputs, and edge cases
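
A hedged example of the kind of validation test such a suite might contain, assuming pytest-asyncio, a hypothetical `batches_provider` fixture, and a `ValueError` on invalid input; the real tests under tests/unit/providers/batches may differ:

```python
# Illustrative pytest sketch; fixture name, error type, and method signature
# are assumptions, not copied from the real test suite.
import pytest


@pytest.mark.asyncio
async def test_create_batch_rejects_unknown_endpoint(batches_provider):
    # The reference provider supports /v1/chat/completions, so an
    # unsupported endpoint should fail request validation.
    with pytest.raises(ValueError):
        await batches_provider.create_batch(
            input_file_id="file-123",
            endpoint="/v1/unknown",
            completion_window="24h",
        )
```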

Configuration:

  • Add max_concurrent_batches and max_concurrent_requests_per_batch options (see the config sketch below)
  • Add provider documentation with sample configurations
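
The two options could be surfaced roughly as a Pydantic config model; the field names come from this PR's description, but the defaults and bounds below are assumptions:

```python
# Illustrative config sketch; defaults and validation here are assumptions.
from pydantic import BaseModel, Field


class ReferenceBatchesConfig(BaseModel):
    # How many batches may be processed in the background at once.
    max_concurrent_batches: int = Field(default=1, ge=1)
    # How many requests within a single batch may run concurrently.
    max_concurrent_requests_per_batch: int = Field(default=8, ge=1)
```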

Test Plan

Test with:

```
$ uv run llama stack build --image-type venv --providers inference=YOU_PICK,files=inline::localfs,batches=inline::reference --run &
$ LLAMA_STACK_CONFIG=http://localhost:8321 uv run pytest tests/unit/providers/batches tests/integration/batches --text-model YOU_PICK
```

addresses #3066

meta-cla bot added the CLA Signed label on Aug 14, 2025
cdoern (Author) commented Aug 14, 2025

opening this to see if this re-records

ashwinb added the re-record-tests label ("Spin up ollama, inference and record responses for later use") on Aug 14, 2025
cdoern (Author) commented Aug 14, 2025

@mattf FYI these are the same failures I see locally when recording or replaying

ashwinb (Contributor) commented Aug 14, 2025

> opening this to see if this re-records

I added the label now, let's see.

ashwinb (Contributor) commented Aug 14, 2025

Failed. My workflow sucks. Let me look into it in a bit.

cdoern (Author) commented Aug 14, 2025

Ah, the checkout mechanism is wrong. @ashwinb I think just this #3154 will do it, right?

cdoern (Author) commented Aug 14, 2025

which is just the basic checkout mechanism without a ref

ashwinb (Contributor) commented Aug 14, 2025

@cdoern the important issue is that you want to be able to push back to the ref. That needs you to specify the ref. But this can only work in the parent repo (so someone who has write access), not in a fork. I am researching more, but it appears all of this is possible only for repo maintainers right now.

ashwinb (Contributor) commented Aug 14, 2025

OK I learnt a few things. This thing is only possible if:

  • we create a special app within the llamastack app for this purpose
  • this workflow is provided the access token of this app
  • the app is installed by the user in the fork under which the PR was generated

For now, maybe we just need to do this from the maintainers' side, or have a simple script which anyone can run locally themselves (scripts/integration-test.sh is almost there).

cdoern (Author) commented Aug 14, 2025

I can run the recordings locally; the error is the same as the replay ones here.

cdoern (Author) commented Aug 14, 2025

will keep digging

ashwinb (Contributor) commented Aug 15, 2025

Closing this, will take over Matt's PR and commit appropriately.

ashwinb closed this Aug 15, 2025