Batch
Process multiple requests asynchronously at lower cost
How It Works
Input File Format
{"custom_id": "request-1", "method": "POST", "url": "/v1/chat/completions", "body": {"model": "gpt-4.1-mini", "messages": [{"role": "user", "content": "Hello"}]}}
{"custom_id": "request-2", "method": "POST", "url": "/v1/chat/completions", "body": {"model": "gpt-4.1-mini", "messages": [{"role": "user", "content": "World"}]}}any_llm.create_batch()
any_llm.create_batch()def create_batch(
provider: str | LLMProvider,
input_file_path: str,
endpoint: str,
*,
completion_window: str = "24h",
metadata: dict[str, str] | None = None,
api_key: str | None = None,
api_base: str | None = None,
client_args: dict[str, Any] | None = None,
**kwargs: Any,
) -> Batchany_llm.acreate_batch()
any_llm.acreate_batch()Parameters (create)
Parameter
Type
Default
Description
any_llm.retrieve_batch()
any_llm.retrieve_batch()any_llm.aretrieve_batch()
any_llm.aretrieve_batch()Parameters (retrieve)
Parameter
Type
Default
Description
any_llm.cancel_batch()
any_llm.cancel_batch()any_llm.acancel_batch()
any_llm.acancel_batch()any_llm.list_batches()
any_llm.list_batches()any_llm.alist_batches()
any_llm.alist_batches()Parameters (list)
Parameter
Type
Default
Description
Usage
Last updated