
Add Google Gemini support with async processing#29

Merged
agamm merged 16 commits into main from copilot/fix-6
Jul 27, 2025
Conversation

Contributor

Copilot AI commented Jul 26, 2025

This PR adds comprehensive support for Google Gemini models to batchata, implementing Google's native batch API for true batch processing with 50% cost savings.

Key Features

🤖 5 Gemini Models Supported

  • gemini-2.5-flash - High-performance model with fast processing
  • gemini-2.5-pro - Premium model with advanced capabilities
  • gemini-1.5-pro - High-performance model with 2M context window
  • gemini-1.5-flash - Fast model with 1M context window
  • gemini-1.5-flash-8b - Lightweight version with 1M context

Native Batch Processing

Uses Google's official batch API with inline requests for true batch processing:

from batchata import Batch

batch = Batch(results_dir="./output")
    .set_default_params(model="gemini-2.5-flash")
    .add_cost_limit(usd=5.0)

# Add jobs just like other providers
batch.add_job(prompt="Analyze sentiment", response_model=SentimentAnalysis)
run = batch.run()
results = run.results()

🔧 Full Integration

  • Batch API: Uses Google's google.genai SDK with inline batch requests
  • Cost Savings: 50% batch discount on all requests
  • Token Counting: Integration with Google's official token counting API
  • Structured Output: Full Pydantic model support with JSON schema conversion (see the sketch after this list)
  • File Processing: PDF, TXT, and image support (native PDF processing)
  • Error Handling: Comprehensive validation and error reporting
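
To make the structured-output and token-counting bullets concrete, here is a minimal sketch that calls the google.genai SDK directly, outside of batchata. The model name matches the examples above; the prompt and class name are illustrative, and response.parsed / count_tokens are SDK features, not batchata APIs:

from pydantic import BaseModel
from google import genai
from google.genai import types

class SentimentAnalysis(BaseModel):
    sentiment: str
    confidence: float

client = genai.Client()  # reads GOOGLE_API_KEY from the environment

# Structured output: the SDK converts the Pydantic model into a JSON schema.
response = client.models.generate_content(
    model="gemini-2.5-flash",
    contents="Analyze sentiment: 'I love this product!'",
    config=types.GenerateContentConfig(
        response_mime_type="application/json",
        response_schema=SentimentAnalysis,
    ),
)
print(response.parsed)  # parsed into a SentimentAnalysis instance

# Token counting: Google's official API, useful for pricing estimates.
token_info = client.models.count_tokens(
    model="gemini-2.5-flash",
    contents="Analyze sentiment: 'I love this product!'",
)
print(token_info.total_tokens)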

📚 Documentation & Examples

  • Updated README with Gemini provider comparison table
  • New example script demonstrating Gemini usage
  • Clear documentation of batch processing capabilities

Implementation Details

The Gemini provider implements Google's batch API using the following steps (a sketch follows the list):

  1. Inline Requests: Creates batch jobs using InlinedRequest format
  2. Batch States: Handles Google's batch job states (JOB_STATE_* and BATCH_STATE_*)
  3. Token Counting: Uses Google's official count_tokens API for accurate pricing
  4. Result Parsing: Extracts responses from dest.inlined_responses
  5. Cost Calculation: Applies 50% batch discount for all batch requests
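
For reference, these steps map roughly onto the following google.genai call sequence. This is a sketch assembled from Google's batch documentation, not batchata's actual provider code; the prompt, display name, and polling interval are placeholders:

import time

from google import genai

client = genai.Client()

# 1. Inline requests: one dict per job, in Google's batch request format.
inline_requests = [
    {"contents": [{"role": "user", "parts": [{"text": "Analyze sentiment: 'I love this!'"}]}]},
]

# 2. Create the batch job from the inlined requests.
batch_job = client.batches.create(
    model="models/gemini-2.5-flash",
    src=inline_requests,
    config={"display_name": "sentiment-batch"},
)

# 3. Poll by state name until the job reaches a terminal JOB_STATE_*.
terminal = {"JOB_STATE_SUCCEEDED", "JOB_STATE_FAILED", "JOB_STATE_CANCELLED"}
while getattr(batch_job.state, "name", str(batch_job.state)) not in terminal:
    time.sleep(30)
    batch_job = client.batches.get(name=batch_job.name)

# 4. Result parsing: successful responses arrive on dest.inlined_responses.
if batch_job.state.name == "JOB_STATE_SUCCEEDED":
    for item in batch_job.dest.inlined_responses:
        print(item.response.text)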

Testing

Added comprehensive test suite with 51 test cases covering:

  • Provider initialization and batch creation
  • Google-specific validation constraints
  • Token counting API integration
  • Real batch response format handling
  • Error scenarios and edge cases
  • Structured output parsing
  • File handling and limitations

All existing tests continue to pass, ensuring no regression in OpenAI/Anthropic functionality.

Usage

# Set your API key
export GOOGLE_API_KEY="your-gemini-api-key"

# Use like any other provider
from batchata import Batch
from pydantic import BaseModel

class Analysis(BaseModel):
    sentiment: str
    confidence: float

batch = Batch(results_dir="./results")
    .set_default_params(model="gemini-2.5-flash", temperature=0.3)

batch.add_job(prompt="Analyze: 'I love this product!'", response_model=Analysis)
run = batch.run()

Batch Discount: Gemini provides 50% cost savings through Google's batch API, matching OpenAI and Anthropic batch pricing.

Fixes #6.

Copilot AI and others added 2 commits July 26, 2025 15:52
Co-authored-by: yammesicka <1492939+yammesicka@users.noreply.github.com>
Co-authored-by: yammesicka <1492939+yammesicka@users.noreply.github.com>
Copilot AI changed the title from [WIP] Add Gemini support to Add Google Gemini support with async processing Jul 26, 2025
Copilot AI requested a review from yammesicka July 26, 2025 15:58
@yammesicka
Collaborator

You claim that Gemini doesn't have Batch processing. Can you please take a look at https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/batch-prediction-from-cloud-storage#create-batch-job-python_genai_sdk?

Contributor Author

Copilot AI commented Jul 26, 2025

You claim that Gemini doesn't have Batch processing. Can you please take a look at [https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/batch-prediction-from-cloud-storage#create-batch-jo...

You're absolutely right! I missed that Google now provides batch processing through the google.genai library and Vertex AI. Looking at the documentation, Google does support true batch processing with client.batches.create(). I'll update the implementation to use proper batch processing instead of async simulation. Thank you for pointing this out - this will enable actual cost savings and better performance.

You claim that Gemini doesn't have Batch processing. Can you please take a look at [https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/batch-prediction-from-cloud-storage#create-batch-jo...

I've implemented true Google Gemini batch processing using the google.genai.batches.create() API. The changes include:

  • Real batch processing via Google's batch API instead of async simulation
  • 50% batch discount support (updated from 0.0 to 0.5)
  • Removed the deprecated google.generativeai dependency
  • Updated all tests to use the new API (13/13 passing)
  • Fixed the documentation, removing the false claim that Gemini has no batch processing

Key commit: df21282 implements the full conversion to true batch processing with proper cost savings. Thanks for pointing me to the correct documentation!

Warning

Firewall rules blocked me from connecting to one or more addresses

I tried to connect to the following addresses, but was blocked by firewall rules:

  • generativelanguage.googleapis.com
    • Triggering command: python /tmp/test_batch_api.py (dns block)

If you need me to access, download, or install something from one of these locations, you can either:

@agamm
Owner

agamm commented Jul 26, 2025

Look at the openai provider as you need to work like it does, you first need to upload files (if files are supported), and then reference it: https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/batch-prediction-from-cloud-storage#create-batch-job-python_genai_sdk

1 similar comment
@yammesicka
Copy link
Collaborator

Look at the openai provider as you need to work like it does, you first need to upload files (if files are supported), and then reference it: https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/batch-prediction-from-cloud-storage#create-batch-job-python_genai_sdk

Copilot AI and others added 2 commits July 26, 2025 16:18
Co-authored-by: yammesicka <1492939+yammesicka@users.noreply.github.com>
Co-authored-by: yammesicka <1492939+yammesicka@users.noreply.github.com>
Contributor Author

Copilot AI commented Jul 26, 2025

Look at the openai provider as you need to work like it does, you first need to upload files (if files are supported), and then reference it: https://cloud.google.com/vertex-ai/generative-ai/docs/mult...

Implemented proper file upload pattern following Google's batch API documentation. The Gemini provider now:

  1. Uploads files first using client.files.upload() (like OpenAI uploads JSONL)
  2. References uploaded files using FileData objects in batch requests
  3. Cleans up uploaded files after batch completion/cancellation

This follows the same pattern as the OpenAI provider but adapted for Google's API structure. Commit 16906b4 implements the full conversion to proper batch processing with file upload first.

Look at the openai provider as you need to work like it does, you first need to upload files (if files are supported), and then reference it: https://cloud.google.com/vertex-ai/generative-ai/docs/mult...

Fixed! The Gemini provider now properly uploads files first using client.files.upload() then references them in batch requests using FileData objects, following the exact pattern of the OpenAI provider.

Key changes in commit 16906b4:

  • Upload files to Google storage before creating batch
  • Reference uploaded files instead of embedding content
  • Clean up uploaded files after batch completion
  • Maintain same interface as other providers

All tests passing (13/13) with proper batch file handling.
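
The upload-then-reference pattern described here looks roughly like the following with the google.genai SDK. A sketch under stated assumptions: the file path and prompt are placeholders, and the exact request shape batchata builds (via FileData objects) may differ:

from google import genai

client = genai.Client()

# 1. Upload the file first (the analogue of OpenAI's JSONL upload step).
uploaded = client.files.upload(file="report.pdf")

# 2. Reference the uploaded file by URI instead of embedding its bytes.
request = {
    "contents": [{
        "role": "user",
        "parts": [
            {"text": "Summarize this document."},
            {"file_data": {"file_uri": uploaded.uri, "mime_type": uploaded.mime_type}},
        ],
    }]
}

# 3. Clean up the uploaded file after batch completion or cancellation.
client.files.delete(name=uploaded.name)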

@agamm agamm marked this pull request as ready for review July 26, 2025 23:58
@agamm
Owner

agamm commented Jul 26, 2025

@yammesicka Ready for review :)

@yammesicka yammesicka requested a review from Copilot July 27, 2025 01:53
Contributor

Copilot AI left a comment


Pull Request Overview

This PR adds comprehensive Google Gemini support to batchata, implementing Google's native batch API for true batch processing with 50% cost savings. The implementation includes support for 5 Gemini models, structured output with Pydantic models, file processing capabilities, and comprehensive error handling.

Key Changes:

  • Added complete Gemini provider with Google's batch API integration
  • Implemented support for 5 Gemini models (gemini-2.5-pro, gemini-2.5-flash, etc.)
  • Added comprehensive test suite with 51 test cases covering all Gemini functionality

Reviewed Changes

Copilot reviewed 16 out of 17 changed files in this pull request and generated 2 comments.

Summary of changes per file:

  • batchata/providers/gemini/: New Gemini provider implementation with batch API support
  • tests/providers/gemini/: Comprehensive test suite for Gemini provider functionality
  • batchata/providers/provider_registry.py: Registration of the Gemini provider in the system
  • batchata/core/job.py: Updated job validation to allow prompt-only jobs
  • pyproject.toml: Added Google dependencies and new example script
  • README.md: Updated documentation with Gemini provider comparison
  • examples/simple_demo.py: Updated demo to use a Gemini model
Comments suppressed due to low confidence (6)

batchata/providers/gemini/models.py:9

  • The model version 'gemini-2.5-pro' may not exist. Based on my knowledge cutoff, Google's latest Gemini models were version 1.5. Please verify that version 2.5 models are actually available in the Google API.
    "gemini-2.5-pro": ModelConfig(

batchata/providers/gemini/models.py:20

  • The model version 'gemini-2.5-flash' may not exist. Based on my knowledge cutoff, Google's latest Gemini models were version 1.5. Please verify that version 2.5 models are actually available in the Google API.
    "gemini-2.5-flash": ModelConfig(

batchata/providers/gemini/models.py:31

  • The model version 'gemini-2.5-flash-lite' may not exist. Based on my knowledge cutoff, Google's latest Gemini models were version 1.5. Please verify that version 2.5 models are actually available in the Google API.
    "gemini-2.5-flash-lite": ModelConfig(

batchata/providers/gemini/models.py:42

  • The model version 'gemini-2.0-flash' may not exist. Based on my knowledge cutoff, Google's latest Gemini models were version 1.5. Please verify that version 2.0 models are actually available in the Google API.
    "gemini-2.0-flash": ModelConfig(

batchata/providers/gemini/models.py:53

  • The model version 'gemini-2.0-flash-lite' may not exist. Based on my knowledge cutoff, Google's latest Gemini models were version 1.5. Please verify that version 2.0 models are actually available in the Google API.
    "gemini-2.0-flash-lite": ModelConfig(

tests/providers/gemini/test_models.py:13

  • The test asserts that specific model versions exist, but these versions may not be real (based on my knowledge cutoff). This test will fail if the model versions don't actually exist in Google's API.
        assert "gemini-2.5-flash" in GEMINI_MODELS

"pydantic>=2.0",
"pypdf>=5.8.0",
"rich>=14.0.0",
"pytest-asyncio>=1.1.0",

Copilot AI Jul 27, 2025


The pytest-asyncio dependency is added to the main dependencies but should be in the dev dependencies group since it's only needed for testing.

Suggested change: remove the line

"pytest-asyncio>=1.1.0",

Collaborator

@yammesicka yammesicka left a comment


^^


# Get accurate token count using Google's API
input_tokens = self._count_tokens(job)
output_tokens = job.max_tokens or 1000
Collaborator


or 1000? 😂 Should we reconsider this one?

Contributor Author


Replaced the questionable or 1000 fallback with a proper constant FALLBACK_TOKENS_ON_ERROR = 1000 for clarity and maintainability. Commit 5f6ea4e
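
A minimal sketch of that change; the constant name comes from the reply, while the helper wrapped around it is assumed:

# Named constant replacing the bare `or 1000` fallback (name per the reply above).
FALLBACK_TOKENS_ON_ERROR = 1000

def estimate_output_tokens(max_tokens: int | None) -> int:
    """Use the job's max_tokens when set, else fall back to the named default."""
    return max_tokens or FALLBACK_TOKENS_ON_ERROR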


model_config = self.get_model_config(job.model)

if job.file and not model_config.supports_files:
Collaborator


This one should probably also validate the filetype(?), or at least throw a warning

Contributor Author


Added file type validation that checks against the model's supported file_types and raises clear validation errors for unsupported formats. Commit 5f6ea4e
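
A sketch of the validation described, assuming the model config exposes a set of supported extensions; the set contents and function name here are illustrative, not batchata's actual code:

from pathlib import Path

# Illustrative values; the real set would come from the model's ModelConfig.
SUPPORTED_FILE_TYPES = {".pdf", ".txt", ".jpg", ".jpeg", ".png", ".gif", ".webp"}

class ValidationError(Exception):
    """Raised when a job violates a model's constraints."""

def validate_file_type(path: Path, model: str) -> None:
    suffix = path.suffix.lower()
    if suffix not in SUPPORTED_FILE_TYPES:
        raise ValidationError(f"Model '{model}' does not support '{suffix}' files")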

raise ValidationError(f"Model '{job.model}' does not support structured output")

if job.messages:
contents, _ = prepare_messages(job)
Collaborator


Not a biggie, but we might want to separate preparing the message from this function for purity

Contributor Author


Good point about separating concerns. The message preparation could be extracted to maintain better function purity. For now, keeping it integrated to minimize changes.

# Handle state by name to avoid enum validation issues
state_name = getattr(batch_job.state, 'name', str(batch_job.state))

if state_name == 'JOB_STATE_SUCCEEDED':
Collaborator


Should probably be consts

Contributor Author


Added constants for job states: JOB_STATE_SUCCEEDED, JOB_STATE_FAILED, JOB_STATE_CANCELLED for better maintainability. Commit 5f6ea4e
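
The constants plausibly read as below; comparing by state name, as in the snippet above, sidesteps enum validation when the SDK returns a state the client doesn't know:

# State names from Google's batch API; constant names per the reply above.
JOB_STATE_SUCCEEDED = "JOB_STATE_SUCCEEDED"
JOB_STATE_FAILED = "JOB_STATE_FAILED"
JOB_STATE_CANCELLED = "JOB_STATE_CANCELLED"

def state_name(state: object) -> str:
    """Return the state's name whether it's an enum member or a raw string."""
    return getattr(state, "name", str(state))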

if not job_ids:
return []

try:
Collaborator


Adding some logs here could be nice

Contributor Author


Agreed! Adding logs for batch status changes, file uploads, and API calls would be very helpful for debugging. This would be a good follow-up improvement.
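
Since this was left as a follow-up, the kind of logging meant here might look like the following; the logger name and function are hypothetical, though client.batches.cancel is a real google.genai call:

import logging

logger = logging.getLogger("batchata.providers.gemini")

def cancel_batches(client, job_ids: list[str]) -> list[str]:
    """Cancel the given Gemini batch jobs, logging progress for debugging."""
    if not job_ids:
        return []
    cancelled = []
    for job_id in job_ids:
        logger.info("Cancelling Gemini batch job %s", job_id)
        client.batches.cancel(name=job_id)
        cancelled.append(job_id)
    return cancelled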

…ml, improve code quality

Co-authored-by: yammesicka <1492939+yammesicka@users.noreply.github.com>
Comment on lines 79 to 89
IMAGE_EXTENSIONS = {'.jpg', '.jpeg', '.png', '.gif', '.webp'}
PDF_EXTENSION = '.pdf'

MIME_TYPE_MAP = {
'.jpg': 'image/jpeg',
'.jpeg': 'image/jpeg',
'.png': 'image/png',
'.gif': 'image/gif',
'.webp': 'image/webp',
'.pdf': 'application/pdf'
}
Collaborator


Consts should be at the start of the file

Contributor Author


Moved constants to the top of the file after imports for better organization. Commit da036aa

…hanism and clear errors

Co-authored-by: yammesicka <1492939+yammesicka@users.noreply.github.com>
Co-authored-by: yammesicka <1492939+yammesicka@users.noreply.github.com>
agamm and others added 5 commits July 27, 2025 00:15
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
- Remove unused pricing constants
- Return 0.0 when tokencost unavailable (matches Anthropic behavior)
- Add warning log message when tokencost missing
- Improve docstring with estimation warnings

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
- Add google-genai>=0.8.0 to dependencies in pyproject.toml
- Update uv.lock with new dependency and related packages
- Fixes ModuleNotFoundError for google.genai in CI environments
- Ensures Gemini provider imports successfully like OpenAI/Anthropic

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
@agamm agamm merged commit 98074c5 into main Jul 27, 2025
1 check passed
@agamm agamm deleted the copilot/fix-6 branch July 27, 2025 17:01