
fix: use max_completion_tokens for gpt-4.1+, gpt-5.x, and o-series models #38

Open
YuHuang0525 wants to merge 1 commit into Pickle-Pixel:main from YuHuang0525:fix/openai-max-completion-tokens

Conversation

@YuHuang0525

Problem

Newer OpenAI models — gpt-4.1, gpt-5.x, o1, o3, o4 — reject the legacy max_tokens parameter with HTTP 400 and require max_completion_tokens instead. This means any user who sets LLM_MODEL=gpt-5.2 (or any other newer model) gets an immediate 400 error on every LLM call, breaking scoring, tailoring, and cover letter generation entirely.

Relevant code before this fix (_chat_compat() in llm.py):

payload = {
    "model": self.model,
    "messages": messages,
    "temperature": temperature,
    "max_tokens": max_tokens,   # ← rejected by gpt-4.1+, gpt-5.x, o-series
}
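
For reference, a minimal repro of the failure using the official Python SDK (the model name is illustrative, and the exact error wording comes from the API, not this repo):

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Raises a 400 BadRequestError on newer models: the API rejects
# `max_tokens` and asks for `max_completion_tokens` instead.
client.chat.completions.create(
    model="gpt-5.2",
    messages=[{"role": "user", "content": "ping"}],
    max_tokens=64,
)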

Fix

Detect the model prefix at call time and send the correct parameter:

_new_param_models = ("gpt-4.1", "gpt-5", "o1", "o3", "o4")
if any(self.model.startswith(p) for p in _new_param_models):
    token_param = {"max_completion_tokens": max_tokens}
else:
    token_param = {"max_tokens": max_tokens}
  • All other providers (Gemini compat, Gemini native, local/Ollama) are unaffected — they continue using max_tokens as before.
  • The native Gemini path already uses maxOutputTokens and is untouched.
  • No behaviour change for existing gpt-4o, gpt-4o-mini, or local model users.
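
Putting it together, the payload assembly in _chat_compat() ends up roughly like this (a sketch; the surrounding method body is assumed from the snippets above):

payload = {
    "model": self.model,
    "messages": messages,
    "temperature": temperature,
    **token_param,  # exactly one of max_tokens / max_completion_tokens
}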

Testing

Verified manually with gpt-5.2 — the 400 error is resolved and completions return successfully after this change. No existing automated tests cover _chat_compat() directly.
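
If the prefix check were factored into a standalone helper, it would be trivially unit-testable. A sketch (helper name and tests are hypothetical, not part of this PR):

import pytest

def _token_param(model: str, max_tokens: int) -> dict:
    # Hypothetical standalone version of the selection logic above.
    new_param_models = ("gpt-4.1", "gpt-5", "o1", "o3", "o4")
    if any(model.startswith(p) for p in new_param_models):
        return {"max_completion_tokens": max_tokens}
    return {"max_tokens": max_tokens}

@pytest.mark.parametrize("model,key", [
    ("gpt-5.2", "max_completion_tokens"),
    ("o3-mini", "max_completion_tokens"),
    ("gpt-4o-mini", "max_tokens"),
])
def test_token_param(model, key):
    assert _token_param(model, 64) == {key: 64}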

fix: use max_completion_tokens for gpt-4.1+, gpt-5.x, and o-series models

Newer OpenAI models (gpt-4.1+, gpt-5.x, o1, o3, o4) reject the legacy
max_tokens parameter with HTTP 400 and require max_completion_tokens instead.

_chat_compat() now detects the model prefix at call time and sends the
correct parameter, while all other providers (Gemini, local) continue
using max_tokens unchanged.
Marissa0912 pushed a commit to Marissa0912/ApplyPilot that referenced this pull request on Apr 29, 2026:
LinkedIn (and other major sites' bot detection) probe extensions
by attempting to fetch resources from `chrome-extension://{id}/...`
where {id} is one of ~6,000 known extension IDs. If the resource
loads, the site has a strong signal that this user runs that
extension.

Empty `web_accessible_resources: []` means our extension exposes
NO resources to web pages, so the probe always fails. Combined
with the per-install random extension key (decision Pickle-Pixel#38), this
removes ApplyPilot's extension from cross-user fingerprintability
entirely.

`alarms` permission was already present in the manifest, so the
SW-heartbeat support from spec §3.1 is also in place.

Smoke verified: extension still loads in CfT 148 with the empty
WAR.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>