feat(anthropic): delegate cache_control kwarg to anthropic top-level param by ccurme · Pull Request #35967 · langchain-ai/langchain

ccurme (ccurme) · 2026-03-16T19:02:28Z

Removes handling introduced in #31523, as Anthropic now supports passing in cache_control=... top-level.

codspeed-hq · 2026-03-16T19:04:35Z

Merging this PR will not alter performance

✅ 3 untouched benchmarks
⏩ 33 skipped benchmarks¹

_{Comparing cc/anthropic_automatic_caching (3565c52) with master (69a7b9c)}

33 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, click here and archive them to remove them from the performance reports. ↩

) Closes #37042 --- `AnthropicPromptCachingMiddleware` was unconditionally setting top-level `cache_control` in `model_settings` for any `ChatAnthropic` subclass. That field is direct-Anthropic-API only — `ChatAnthropicBedrock` (which subclasses `ChatAnthropic` and passed the existing `isinstance` gate) errored with `cache_control: Extra inputs are not permitted`. Investigating that surfaced a related regression: PR #35967 also deleted the block-level `cache_control` injection in `_get_request_payload`, which silently disabled caching entirely for non-direct subclasses (Bedrock had been falling back to in-block breakpoints). This restores both paths. ## Changes - Add `_is_direct_anthropic_llm_type` predicate that allowlists `_llm_type == "anthropic-chat"`. Both the middleware's `_supports_automatic_caching` and the new branch in `ChatAnthropic._get_request_payload` route through it, so any subclass that overrides `_llm_type` (Bedrock today, future direct-API variants tomorrow) is treated as non-direct by default. Replaces the prior substring-matching denylist on `"bedrock"`/`"vertex"`. - Restore `_collect_code_execution_tool_ids`, `_is_code_execution_related_block`, and a new `_apply_cache_control_to_last_eligible_block` helper in `chat_models`. For non-direct subclasses, `_get_request_payload` now pops `cache_control` from kwargs and walks messages newest-to-oldest, attaching the breakpoint to the last block that isn't `code_execution`-related (Anthropic forbids breakpoints on those). - Emit `UserWarning` when `cache_control` is requested but every candidate block is `code_execution`-related — previously a silent drop. - `AnthropicPromptCachingMiddleware._apply_caching` now sets the top-level `cache_control` only when `_supports_automatic_caching(request.model)`. System-message and tool-definition breakpoints continue to apply for all `ChatAnthropic` subclasses, since those are accepted by every transport. - Note: `ChatAnthropicVertex` does not subclass `ChatAnthropic` (it lives in `langchain-google-vertexai` and ships its own `_get_request_payload`), so the chat-models changes here only affect Bedrock. The middleware-side gate covers Vertex implicitly via the `isinstance(request.model, ChatAnthropic)` check that already excludes it.

ccurme (ccurme) added 2 commits March 16, 2026 14:59

delegate to anthropic top-level param

2e2a6b3

bump min sdk version

3565c52

ccurme (ccurme) requested a review from Mason Daugherty (mdrxy) as a code owner March 16, 2026 19:02

Colin Francis (colifran) approved these changes Mar 16, 2026

View reviewed changes

ccurme (ccurme) merged commit 55711b0 into master Mar 17, 2026
66 checks passed

ccurme (ccurme) deleted the cc/anthropic_automatic_caching branch March 17, 2026 14:49

chenzimin mentioned this pull request Apr 21, 2026

feat(anthropic): add automatic prompt caching via top-level cache_control langchain-ai/langchainjs#10735

Open

Mason Daugherty (mdrxy) mentioned this pull request Apr 28, 2026

fix(anthropic): restore cache_control on non-direct subclasses #37057

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(anthropic): delegate cache_control kwarg to anthropic top-level param#35967

feat(anthropic): delegate cache_control kwarg to anthropic top-level param#35967
ccurme (ccurme) merged 2 commits intomasterfrom
cc/anthropic_automatic_caching

ccurme (ccurme) commented Mar 16, 2026

Uh oh!

codspeed-hq Bot commented Mar 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ccurme (ccurme) commented Mar 16, 2026

Uh oh!

codspeed-hq Bot commented Mar 16, 2026

Merging this PR will not alter performance

Footnotes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants