Reduce chat request retention #689

Merged: jahooma merged 1 commit into main from jahooma/fix-web-request-leaks, May 15, 2026
jahooma (Contributor) commented May 15, 2026

Summary

Reduce memory retained by chat completion provider streams by replacing full request bodies in BigQuery/audit paths with a compact request audit record.
This avoids keeping huge messages arrays alive until provider streams finish while preserving request metadata, counts, tool names, and raw provider error logging for debugging.
The endpoint error telemetry now uses the same compact audit record, and a TODO notes a future append-only message_request BigQuery table for full raw request bodies.
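For readers without the diff at hand, here is a minimal sketch of what a compact request audit record could look like. It is an illustration only: the type names, field names, and the `toRequestAuditRecord` helper are hypothetical and are not taken from this PR's code. The idea is that the long-lived stream handler holds on to the small record, so the full `messages` array can be garbage-collected before the provider stream finishes.

```typescript
// Hypothetical shapes: the real types in this PR are not shown in the summary.
interface ChatMessage {
  role: string
  content: unknown
}

interface ToolDefinition {
  type: string
  function?: { name: string }
}

interface ChatRequestBody {
  model: string
  stream?: boolean
  messages: ChatMessage[]
  tools?: ToolDefinition[]
}

// Small, fixed-shape summary that is cheap to retain and to log.
interface RequestAuditRecord {
  model: string
  stream: boolean
  messageCount: number
  messageCountsByRole: Record<string, number>
  toolNames: string[]
}

// Copies only primitives and short strings, so the returned record
// keeps no reference to the original body or its messages array.
function toRequestAuditRecord(body: ChatRequestBody): RequestAuditRecord {
  const messageCountsByRole: Record<string, number> = {}
  for (const message of body.messages) {
    messageCountsByRole[message.role] =
      (messageCountsByRole[message.role] ?? 0) + 1
  }

  const toolNames = (body.tools ?? [])
    .map((tool) => tool.function?.name)
    .filter((name): name is string => typeof name === 'string')

  return {
    model: body.model,
    stream: body.stream ?? false,
    messageCount: body.messages.length,
    messageCountsByRole,
    toolNames,
  }
}
```

In a sketch like this, the record would be built once at the start of the request, and only the record (not `body`) would be captured by any callback that lives until the stream ends.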

Validation

bun run --cwd web typecheck

jahooma merged commit 718d4fc into main May 15, 2026
35 checks passed
jahooma deleted the jahooma/fix-web-request-leaks branch May 15, 2026 22:51