Skip to content

feat(graph): add PJRTPlan execution wrapper with KV cache state management#75

Merged
dndungu merged 1 commit intomainfrom
p4-pjrt-plan
Apr 4, 2026
Merged

feat(graph): add PJRTPlan execution wrapper with KV cache state management#75
dndungu merged 1 commit intomainfrom
p4-pjrt-plan

Conversation

@dndungu
Copy link
Copy Markdown
Contributor

@dndungu dndungu commented Apr 4, 2026

Phase 4 PJRT pipeline: top-level execution plan.

…ement

Add RunPrefill, RunDecode, Reset, and Close methods to PJRTPlan[T] for
executing compiled PJRT programs with automatic KV cache buffer lifecycle
management. RunPrefill stores KV outputs for subsequent decode steps,
RunDecode donates previous KV buffers and captures new ones, and Reset
clears KV state for new generation sequences.
@dndungu dndungu merged commit 3e5cb40 into main Apr 4, 2026
1 check failed
@dndungu dndungu deleted the p4-pjrt-plan branch April 4, 2026 01:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant