feat(realtime): support multi-message generation per response by longcw · Pull Request #5763 · livekit/agents

longcw · 2026-05-18T06:46:20Z

Summary

Process each MessageGeneration from generation_ev.message_stream serially via perform_audio_forwarding + perform_text_forwarding + wait_for_playout. Only one flush is in flight at a time.
Per-msg state is derived directly from the playback_finished event:
- full → emit ChatMessage(interrupted=False) with the msg's message_id
- partial → emit ChatMessage(interrupted=True) and call _rt_session.truncate(...) with this msg's local playback_position (not a cumulative offset)
- skipped → drop locally and call update_chat_ctx(...) so the realtime server removes never-played items from its history
_on_first_frame now early-returns once started_speaking_at is set, so per-msg first-frame callbacks don't re-fire _update_agent_state("speaking") for each message.

Alternative considered

#5690 makes multi-message work by flushing per message — that needs the synchronizer to keep pending/finalizing impls alive and serialize concurrent flushes in room_io/_output.py. Our AudioOutput assumes there is only one speech at a time, serializing per-message at the wait_for_playout boundary (this PR) avoids both changes.

close #5690, #5684

Some realtime providers (e.g. GPT-Realtime-2.0) emit multiple message items in a single response. Process each one serially: push frames, flush, wait_for_playout. Only one flush is ever in flight at a time, so room_io and the transcript synchronizer keep their single-segment invariants without modification. Per-msg state is derived from the playback_finished event: - 'full' -> emit ChatMessage(interrupted=False) with the msg's id - 'partial' -> emit ChatMessage(interrupted=True); call truncate() with the msg's local playback position - 'skipped' -> drop from local chat ctx; call update_chat_ctx() so the realtime server removes never-played items from history This is a cleaner alternative to flushing per-message, which would require keeping multiple in-flight flush_tasks / synchronizer segments alive simultaneously.

devin-ai-integration

Devin Review found 1 potential issue.

View 5 additional findings in Devin Review.

Server-side truncation must run independently of local ChatMessage emission. The previous order skipped truncate() when forwarded_text was empty (transcription disabled, or interrupt before the text stream caught up to audio), leaving the realtime server with the full un-truncated audio.

chenghao-mou requested a review from a team May 18, 2026 06:46

devin-ai-integration Bot reviewed May 18, 2026

View reviewed changes

Comment thread livekit-agents/livekit/agents/voice/agent_activity.py Outdated

longcw mentioned this pull request May 19, 2026

Realtime API: second message generation dropped with warning when using gpt-realtime-2 #5768

Closed

theomonnom approved these changes May 20, 2026

View reviewed changes

longcw merged commit 187433c into main May 20, 2026
24 checks passed

longcw deleted the longc/multi-message-realtime-v2 branch May 20, 2026 00:36

rosetta-livekit-bot Bot mentioned this pull request May 20, 2026

feat(realtime): support multi-message generation per response livekit/agents-js#1555

Open

yaniv-peretz mentioned this pull request May 20, 2026

feat(realtime): support multi-message generation per response - gpt-realtime-2 livekit/agents-js#1563

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(realtime): support multi-message generation per response#5763

feat(realtime): support multi-message generation per response#5763
longcw merged 2 commits into
mainfrom
longc/multi-message-realtime-v2

longcw commented May 18, 2026 •

edited

Loading

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

longcw commented May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Alternative considered

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

longcw commented May 18, 2026 •

edited

Loading