Fully configure frame processors when they are used directly on an audio stream by 1egoman · Pull Request #679 · livekit/python-sdks

1egoman · 2026-05-20T17:44:02Z

Updates the Python SDK so that FrameProcessor-based noise cancellation providers can be used directly on an AudioStream, without going through the agent's RoomIO to initialize themselves with credentials.

For example, with this change, something like the below becomes possible:

stream = rtc.AudioStream.from_track(
    track=track,
    sample_rate=SAMPLE_RATE,
    num_channels=CHANNELS,
    noise_cancellation=ai_coustics.audio_enhancement(model=ai_coustics.EnhancerModel.QUAIL_VF_L),
)

How this works: tracks now keep track of which room they belong to (held as a weakref). When a track's room changes, the track computes new frame processor options and sends them to any AudioStreams associated with it.
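The propagation described above can be sketched roughly as follows. This is only an illustration built on simplified stand-in classes: `Room`, `Track`, `AudioStream`, `ProcessorOptions`, `attach`, `set_room`, and `_on_processor_options` are hypothetical names chosen for the sketch, not the SDK's actual classes or methods.

```python
from __future__ import annotations

import weakref
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ProcessorOptions:
    """Hypothetical bundle of room-derived settings for a frame processor."""
    room_name: str
    server_url: str
    token: str


class Room:
    """Stand-in for a connected room (not the real rtc.Room)."""
    def __init__(self, name: str, server_url: str, token: str) -> None:
        self.name = name
        self.server_url = server_url
        self.token = token


class AudioStream:
    """Stand-in stream that records the latest options it was sent."""
    def __init__(self) -> None:
        self.options: Optional[ProcessorOptions] = None

    def _on_processor_options(self, options: Optional[ProcessorOptions]) -> None:
        self.options = options


class Track:
    def __init__(self) -> None:
        self._room_ref = None  # weakref.ref to the current Room, if any
        self._streams = weakref.WeakSet()  # streams reading from this track

    def attach(self, stream: AudioStream) -> None:
        self._streams.add(stream)
        stream._on_processor_options(self._compute_options())

    def set_room(self, room: Optional[Room]) -> None:
        # Hold only a weak reference so the track never keeps the room alive.
        self._room_ref = weakref.ref(room) if room is not None else None
        options = self._compute_options()
        for stream in list(self._streams):
            stream._on_processor_options(options)

    def _compute_options(self) -> Optional[ProcessorOptions]:
        room = self._room_ref() if self._room_ref is not None else None
        if room is None:
            return None
        return ProcessorOptions(room.name, room.server_url, room.token)


track = Track()
stream = AudioStream()
track.attach(stream)  # no room yet, so the stream receives None
room = Room("demo", "wss://example.invalid", "token-123")
track.set_room(room)  # the stream now receives options derived from the room
```

Passing `None` to `set_room` in this sketch pushes `None` to every attached stream, which loosely mirrors a track being moved out of a room.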

The noise_cancellation_leave_open parameter allows the agents sdk to call this from_track method with a frame processor which remains open across the whole session, and won't be auto-closed when the track is closed.

This goes along with livekit/agents#5867, which removes the relevant event handling logic in the agents sdk. I will follow up with a node version of this once the python one is in a good state.

Todo

Add some tests for this newly added behavior

theomonnom · 2026-05-27T22:46:43Z

        num_channels: int = 1,
        frame_size_ms: int | None = None,
        noise_cancellation: Optional[NoiseCancellationOptions | FrameProcessor[AudioFrame]] = None,
+        noise_cancellation_leave_open: bool = False,


Suggested change

noise_cancellation_leave_open: bool = False,

Can we move that inside NoiseCancellationOptions?

Unfortunately, no - this is important to the FrameProcessor[AudioFrame] side of that noise_cancellation union. Open to putting it somewhere else but it needs to be settable in the FrameProcessor path.

hmm, not sure if it's a good idea, but could it be a field on the FrameProcessor interface instead?

Then we could add it to NoiseCancellationOptions and new FrameProcessors would be able to set it on the processor itself

It's not a setting that a frame processor would always want to have set or not have set, so I'm not sure that would really make sense either.

For context, this flag exists so the agents sdk can reuse a single FrameProcessor across multiple underlying tracks. Previously this wasn't a problem, because the agents sdk was responsible for closing the FrameProcessor and could easily do so at room disconnection time. But to support using FrameProcessors directly on an AudioStream, the close call needs to be pushed down below the agents sdk layer. This flag lets the caller explicitly tell AudioStream that they will manage cleaning up the FrameProcessor, so that both use cases continue to work.
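To make the two use cases concrete, here is a minimal sketch of the cleanup decision. `RecordingProcessor` and `SketchStream` are hypothetical stand-ins (the real FrameProcessor and AudioStream have more to them), and `leave_open` simply plays the role of the flag under discussion.

```python
import asyncio


class RecordingProcessor:
    """Hypothetical stand-in for a FrameProcessor; only tracks close calls."""
    def __init__(self) -> None:
        self.closed = False

    def close(self) -> None:
        self.closed = True


class SketchStream:
    """Stand-in for AudioStream showing only the cleanup decision."""
    def __init__(self, processor: RecordingProcessor, leave_open: bool = False) -> None:
        self._processor = processor
        self._leave_open = leave_open

    async def aclose(self) -> None:
        # Close the processor unless the caller said it will manage cleanup.
        if not self._leave_open:
            self._processor.close()


async def demo() -> tuple:
    # Direct use on a stream: the stream closes the processor itself.
    owned = RecordingProcessor()
    await SketchStream(owned).aclose()

    # Shared use: one processor outlives several streams.
    shared = RecordingProcessor()
    for _ in range(3):
        await SketchStream(shared, leave_open=True).aclose()
    return owned.closed, shared.closed


result = asyncio.run(demo())
```

In the shared case the caller is then responsible for closing the processor once the session ends.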

I think this flag is not really configuring the noise suppression behavior, but rather how AudioStream deals with its own noise suppression. Maybe the name noise_cancellation_leave_open is a bit confusing?

How about close_noise_cancellation_on_stream_close or manage_noise_cancellation_processor?

> It's not a setting that a frame processor would always want to have set or not have set

it could stay undefined by default? 🤷
I understand however that it feels a bit weird for it to live on the processor if the processor itself doesn't really use the field.

We also briefly discussed the option of introducing a restart method on the processor. I think this could still be a viable alternative?

> We also briefly discussed the option of introducing a restart method on the processor. I think this could still be a viable alternative?

It could, but the con there is it's a breaking api change to FrameProcessor.

Just generally, I want to understand what folks' concerns are in more detail. Is it just the noise_cancellation_ prefix naming like shijing suggested (I think out of the two suggestions, I like manage_noise_cancellation_processor better)? Or is there something deeper behavior wise that is concerning?

FWIW, two fairly similar patterns I found:

LiveKitAPI conditionally controls aiohttp.ClientSession cleanup based on whether the user passes a custom session or uses an inbuilt session.

The LocalAudioTrack has a userProvidedTrack parameter which controls whether the track is cleaned up or not.
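Both patterns boil down to "only clean up what you created". A rough sketch of that idea, using hypothetical `Session` and `Client` classes rather than the actual LiveKitAPI or LocalAudioTrack code:

```python
from typing import Optional


class Session:
    """Hypothetical closable resource standing in for an HTTP session."""
    def __init__(self) -> None:
        self.closed = False

    def close(self) -> None:
        self.closed = True


class Client:
    def __init__(self, session: Optional[Session] = None) -> None:
        # Ownership is inferred: a session created here is ours to close.
        self._owns_session = session is None
        self._session = session if session is not None else Session()

    def close(self) -> None:
        if self._owns_session:
            self._session.close()


external = Session()
borrowing = Client(session=external)
borrowing.close()  # leaves the caller-provided session open

owning = Client()
owning.close()  # closes the session the client created itself
```

The difference from the flag in this pull request is that ownership here is inferred from the arguments, whereas a FrameProcessor is always passed in by the caller, so the intent has to be stated explicitly.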

Talked to @lukasIO in a 1:1 and he confirmed his concern was mostly with the naming, not with the broad approach, which is helpful.

A few other name ideas, in addition to shijing's suggestions (close_noise_cancellation_on_stream_close / manage_noise_cancellation_processor) - some of these would involve flipping the flag:

shared_noise_cancellation

noise_cancellation_externally_managed

auto_close_noise_cancellation

owns_noise_cancellation

Out of the above, I think I like auto_close_noise_cancellation the best:

# Usage within agents sdk:
AudioStream.from_track(
    # ...
    noise_cancellation=frame_processor,
    auto_close_noise_cancellation=False,
)

I'm going to update the pull request to use it for now in 8d5e656.

Another possible idea: maybe something like the below could be a different way to package the same data which could better contain it. In a world like this, noise_cancellation would be of type Union[NoiseCancellationOptions, FrameProcessorOptions, FrameProcessor]:

AudioStream.from_track(
    # ...
    noise_cancellation=FrameProcessorOptions(frame_processor=self, leave_open=True)
)

Do any of these ideas look better than the current state?
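For comparison, the wrapper idea could look something like the sketch below. Everything here is an assumption for illustration: `FrameProcessorOptions` does not exist in the SDK today, the `FrameProcessor` and `NoiseCancellationOptions` classes are empty placeholders, and `resolve` is a hypothetical helper showing how the union might be normalized.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Optional, Union


class FrameProcessor:
    """Placeholder for the real FrameProcessor interface."""


class NoiseCancellationOptions:
    """Placeholder for the real options class."""


@dataclass
class FrameProcessorOptions:
    """Hypothetical wrapper pairing a processor with its cleanup policy."""
    frame_processor: FrameProcessor
    leave_open: bool = False


NoiseCancellation = Union[NoiseCancellationOptions, FrameProcessorOptions, FrameProcessor]


def resolve(value: Optional[NoiseCancellation]) -> tuple[Optional[FrameProcessor], bool]:
    """Return (processor, should_auto_close) for the FrameProcessor paths."""
    if isinstance(value, FrameProcessorOptions):
        return value.frame_processor, not value.leave_open
    if isinstance(value, FrameProcessor):
        # A bare processor keeps the default: the stream closes it.
        return value, True
    # None or NoiseCancellationOptions: no processor for the stream to manage.
    return None, False


processor = FrameProcessor()
shared = resolve(FrameProcessorOptions(processor, leave_open=True))
direct = resolve(processor)
```

This keeps the cleanup policy next to the processor it applies to, at the cost of adding a third member to the union.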

1egoman added 7 commits May 26, 2026 11:12

feat: add MVP of propagating room downwards from room -> track -> aud…

538fc13

…io stream And extracting metadata from that room that can be fed into the frame processor.

feat: call _on_stream_info_updated with parent room reference on audi…

7c7eaa4

…o_stream

feat: call _on_credentials_updated with token / server url extracted …

12718d1

…from room

fix: remove debugging logs

af26b3d

fix: address lint errors

5ecca5d

feat: only call frame processor handlers if room is set

af56d61

fix: properly intercept room refresh token events

f62c247

1egoman force-pushed the frame-processor-on-audio-stream branch from 3e5a9ab to f62c247 on May 26, 2026 15:15

1egoman added 9 commits May 26, 2026 11:27

feat: add from __future__ import annotations to remove string types

e7ab10e

fix: address incorrect docs

f7f422d

refactor: centralize frame processor state logic into Track, not Audi…

24f2b6e

…oStream This makes it less complex.

feat: add auto cleanup of FrameProcessor as opt-out

ad32574

The agents sdk can pass this opt-out flag so that it can reuse the frame processor across many audio tracks

fix: disable no-op credentials push

ce5e793

Need to think about this a bit more, this pattern as written won't work, since the FrameProcessor today can't have a set of no-op credentials pushed.

fix: move processor close from __del__ to aclose

b9f34d0

fix: proxy throgh noise_cancellation_leave_open into AudioStream.from…

4c73cc8

…_track

fix: include missed noise_cancellation_leave_open in from_track

22f4896

fix: address type checker warning

2dbe350

1egoman commented May 26, 2026

Comment thread livekit-rtc/livekit/rtc/track.py Outdated

1egoman marked this pull request as ready for review May 26, 2026 21:25

1egoman requested review from cloudwebrtc, lukasIO and xianshijing-lk as code owners May 26, 2026 21:25

1egoman added 2 commits May 27, 2026 11:28

feat: add new _on_stream_info_cleared / _on_credentials_cleared Frame…

07fec79

…Processor methods, and use them when moving a track out of a room

fix: apply devin suggestion

8d3f4fe

1egoman force-pushed the frame-processor-on-audio-stream branch from 564b2c7 to 8d3f4fe on May 27, 2026 17:02

1egoman added 2 commits May 27, 2026 13:26

feat: add new frame processor tests

7743e6a

These tests exercise all the frame processor track reparenting under room / etc paths.

fix: address type errors in tests

75d8874

1egoman mentioned this pull request May 27, 2026

Move frame processor url/token/stream info to client sdk livekit/agents#5867

Draft

3 tasks

theomonnom reviewed May 27, 2026

1egoman mentioned this pull request May 29, 2026

Add initial support for frame processor usage directly on tracks livekit/node-sdks#671

Open

1 task

fix: rename noise_cancellation_leave_open -> auto_close_noise_cancell…

8d5e656

…ation

1egoman wants to merge 21 commits into main from frame-processor-on-audio-stream

theomonnom May 27, 2026
1egoman May 28, 2026 (edited)
lukasIO May 29, 2026
1egoman May 29, 2026 (edited)
xianshijing-lk May 29, 2026
lukasIO Jun 1, 2026
1egoman Jun 1, 2026 (edited)
1egoman Jun 1, 2026 (edited)

4 participants
