Add context to IR.do_evaluate #20322

TomAugspurger · 2025-10-21T14:44:42Z

Description

This adds a keyword-only context argument to cudf_polars IR.do_evaluate method. The purpose to provide access to special pieces of data that might be necessary for controlling an IR nodes' execution, but doesn't belong on the IR node itself as a non-child argument. Specifically, we'd like to provide a CUDA stream argument as part of #20228, but we generalize that slightly and provide a system for providing arbitrary data.

A few notes on the implementation:

For now, the context is just an empty dataclass. I suspect its design might change in the future.
I've opted to push the creation of the context as high as possible. For now it's created in _callback and passed into ir.evaluate / evaluate_streaming and from there to all the methods that require it.
There's some awkwardness between how our IR nodes and Dask's task graph treat arguments. I've opted to make context keyword only in IR.do_evaluate(..., context). However, Dask's task graph doesn't really deal with that. It wants a tuple of (function, arg1, arg2, ...). So that requires using functools.partial(function, context=context)(arg1, arg2, ...).
After implementing this, I realized that Expr.evaluate also takes a context, and its a different type ExecutionContext :( I can rename the IR variant if we want.

Just a draft for now, and probably not worth reviewing until I have a branch somewhere that combines CUDA streams with this to verify it meets our needs.

This adds a keyword-only `context` argument to cudf_polars IR.do_evaluate method. The purpose to provide access to special pieces of data that might be necessary for controlling an IR nodes' execution, but doesn't belong on the IR node itself as a non-child argument. Specifically, we'd like to provide a CUDA `stream` argument, but we generalize that slightly and provide a system for providing arbitrary data.

copy-pr-bot · 2025-10-21T14:44:46Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

TomAugspurger · 2025-10-21T16:21:31Z

5a77a80 has a POC for how this can be used. We add a new_stream: Callable[[], Stream] member to the context dataclass. Inside of do_evaluate we can call context.new_stream(). Once rapidsai/rapidsmpf#592 is done, we should be able to pass in a context that uses the stream pool from rapidsmpf.

Alternatively, rather than giving a Callable[[], Stream] we could attach a stream directly to the context and relying on dataclasses.replace() with new streams as needed. I'm not sure which is better at the moment.

Finally, we could drop the dataclass and just make it a dictionary. But I'd prefer to keep things structured where possible, so that both the functions and the callers of the function know what belongs in the context. We can attach an extra field to the dataclass that's just a dictionary if we need to pass arbitrary things in.

…context

python/cudf_polars/cudf_polars/callback.py

…context

python/cudf_polars/cudf_polars/experimental/parallel.py

…context

TomAugspurger · 2025-10-22T16:58:01Z

/merge

github-actions bot added Python Affects Python cuDF API. cudf-polars Issues specific to cudf-polars labels Oct 21, 2025

github-project-automation bot added this to cuDF Python Oct 21, 2025

github-actions bot assigned TomAugspurger Oct 21, 2025

GPUtester moved this to In Progress in cuDF Python Oct 21, 2025

TomAugspurger added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Oct 21, 2025

TomAugspurger added 2 commits October 21, 2025 10:32

linting

c2792a7

Merge remote-tracking branch 'upstream/main' into tom/cudf-polars-ir-…

208360d

…context

TomAugspurger commented Oct 21, 2025

View reviewed changes

python/cudf_polars/cudf_polars/callback.py Show resolved Hide resolved

TomAugspurger added 2 commits October 21, 2025 10:48

docfix

1da5e58

Merge branch 'main' into tom/cudf-polars-ir-context

8145ae4

TomAugspurger marked this pull request as ready for review October 21, 2025 17:48

TomAugspurger requested a review from a team as a code owner October 21, 2025 17:48

TomAugspurger requested review from Matt711 and vyasr October 21, 2025 17:48

TomAugspurger added 5 commits October 21, 2025 11:41

Merge remote-tracking branch 'upstream/main' into tom/cudf-polars-ir-…

0ca6b29

…context

missed some

17fa3e8

fixups

6608c43

one more

8528805

fixup the docs

38f2cb6

Matt711 approved these changes Oct 22, 2025

View reviewed changes

TomAugspurger mentioned this pull request Oct 22, 2025

[FEA]: Use CUDA streams when executing cudf-polars query with a rapidsmpf streaming network #20337

Closed

rjzamora reviewed Oct 22, 2025

View reviewed changes

python/cudf_polars/cudf_polars/experimental/parallel.py Show resolved Hide resolved

Merge remote-tracking branch 'upstream/main' into tom/cudf-polars-ir-…

5544da9

…context

rapids-bot bot merged commit 6fdbd4e into rapidsai:main Oct 23, 2025
473 of 489 checks passed

github-project-automation bot moved this from In Progress to Done in cuDF Python Oct 23, 2025

TomAugspurger deleted the tom/cudf-polars-ir-context branch October 23, 2025 13:23

TomAugspurger mentioned this pull request Oct 23, 2025

[WIP] RapidsMPF streaming-engine translation #20161

Open

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add context to IR.do_evaluate #20322

Add context to IR.do_evaluate #20322

Uh oh!

TomAugspurger commented Oct 21, 2025 •

edited

Loading

Uh oh!

copy-pr-bot bot commented Oct 21, 2025

Uh oh!

TomAugspurger commented Oct 21, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

TomAugspurger commented Oct 22, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add context to IR.do_evaluate #20322

Add context to IR.do_evaluate #20322

Uh oh!

Conversation

TomAugspurger commented Oct 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

copy-pr-bot bot commented Oct 21, 2025

Uh oh!

TomAugspurger commented Oct 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

TomAugspurger commented Oct 22, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

TomAugspurger commented Oct 21, 2025 •

edited

Loading

TomAugspurger commented Oct 21, 2025 •

edited

Loading