Skip to content

Prevent threaded Dask workers in DistRDF backend#22393

Open
JAGANNATHANJP wants to merge 3 commits into
root-project:masterfrom
JAGANNATHANJP:fix-dask-thread-validation
Open

Prevent threaded Dask workers in DistRDF backend#22393
JAGANNATHANJP wants to merge 3 commits into
root-project:masterfrom
JAGANNATHANJP:fix-dask-thread-validation

Conversation

@JAGANNATHANJP
Copy link
Copy Markdown

This PR prevents unsupported threaded Dask workers in DistRDF.

Distributed RDataFrame with Dask threads may lead to crashes and does not provide advantages due to Python GIL limitations. This change validates worker configuration at backend initialization and raises a RuntimeError when threaded workers are detected.

Suggested configuration:

  • processes=True
  • threads_per_worker=1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant