[BEAM-12907] Run DataFrame API tests with multiple pandas versions #15587

Merged: TheNeuralBit merged 3 commits into apache:master from TheNeuralBit:test-multiple-pandas on Oct 14, 2021

@TheNeuralBit
Member

Adds infrastructure to verify the DataFrame API with multiple pandas versions.

@codecov

codecov bot commented Sep 24, 2021

Codecov Report

Merging #15587 (4b36c1f) into master (7969cd5) will decrease coverage by 0.00%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master   #15587      +/-   ##
==========================================
- Coverage   83.51%   83.50%   -0.01%     
==========================================
  Files         445      445              
  Lines       61415    61414       -1     
==========================================
- Hits        51288    51285       -3     
- Misses      10127    10129       +2     
Impacted Files Coverage Δ
sdks/python/apache_beam/dataframe/frames.py 94.85% <ø> (ø)
...ks/python/apache_beam/runners/worker/data_plane.py 87.50% <0.00%> (-1.71%) ⬇️
...ks/python/apache_beam/runners/worker/sdk_worker.py 89.25% <0.00%> (+0.24%) ⬆️
...hon/apache_beam/runners/worker/bundle_processor.py 93.64% <0.00%> (+0.37%) ⬆️
.../python/apache_beam/transforms/periodicsequence.py 98.36% <0.00%> (+1.63%) ⬆️
...python/apache_beam/runners/worker/worker_status.py 79.71% <0.00%> (+2.15%) ⬆️


@TheNeuralBit
Member Author

Run Whitespace PreCommit

@TheNeuralBit
Member Author

Run Python 3.8 PostCommit

@aaltay
Member

aaltay commented Oct 7, 2021

@TheNeuralBit - Is this ready for a review?

@TheNeuralBit
Member Author

Thanks for the ping, yes it is. I'll resolve the merge conflict and send it out.

@TheNeuralBit
Member Author

R: @lukecwik

@TheNeuralBit
Member Author

Run Python PreCommit

Member

@lukecwik commented Oct 12, 2021

Do we need to do something similar to #15693?

If not, please resolve merge and then LGTM.

Member Author

@TheNeuralBit commented Oct 13, 2021

Thanks, good question. We don't in this case since we're not using -m to select tests. #15693 also added a check to make run_pytest.sh fail fast in that case.

test.dependsOn "testPy38pandas-11"
test.dependsOn "testPy38pandas-12"
test.dependsOn "testPy38pandas-13"
(1..3).each {minor_version ->
Member

@lukecwik commented Oct 13, 2021

nit: even though the old version had what seemed like duplicate code, it was much more code search friendly.

Member Author

Good point, I went ahead and expanded it back out (for the pyarrow ones too)
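
For readers following the thread, here is a rough Python sketch of the searchability trade-off discussed above. It is only an illustration: the real tasks live in Gradle, and `generated_task_names`, `is_searchable`, and the `LOOP_SOURCE` string are hypothetical stand-ins, not code from this PR. Only the three `test.dependsOn` lines are taken from the diff quoted in the review.

```python
# Hypothetical illustration; the real tasks are defined in Gradle, not Python.


def generated_task_names(minor_versions, prefix="testPy38pandas-1"):
    """Build task names the way a loop over pandas minor versions would."""
    return [f"{prefix}{minor}" for minor in minor_versions]


# Sketch of a loop-based definition (an assumption, not the actual build file).
LOOP_SOURCE = '(1..3).each { minor -> test.dependsOn "testPy38pandas-1${minor}" }'

# The expanded definition, as quoted in the review thread.
EXPANDED_SOURCE = "\n".join([
    'test.dependsOn "testPy38pandas-11"',
    'test.dependsOn "testPy38pandas-12"',
    'test.dependsOn "testPy38pandas-13"',
])


def is_searchable(task_name, source):
    """True if a plain-text search for task_name would match this source."""
    return task_name in source


names = generated_task_names(range(1, 4))

for name in names:
    # The expanded form contains each literal name; the loop form does not.
    print(name, is_searchable(name, EXPANDED_SOURCE), is_searchable(name, LOOP_SOURCE))
```

Both forms yield the same task names, but only the expanded one lets someone find `testPy38pandas-12` with a code search, which is the reason given for reverting the loop.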

@TheNeuralBit TheNeuralBit merged commit 0542886 into apache:master Oct 14, 2021
dmitriikuzinepam pushed a commit to dmitriikuzinepam/beam that referenced this pull request Nov 2, 2021
…pache#15587)

* Run DataFrame API tests with multiple pandas versions

* create tasks in a loop, make sure they're run in precommit

* Don't define tasks in a loop
