
PropertyGraph updates: added features for DGL, improved extract_subgraph() and num_vertices performance #2056

Merged
rapids-bot[bot] merged 6 commits into rapidsai:branch-22.04 from rlratzel:branch-22.04-propgraph_updates
Feb 3, 2022


@rlratzel
Contributor

@rlratzel rlratzel commented Feb 2, 2022

Various updates to PropertyGraph to support new features for DGL and to improve the performance of extract_subgraph() and num_vertices:

  • made the PropertyGraph column name constants public for use by clients
  • changed the num_vertices property implementation for better performance
  • added get_vertices()
  • allowed a None type_name for add_*_data()
  • updated docstrings
  • changed the edge_data added to Graphs to be a DataFrame for efficiency
  • added tests and benchmarks
  • refactored and added test fixtures
  • updated pytest.ini for the new benchmarks

Tested by confirming that all newly added tests pass, the benchmarks run, and the existing test_graph_store.py tests still pass.
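To make the API-facing items in the list above concrete, here is a minimal pure-Python sketch. It is not cugraph's PropertyGraph: `ToyPropertyGraph`, the constant values, the list-of-dicts storage, and the caching of `num_vertices` are all assumptions made for illustration. The PR does not say how the real `num_vertices` was sped up.

```python
# Toy stand-in for illustration only; this is NOT cugraph's PropertyGraph.
class ToyPropertyGraph:
    # Public column-name constants (values are hypothetical) so clients can
    # refer to internal columns without hard-coding strings.
    vertex_col_name = "_VERTEX_"
    src_col_name = "_SRC_"
    dst_col_name = "_DST_"
    type_col_name = "_TYPE_"

    def __init__(self):
        self._vertex_rows = []
        self._edge_rows = []
        self._num_vertices = None  # cached count (assumed optimization)

    def add_vertex_data(self, rows, vertex_col, type_name=None):
        # type_name may be None, meaning the rows carry no type label.
        for row in rows:
            props = {k: v for k, v in row.items() if k != vertex_col}
            self._vertex_rows.append(
                {self.vertex_col_name: row[vertex_col],
                 self.type_col_name: type_name,
                 **props}
            )
        self._num_vertices = None  # invalidate the cache

    def add_edge_data(self, rows, src_col, dst_col, type_name=None):
        for row in rows:
            props = {k: v for k, v in row.items()
                     if k not in (src_col, dst_col)}
            self._edge_rows.append(
                {self.src_col_name: row[src_col],
                 self.dst_col_name: row[dst_col],
                 self.type_col_name: type_name,
                 **props}
            )
        self._num_vertices = None  # invalidate the cache

    def get_vertices(self):
        # Unique vertex IDs seen in either the vertex data or the edge data.
        ids = {r[self.vertex_col_name] for r in self._vertex_rows}
        for r in self._edge_rows:
            ids.add(r[self.src_col_name])
            ids.add(r[self.dst_col_name])
        return sorted(ids)

    @property
    def num_vertices(self):
        # Recompute only when data has changed since the last call.
        if self._num_vertices is None:
            self._num_vertices = len(self.get_vertices())
        return self._num_vertices


pg = ToyPropertyGraph()
pg.add_vertex_data([{"id": 1, "name": "a"}, {"id": 2, "name": "b"}],
                   vertex_col="id")  # type_name left as None
pg.add_edge_data([{"s": 2, "d": 3, "w": 0.5}],
                 src_col="s", dst_col="d", type_name="follows")
vertices = pg.get_vertices()
```

In this sketch, vertex 3 appears only in the edge data but is still reported by `get_vertices()`, and a client can read the type label through `ToyPropertyGraph.type_col_name` rather than a hard-coded string.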

… wrapping in a class wrapper instead of a function (where the return type is no longer a class type, and isinstance() unexpectedly fails). Added tests.
…hanged num_vertices property impl for better performance, added get_vertices(), allow a None type_name for add_*_data(), updated docstrings, changed the edge_data added to Graphs to be a DataFrame for efficiency, added tests and benchmarks, refactored and added test fixtures, updated pytest.ini for new benchmarks.
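The first commit message above contrasts wrapping a class in a function with wrapping it in a class. The following sketch shows why the difference matters for isinstance(). All names here (`Graph`, `function_wrapper`, `class_wrapper`) are hypothetical and are not taken from the cugraph source; the warning text is likewise an assumption.

```python
import warnings


class Graph:
    def __init__(self, name):
        self.name = name


def function_wrapper(cls):
    # Returns a plain function: calling it builds an instance, but the
    # returned object is no longer a class.
    def make(*args, **kwargs):
        warnings.warn(f"{cls.__name__} is experimental", FutureWarning)
        return cls(*args, **kwargs)
    return make


def class_wrapper(cls):
    # Returns a subclass, so the result is still a type and instances
    # remain usable with isinstance().
    class Wrapper(cls):
        def __init__(self, *args, **kwargs):
            warnings.warn(f"{cls.__name__} is experimental", FutureWarning)
            super().__init__(*args, **kwargs)
    Wrapper.__name__ = cls.__name__
    Wrapper.__qualname__ = cls.__qualname__
    return Wrapper


GraphViaFunction = function_wrapper(Graph)
GraphViaClass = class_wrapper(Graph)

with warnings.catch_warnings():
    warnings.simplefilter("ignore")
    g1 = GraphViaFunction("a")
    g2 = GraphViaClass("b")

# isinstance() requires a type as its second argument, so the
# function-wrapped name cannot be used in the check.
try:
    isinstance(g1, GraphViaFunction)
    function_check = "ok"
except TypeError:
    function_check = "TypeError"

class_check = isinstance(g2, GraphViaClass)
```

With the function wrapper the check raises TypeError because the wrapped name is a function; with the class wrapper the check succeeds, and the instance is also still an instance of the original `Graph`.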
@rlratzel rlratzel added the improvement (Improvement / enhancement to an existing function), non-breaking (Non-breaking change), and python labels Feb 2, 2022
@rlratzel rlratzel added this to the 22.04 milestone Feb 2, 2022
@rlratzel rlratzel requested a review from BradReesWork February 2, 2022 20:18
@rlratzel rlratzel requested a review from a team as a code owner February 2, 2022 20:18
@rlratzel rlratzel self-assigned this Feb 2, 2022
@rlratzel
Contributor Author

rlratzel commented Feb 3, 2022

This could be a cudf issue with nunique(): nunique() is used in all of the failing tests, there was a cudf PR about nunique() today, and the tests pass when pandas is used. However, I could not repro the failure locally. Re-running to try CI with the latest cudf while debugging locally at the same time.

rerun tests

@codecov-commenter

codecov-commenter commented Feb 3, 2022

Codecov Report

❗ No coverage uploaded for pull request base (branch-22.04@0e99668).
The diff coverage is n/a.

@@               Coverage Diff               @@
##             branch-22.04    #2056   +/-   ##
===============================================
  Coverage                ?   73.62%           
===============================================
  Files                   ?      155           
  Lines                   ?    10172           
  Branches                ?        0           
===============================================
  Hits                    ?     7489           
  Misses                  ?     2683           
  Partials                ?        0           

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 0e99668...2c8064b.

@BradReesWork
Member

@gpucibot merge

@rapids-bot rapids-bot bot merged commit de0f779 into rapidsai:branch-22.04 Feb 3, 2022
@jakirkham jakirkham mentioned this pull request Aug 27, 2024
