Add OOM section to Best Practices #244

Merged
sarahyurick merged 13 commits into NVIDIA-NeMo:main from sarahyurick:memory_best_practices
Sep 30, 2024

Conversation

@sarahyurick

Contributor

No description provided.

Signed-off-by: Sarah Yurick <sarahyurick@gmail.com>
Signed-off-by: Sarah Yurick <sarahyurick@gmail.com>
@sarahyurick sarahyurick marked this pull request as ready for review September 11, 2024 22:26

@ryantwolf ryantwolf left a comment

Contributor

I left a couple of suggestions. Let me know what you think.

Comment thread docs/user-guide/bestpractices.rst Outdated
Comment thread docs/user-guide/bestpractices.rst Outdated
Comment thread docs/user-guide/bestpractices.rst
Comment thread docs/user-guide/bestpractices.rst
Comment thread docs/user-guide/bestpractices.rst Outdated
Signed-off-by: Sarah Yurick <sarahyurick@gmail.com>
@sarahyurick

Contributor Author

Thanks @ryantwolf ! Updated for whenever you get a chance to review again.

Comment thread docs/user-guide/bestpractices.rst Outdated
@ryantwolf

Contributor

@sarahyurick I left a few more comments.

Comment thread docs/user-guide/bestpractices.rst
Comment thread docs/user-guide/bestpractices.rst Outdated
Comment thread docs/user-guide/bestpractices.rst Outdated
Comment thread docs/user-guide/bestpractices.rst
@ayushdg ayushdg added the documentation Improvements or additions to documentation label Sep 23, 2024
@sarahyurick

Contributor Author

TODO: How to report/capture GPU memory and utilization for a specific step.

References:

Signed-off-by: Sarah Yurick <sarahyurick@gmail.com>
Comment thread docs/user-guide/bestpractices.rst Outdated
Signed-off-by: Sarah Yurick <sarahyurick@gmail.com>
Comment thread docs/user-guide/bestpractices.rst Outdated
Signed-off-by: Sarah Yurick <sarahyurick@gmail.com>

@ryantwolf ryantwolf left a comment

Contributor

I have one more minor thing and a review comment that somehow didn't get published before, sorry about that.

Comment thread docs/user-guide/bestpractices.rst Outdated
Comment thread docs/user-guide/bestpractices.rst Outdated
Comment thread docs/user-guide/bestpractices.rst Outdated
Signed-off-by: Sarah Yurick <sarahyurick@gmail.com>
@sarahyurick

Contributor Author

Thanks @ryantwolf ! Updated.

Signed-off-by: Sarah Yurick <sarahyurick@gmail.com>
Signed-off-by: Sarah Yurick <sarahyurick@gmail.com>
Signed-off-by: Sarah Yurick <sarahyurick@gmail.com>
Signed-off-by: Sarah Yurick <sarahyurick@gmail.com>
Signed-off-by: Sarah Yurick <sarahyurick@gmail.com>
Signed-off-by: Sarah Yurick <sarahyurick@gmail.com>

@ryantwolf ryantwolf left a comment

Contributor

Good on my end, thank you!

@sarahyurick sarahyurick merged commit 802ae31 into NVIDIA-NeMo:main Sep 30, 2024
@sarahyurick sarahyurick deleted the memory_best_practices branch October 25, 2024 20:45
jnke2016 pushed a commit to jnke2016/Curator that referenced this pull request Nov 12, 2025
Labels

documentation Improvements or additions to documentation

6 participants