Skip to content

fix: don't log default partitions when GPU is not part of an instance#276

Merged
tmcroberts97 merged 3 commits into
NVIDIA:mainfrom
tmcroberts97:fix/nvl-log-suppress
Feb 17, 2026
Merged

fix: don't log default partitions when GPU is not part of an instance#276
tmcroberts97 merged 3 commits into
NVIDIA:mainfrom
tmcroberts97:fix/nvl-log-suppress

Conversation

@tmcroberts97

Copy link
Copy Markdown
Contributor

Description

Don't log when nvl partition monitor encounters a GPU in the default partition when the machine has no instance on it.

Type of Change

  • Add - New feature or capability
  • Change - Changes in existing functionality
  • Fix - Bug fixes
  • Remove - Removed features or deprecated functionality
  • Internal - Internal changes (refactoring, tests, docs, etc.)

Related Issues (Optional)

Breaking Changes

  • This PR contains breaking changes

Testing

  • Unit tests added/updated
  • Integration tests added/updated
  • Manual testing performed
  • No testing required (docs, internal refactor, etc.)

Additional Notes

@tmcroberts97 tmcroberts97 requested a review from a team as a code owner February 12, 2026 21:29
Comment thread crates/api/src/nvl_partition_monitor/mod.rs Outdated
Don't log when nvl partition monitor encounters a GPU in the default
partition when the machine has no instance on it.

Signed-off-by: Thomas McRoberts <tmcroberts@nvidia.com>
Signed-off-by: Thomas McRoberts <tmcroberts@nvidia.com>

@kensimon kensimon left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Minor nitpick, feel free to address if you end up re-pushing anyway, otherwise not a big deal.

Comment thread crates/api/src/nvl_partition_monitor/mod.rs Outdated
Signed-off-by: Thomas McRoberts <tmcroberts@nvidia.com>
@tmcroberts97 tmcroberts97 merged commit d45bfab into NVIDIA:main Feb 17, 2026
33 of 34 checks passed
jd-nv pushed a commit that referenced this pull request Feb 19, 2026
…#276)

## Description
<!-- Describe what this PR does -->
Don't log when nvl partition monitor encounters a GPU in the default
partition when the machine has no instance on it. Also, properly log the NMX-M error.

## Type of Change
<!-- Check one that best describes this PR -->
- [ ] **Add** - New feature or capability
- [ ] **Change** - Changes in existing functionality  
- [x] **Fix** - Bug fixes
- [ ] **Remove** - Removed features or deprecated functionality
- [ ] **Internal** - Internal changes (refactoring, tests, docs, etc.)

## Related Issues (Optional)
<!-- If applicable, provide GitHub Issue. -->

## Breaking Changes
- [ ] This PR contains breaking changes

<!-- If checked above, describe the breaking changes and migration steps
-->

## Testing
<!-- How was this tested? Check all that apply -->
- [ ] Unit tests added/updated
- [ ] Integration tests added/updated  
- [ ] Manual testing performed
- [ ] No testing required (docs, internal refactor, etc.)

## Additional Notes
<!-- Any additional context, deployment notes, or reviewer guidance -->

---------

Signed-off-by: Thomas McRoberts <tmcroberts@nvidia.com>
tmcroberts97 added a commit to tmcroberts97/infra-controller-core that referenced this pull request Mar 12, 2026
…NVIDIA#276)

## Description
<!-- Describe what this PR does -->
Don't log when nvl partition monitor encounters a GPU in the default
partition when the machine has no instance on it. Also, properly log the NMX-M error.

## Type of Change
<!-- Check one that best describes this PR -->
- [ ] **Add** - New feature or capability
- [ ] **Change** - Changes in existing functionality  
- [x] **Fix** - Bug fixes
- [ ] **Remove** - Removed features or deprecated functionality
- [ ] **Internal** - Internal changes (refactoring, tests, docs, etc.)

## Related Issues (Optional)
<!-- If applicable, provide GitHub Issue. -->

## Breaking Changes
- [ ] This PR contains breaking changes

<!-- If checked above, describe the breaking changes and migration steps
-->

## Testing
<!-- How was this tested? Check all that apply -->
- [ ] Unit tests added/updated
- [ ] Integration tests added/updated  
- [ ] Manual testing performed
- [ ] No testing required (docs, internal refactor, etc.)

## Additional Notes
<!-- Any additional context, deployment notes, or reviewer guidance -->

---------

Signed-off-by: Thomas McRoberts <tmcroberts@nvidia.com>
tmcroberts97 added a commit to tmcroberts97/infra-controller-core that referenced this pull request Mar 12, 2026
…NVIDIA#276)

## Description
<!-- Describe what this PR does -->
Don't log when nvl partition monitor encounters a GPU in the default
partition when the machine has no instance on it. Also, properly log the NMX-M error.

## Type of Change
<!-- Check one that best describes this PR -->
- [ ] **Add** - New feature or capability
- [ ] **Change** - Changes in existing functionality
- [x] **Fix** - Bug fixes
- [ ] **Remove** - Removed features or deprecated functionality
- [ ] **Internal** - Internal changes (refactoring, tests, docs, etc.)

## Related Issues (Optional)
<!-- If applicable, provide GitHub Issue. -->

## Breaking Changes
- [ ] This PR contains breaking changes

<!-- If checked above, describe the breaking changes and migration steps
-->

## Testing
<!-- How was this tested? Check all that apply -->
- [ ] Unit tests added/updated
- [ ] Integration tests added/updated
- [ ] Manual testing performed
- [ ] No testing required (docs, internal refactor, etc.)

## Additional Notes
<!-- Any additional context, deployment notes, or reviewer guidance -->

---------

Signed-off-by: Thomas McRoberts <tmcroberts@nvidia.com>
jd-nv pushed a commit that referenced this pull request Mar 12, 2026
…#276)

## Description
<!-- Describe what this PR does -->
Don't log when nvl partition monitor encounters a GPU in the default
partition when the machine has no instance on it. Also, properly log the NMX-M error.

## Type of Change
<!-- Check one that best describes this PR -->
- [ ] **Add** - New feature or capability
- [ ] **Change** - Changes in existing functionality
- [x] **Fix** - Bug fixes
- [ ] **Remove** - Removed features or deprecated functionality
- [ ] **Internal** - Internal changes (refactoring, tests, docs, etc.)

## Related Issues (Optional)
<!-- If applicable, provide GitHub Issue. -->

## Breaking Changes
- [ ] This PR contains breaking changes

<!-- If checked above, describe the breaking changes and migration steps
-->

## Testing
<!-- How was this tested? Check all that apply -->
- [ ] Unit tests added/updated
- [ ] Integration tests added/updated
- [ ] Manual testing performed
- [ ] No testing required (docs, internal refactor, etc.)

## Additional Notes
<!-- Any additional context, deployment notes, or reviewer guidance -->

---------

Signed-off-by: Thomas McRoberts <tmcroberts@nvidia.com>
nvcoop pushed a commit to nvcoop/bare-metal-manager-core that referenced this pull request Mar 12, 2026
…NVIDIA#276)

## Description
<!-- Describe what this PR does -->
Don't log when nvl partition monitor encounters a GPU in the default
partition when the machine has no instance on it. Also, properly log the NMX-M error.

## Type of Change
<!-- Check one that best describes this PR -->
- [ ] **Add** - New feature or capability
- [ ] **Change** - Changes in existing functionality  
- [x] **Fix** - Bug fixes
- [ ] **Remove** - Removed features or deprecated functionality
- [ ] **Internal** - Internal changes (refactoring, tests, docs, etc.)

## Related Issues (Optional)
<!-- If applicable, provide GitHub Issue. -->

## Breaking Changes
- [ ] This PR contains breaking changes

<!-- If checked above, describe the breaking changes and migration steps
-->

## Testing
<!-- How was this tested? Check all that apply -->
- [ ] Unit tests added/updated
- [ ] Integration tests added/updated  
- [ ] Manual testing performed
- [ ] No testing required (docs, internal refactor, etc.)

## Additional Notes
<!-- Any additional context, deployment notes, or reviewer guidance -->

---------

Signed-off-by: Thomas McRoberts <tmcroberts@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants