[ET Device Support] Define et_copy runtime h2d and d2h copy ops#18729
Conversation
Implement C++ runtime kernels for device copy ops using DeviceAllocator: - h2d_copy_out: infers device from out tensor, calls DeviceAllocator::copy_host_to_device - d2h_copy_out: infers device from self tensor, calls DeviceAllocator::copy_device_to_host - Registered via EXECUTORCH_LIBRARY macro Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/) [ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18729
Note: Links to docs will display an error until the docs builds have been completed. ❗ 1 Active SEVsThere are 1 currently active SEVs. If your PR is affected, please view them below: ❌ 1 New Failure, 6 Unrelated Failures, 1 Unclassified FailureAs of commit e2f82d9 with merge base 77df9b7 ( NEW FAILURE - The following job has failed:
UNCLASSIFIED FAILURE - DrCI could not classify the following job because the workflow did not run on the merge base. The failure may be pre-existing on trunk or introduced by this PR:
FLAKY - The following job failed but was likely due to flakiness present on trunk:
BROKEN TRUNK - The following jobs failed but was present on the merge base:👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
This PR needs a
|
…y ops" Implement C++ runtime kernels for device copy ops using DeviceAllocator: - h2d_copy_out: infers device from out tensor, calls DeviceAllocator::copy_host_to_device - d2h_copy_out: infers device from self tensor, calls DeviceAllocator::copy_device_to_host - Registered via EXECUTORCH_LIBRARY macro Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/) [ghstack-poisoned]
digantdesai
left a comment
There was a problem hiding this comment.
Review automatically exported from Phabricator review in Meta.
…y ops" Implement C++ runtime kernels for device copy ops using DeviceAllocator: - h2d_copy_out: infers device from out tensor, calls DeviceAllocator::copy_host_to_device - d2h_copy_out: infers device from self tensor, calls DeviceAllocator::copy_device_to_host - Registered via EXECUTORCH_LIBRARY macro Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/) [ghstack-poisoned]
Pull Request resolved: #18729 Implement C++ runtime kernels for device copy ops using DeviceAllocator: - h2d_copy_out: infers device from out tensor, calls DeviceAllocator::copy_host_to_device - d2h_copy_out: infers device from self tensor, calls DeviceAllocator::copy_device_to_host - Registered via EXECUTORCH_LIBRARY macro ghstack-source-id: 383670667 @exported-using-ghexport Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)
Pull Request resolved: #18729 Implement C++ runtime kernels for device copy ops using DeviceAllocator: - h2d_copy_out: infers device from out tensor, calls DeviceAllocator::copy_host_to_device - d2h_copy_out: infers device from self tensor, calls DeviceAllocator::copy_device_to_host - Registered via EXECUTORCH_LIBRARY macro ghstack-source-id: 383670667 @exported-using-ghexport Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)
Pull Request resolved: #18729 Implement C++ runtime kernels for device copy ops using DeviceAllocator: - h2d_copy_out: infers device from out tensor, calls DeviceAllocator::copy_host_to_device - d2h_copy_out: infers device from self tensor, calls DeviceAllocator::copy_device_to_host - Registered via EXECUTORCH_LIBRARY macro ghstack-source-id: 384965089 @exported-using-ghexport Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)
Pull Request resolved: #18729 Implement C++ runtime kernels for device copy ops using DeviceAllocator: - h2d_copy_out: infers device from out tensor, calls DeviceAllocator::copy_host_to_device - d2h_copy_out: infers device from self tensor, calls DeviceAllocator::copy_device_to_host - Registered via EXECUTORCH_LIBRARY macro ghstack-source-id: 385346808 @exported-using-ghexport Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)
Pull Request resolved: #18729 Implement C++ runtime kernels for device copy ops using DeviceAllocator: - h2d_copy_out: infers device from out tensor, calls DeviceAllocator::copy_host_to_device - d2h_copy_out: infers device from self tensor, calls DeviceAllocator::copy_device_to_host - Registered via EXECUTORCH_LIBRARY macro ghstack-source-id: 386351153 @exported-using-ghexport Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)
Pull Request resolved: #18729 Implement C++ runtime kernels for device copy ops using DeviceAllocator: - h2d_copy_out: infers device from out tensor, calls DeviceAllocator::copy_host_to_device - d2h_copy_out: infers device from self tensor, calls DeviceAllocator::copy_device_to_host - Registered via EXECUTORCH_LIBRARY macro ghstack-source-id: 386433420 @exported-using-ghexport Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)
Pull Request resolved: #18729 Implement C++ runtime kernels for device copy ops using DeviceAllocator: - h2d_copy_out: infers device from out tensor, calls DeviceAllocator::copy_host_to_device - d2h_copy_out: infers device from self tensor, calls DeviceAllocator::copy_device_to_host - Registered via EXECUTORCH_LIBRARY macro ghstack-source-id: 386460692 @exported-using-ghexport Differential Revision: [D99636776](https://our.internmc.facebook.com/intern/diff/D99636776/)
clone #18729 due to bot crash
Stack from ghstack (oldest at bottom):
Implement C++ runtime kernels for device copy ops using DeviceAllocator:
DeviceAllocator::copy_host_to_device
DeviceAllocator::copy_device_to_host
Differential Revision: D99636776