How to perform distributed training on Amazon SageMaker using SageMaker's Distributed Data Parallel library and debug using Amazon SageMaker Debugger.
-
Updated
Jun 8, 2021 - Jupyter Notebook
How to perform distributed training on Amazon SageMaker using SageMaker's Distributed Data Parallel library and debug using Amazon SageMaker Debugger.
How to perform training on Amazon SageMaker using SageMaker's script mode and debug using Amazon SageMaker Debugger.
"An end-to-end Medical Imaging pipeline built on AWS SageMaker utilizing Transfer Learning (ResNet18). The project implements Hyperparameter Optimization (HPO) to minimize loss, leverages SageMaker Debugger & Profiler for resource optimization, and concludes with a Production-ready real-time inference endpoint
Add a description, image, and links to the sagemaker-debugger topic page so that developers can more easily learn about it.
To associate your repository with the sagemaker-debugger topic, visit your repo's landing page and select "manage topics."