tensor-core
Here are 3 public repositories matching this topic...
A reproducible GPU benchmarking lab that compares FP16 vs FP32 training on MNIST using PyTorch, CuPy, and Nsight profiling tools. This project blends performance engineering with cinematic storytelling—featuring NVTX-tagged training loops, fused CuPy kernels, and a profiler-driven README that narrates the GPU’s inner workings frame by frame.
Updated Sep 5, 2025 - Python
🎬 Explore GPU training efficiency with FP32 vs FP16 in this modular lab, utilizing Tensor Core acceleration for deep learning insights.
Updated Feb 20, 2026 - Python