Hi, I'm Vic P.
21 stars written in Cuda

LLM training in simple, raw C/CUDA

Cuda · 29,171 stars · 3,433 forks · Updated Jun 26, 2025

DeepEP: an efficient expert-parallel communication library

Cuda · 9,048 stars · 1,121 forks · Updated Feb 9, 2026

Sample codes for my CUDA programming book

Cuda · 2,021 stars · 385 forks · Updated Dec 14, 2025

[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl

Cuda · 1,821 stars · 464 forks · Updated Oct 9, 2023

Learn CUDA Programming, published by Packt

Cuda · 1,236 stars · 260 forks · Updated Dec 30, 2023

Examples demonstrating available options to program multiple GPUs in a single node or a cluster

Cuda · 874 stars · 148 forks · Updated Sep 26, 2025

CUDA Kernel Benchmarking Library

Cuda · 832 stars · 102 forks · Updated Mar 16, 2026

Graphics Processing Units Molecular Dynamics

Cuda · 738 stars · 175 forks · Updated Mar 14, 2026

Source code that accompanies The CUDA Handbook.

Cuda · 570 stars · 198 forks · Updated Mar 10, 2026

CUDA Learning guide

Cuda · 539 stars · 65 forks · Updated Jun 20, 2024

PopSift is an implementation of the SIFT algorithm in CUDA.

Cuda · 491 stars · 125 forks · Updated Jan 4, 2026

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

Cuda · 484 stars · 26 forks · Updated Jan 8, 2026

CUDA Data Parallel Primitives Library

Cuda · 438 stars · 97 forks · Updated Nov 9, 2018

EGGROLL in C, integer-first training

Cuda · 344 stars · 31 forks · Updated Dec 22, 2025

Alex Krizhevsky's original code from Google Code

Cuda · 199 stars · 32 forks · Updated Mar 10, 2016

FLAME GPU 2 is a GPU accelerated agent based modelling framework for CUDA C++ and Python

Cuda · 142 stars · 23 forks · Updated Mar 16, 2026

CUDA kernel author's tools

Cuda · 116 stars · 8 forks · Updated Apr 24, 2022

Parallel Simulated annealing in GPU using CUDA (used for floorplanning problem)

Cuda · 12 stars · 1 fork · Updated Jun 4, 2020

Header-only CUDA accelerated DNN library

Cuda · 9 stars · 6 forks · Updated Jun 14, 2017