Skip to content
View Qonfused's full-sized avatar
⌨️
6 hours of debugging can save you 5 minutes of reading documentation
⌨️
6 hours of debugging can save you 5 minutes of reading documentation

Organizations

@videre-project

Block or report Qonfused

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Command-line toolkit for interacting with the Naya Create split keyboard over USB CDC.

Python 5 Updated Apr 21, 2026

CLI text optimizer built on GEPA. Uses Agentic Coding CLI's as mutator and observer -- no api keys required

Python 36 5 Updated Apr 21, 2026
Python 890 122 Updated Apr 23, 2026

Stable Looped Models

Python 123 8 Updated Apr 16, 2026

The 7-Zip derivative intended for macOS

Swift 677 18 Updated Apr 23, 2026

Grant private entitlements to OSX apps

C++ 121 9 Updated Sep 13, 2020

FlexTensor is a tensor offloading and management library for PyTorch that enables running large models on limited GPU memory by intelligently offloading tensors between GPU and CPU memory.

Python 95 11 Updated Apr 19, 2026
Jupyter Notebook 970 174 Updated Apr 23, 2026

TriAttention — Efficient long reasoning with trigonometric KV cache compression. Enables OpenClaw local deployment on memory-constrained GPUs.

Python 639 53 Updated Apr 23, 2026

🎨 NeMo Data Designer: Generate high-quality synthetic data from scratch or from seed data.

Python 1,679 148 Updated Apr 23, 2026

turboquant-based compression engine for LLM KV cache

Python 57 8 Updated Apr 3, 2026

MathCode: A Frontier Mathematical Coding Agent

Python 475 48 Updated Apr 12, 2026

Python package for LLM compression

Python 319 10 Updated Apr 23, 2026
Python 559 190 Updated Apr 1, 2026

Code for the papers: “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling” and “Adaptive Block-Scaled Data Types”

Python 171 17 Updated Apr 21, 2026

Inspects nsys dumps and measures NCCL collective launch skew

Rust 2 Updated Dec 12, 2025

common in-memory tensor structure

C++ 1,201 160 Updated Jan 26, 2026

Region-level profiling for CUDA kernels with trace, NVBit, CUPTI, NSys, and an interactive Explorer.

Python 112 11 Updated Apr 17, 2026

Rust implementation of protobuf with editions support, JSON serialization, and zero-copy views

Rust 663 32 Updated Apr 23, 2026

APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Oustanding Paper Honorable Mention

Python 345 18 Updated Nov 29, 2025
Python 6,485 872 Updated Apr 21, 2026

Compile programs directly into transformer weights. Includes a 2D convex-hull KV cache with O(log n) inference.

Python 187 35 Updated Mar 25, 2026
TypeScript 5,797 720 Updated Apr 19, 2026

Repository for the blog post JAX-LM: Language Modeling and Distributed Training in JAX

Python 8 1 Updated Mar 24, 2026

🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

Jupyter Notebook 3,545 295 Updated Mar 27, 2026

A framework for verifiable reasoning with language models.

Python 29 6 Updated Mar 25, 2026

Running a big model on a small laptop

Objective-C 3,738 459 Updated Mar 19, 2026

REAP: Router-weighted Expert Activation Pruning for SMoE compression

Python 338 61 Updated Apr 17, 2026

Your personal intelligence agent. Watches the world from multiple data sources and pings you when something changes.

JavaScript 9,039 1,443 Updated Apr 3, 2026
Next