Skip to content
View charithcherry's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report charithcherry

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
charithcherry/README.md
header
portfolio email linkedin profile views

👋 Hello

I'm an ML Engineer building production-grade systems where memory, latency, and model independence are non-negotiable. Currently shipping real-time sports analytics at Owl AI (X Games, snowboard) and a brain-inspired memory framework called NeuroStack that runs in my system tray. Coming up on my MS at CU Boulder (GPA 3.93/4.00), TA-ing Distributed Systems, and graduating May 2026 — open to full-time ML / SE / DS roles.


📈 Recent impact

↑ 29%

real-time inference throughput
45 → 58 FPS via TensorRT, quantization, prefetch

+23%

Top-1 accuracy on AWA2
CLIP × KG-RGCN beats CLIP baseline (59% vs 36%)

−50%

deployment time
15 → 7 min dockerized + CI/CD on Azure

−46%

A/B-test decision latency
group-sequential sampling at Fittlyf

1 of 4

SOTA models beaten
Frontiers paper on nuclear fuel ML

🧠 Currently building

A framework, two applications built on it, and a fourth bet on motion. NeuroStack is the foundation; everything else is what gets built on top.

01 — NeuroStack  private · in development

A brain-inspired memory framework for AI agents. Built from scratch — no LangChain, no AutoGen, no ORM.

  • 8 memory layers across 4 temperature zones — Hot (identity / temporal context / experience), Session (episodic buffer), Cool (episodic store + Kuzu entity graph + procedural memory), Archive (soft-delete)
  • Multi-signal retrieval0.4 × cosine + 0.3 × ACT-R activation + 0.2 × salience + 0.1 × recency. Retrieved memories rank higher next time, automatically.
  • CompactionAgent runs 10 jobs every 5 min — embed, Zettelkasten link, decay, consolidate to semantic graph, reflect, re-promote, trim, refresh temporal cache
  • Salience-gated ingestion — every episode scored at write time: 0.5 × surprise + 0.5 × LLM importance
  • Local-first ModelRouter — qwen3.5:4b on Ollama by default, OpenAI as fallback chain (gpt-5.2 → 5.1 → 5 → 4.1 → 4o)
  • Stack  ·  Python · SQLite (28 tables) · Kuzu graph DB · Ollama · OpenAI · embeddings (text-embedding-3-small)

02 — Research Agent  private · running in tray

First app built on NeuroStack. Runs silently in the macOS / Windows system tray, researches papers continuously, and brief me every morning.

  • Autonomous arXiv + Semantic Scholar + web search every 2h
  • Trend reports every 4h · daily morning briefings · personal & career assistant chat
  • Sandboxed code agent that writes and executes Python with procedural memory
  • 7-daemon orchestrator — research / trends / briefing / compaction / model-check / local-warmup
  • Stack  ·  tkinter + pystray · arXiv + Semantic Scholar + Brave Search APIs · ~696 KB Python

03 — Brain Component  in design

The next NeuroStack app — fully local agent runtime. No OpenAI fallback, no Anthropic, no provider lock-in.

  • Privacy-by-default · cost-bounded · resilient to provider rate-limits or model sunsets
  • One config flag swaps Ollama → llama.cpp → MLX → Claude → GPT
  • Stack  ·  NeuroStack · Ollama / llama.cpp / MLX · MCP-compatible tool registry

04 — Adaptive Motion Intelligence  research drafted

Task-specific movement understanding beyond pose landmarks. The platform doesn't ship analyzers — it generates them.

  • Intent → agent-network generates analyzer → sandbox validates → deterministic runtime
  • Action signatures over time, not pose snapshots — feedback / scoring / comparison
  • Three surfaces: coaching · rehab · analytics
  • Stack  ·  MediaPipe · PyTorch · agent network · WebRTC

→ Full architecture write-up on the /building page


🦉 At Owl AI  Sept 2025 — present

Real-time sports analytics for X Games snowboard. ML pipeline that ingests live broadcast feeds, extracts pose, runs temporal models, and emits commentator-facing trick metrics in real time.

  • Productionized a live snowboard video ML pipeline on MediaMTX (SRT) for reliable broadcast ingest
  • 45 → 58 FPS GPU inference via TensorRT, INT8 quantization, caching/prefetching
  • Trained keypoint + temporal models on large-scale X Games video; tracked with MLflow
  • Bayesian / MCMC scoring of trick metrics → event-score and podium-probability priors
  • Distributed GPU data-curation pipeline orchestrating multimodal models (Qwen-3VL, NVIDIA Cosmos, SAM) for autonomous event detection, scene segmentation, dataset curation
  • LLM agents for labeling / dataset QA / bookkeeping

📍 Now

location:    Boulder, Colorado
education:   MS Computer Science · CU Boulder · 2024–2026
gpa:         3.93 / 4.00
working:     ML Engineer Intern @ Owl AI
teaching:    TA Distributed Systems · Prof. Mark Zhao
shipping:    real-time snowboard ML pipeline (X Games)
studying:    CSCI 5214 Big Data Architecture
reading:     Systems for ML — 22 paper reviews on the portfolio
open-to:     Full-time ML / SE / Data Science roles · May 2026

🛠 Tech I ship with

Skills

🤖 ML / Deep Learning

  • Frameworks · PyTorch · TensorFlow · Scikit-learn · XGBoost
  • Inference · TensorRT · CUDA · ONNX · INT8 quantization
  • Vision · OpenCV · MediaPipe · BlazePose · SAM/SAM2 · YOLOv9 · ResUNet
  • NLP · HuggingFace · BERT · seq2seq · LSTM · attention
  • Training tools · MLflow · Weights & Biases · DDP · mixed precision

🧬 LLMs / Agents / RAG

  • Local · Ollama · llama.cpp · MLX · qwen3.5 · llama3
  • Frontier · OpenAI · Anthropic Claude · Gemini
  • Retrieval · Azure Cognitive Search · FAISS · embeddings · re-ranking
  • Agentic · LlamaIndex · MCP · tool registries · RL (SCST · PPO)
  • Memory · ACT-R · A-MEM · Zep · MemGPT · Kuzu graph

🌐 Backend / APIs

  • Languages · Python · TypeScript · Java · SQL · C
  • Web · FastAPI · Flask · Node.js · Express · REST · gRPC
  • Frontend · React · Next.js 15 · Tailwind · shadcn/ui
  • Auth · NextAuth.js · OAuth2 · JWT
  • Streaming · MediaMTX · SRT · WebRTC · Kafka

☁️ Cloud / MLOps / Data

  • Cloud · Azure · AWS (Bedrock, SageMaker, Lambda, S3) · GCP
  • Containers · Docker · Kubernetes · GitHub Actions · CI/CD
  • Data · Databricks · Spark · Delta Lake · Delta Live Tables · Snowflake
  • Stores · PostgreSQL · MongoDB · Redis · SQLite · Kuzu
  • Observability · Azure Log Mgmt · Tableau · Power BI

🌟 Selected open source

Two-stage recommender · 44k products · FAISS+CLIP · <100ms · 95.4% diversity

Multi-app wellness platform · 6 Next.js services · MongoDB · Gemini · MediaPipe

Empirical study of data-storage formats on ML training · CIFAR-10 + 1M-row tabular

Zero-shot via CLIP × Knowledge-Graph R-GCN · +23% Top-1 over CLIP on AWA2

RL-trained dialogue summarization · LSTM seq2seq + SCST/PPO on SAMSum

Heart-chamber segmentation · U-Net on echocardiogram video


📊 GitHub

github stats github streak
top languages trophies

📝 Selected research


🏆 Honors

  • 3rd Place  ·  T9-MediHack 2025 (24 hr hackathon) — RePosture AI
  • 7th of 20 teams  ·  AWS Jam Hackathon, CU Boulder — 9/13 security challenges solved
  • Best Project of the Year (Computer Science)  ·  VTU Belagavi 2022 — KSCST-supported
  • Frontiers paper  ·  Co-author, nuclear fuel characterization, 2025

💼 Hiring me

I'm looking for full-time ML / SE / Data Science roles starting May 2026. US-authorized to work (OPT eligible). Strongest fit: production ML systems with hard latency, throughput, or cost constraints — sports / video / agents / RAG / inference optimization.

AIML resume  ·  SDE resume  ·  Transcript


🌐 Elsewhere


⚡ Off the keyboard

Gym · football · cricket · biking · the occasional film. Repetition over inspiration.

footer

Pinned Loading

  1. charithcherry charithcherry Public

  2. HealthyKid HealthyKid Public

    Jupyter Notebook 2 1

  3. MLnow_2.0 MLnow_2.0 Public

    Jupyter Notebook 1

  4. Innomatics_Internship_APR_21 Innomatics_Internship_APR_21 Public

    Jupyter Notebook

  5. Opencv-functions-on-objects Opencv-functions-on-objects Public

    Python