Skip to content
View Fitzgera1d's full-sized avatar
  • Zhejiang University
  • Hangzhou, China

Block or report Fitzgera1d

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

ClawPhD is an agent for research that can turn academic papers into publication-ready diagrams, posters, videos, and more.

Python 101 6 Updated Mar 4, 2026

OpenMMLab 3D Human Parametric Model Toolbox and Benchmark

Python 1,404 155 Updated Nov 12, 2024

The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…

Python 2,684 296 Updated Feb 19, 2026
Python 468 40 Updated Dec 4, 2025

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 7,977 1,095 Updated Feb 24, 2026

Native Multimodal Models are World Learners

Python 1,466 59 Updated Dec 30, 2025

[CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Python 1,781 156 Updated Oct 7, 2025

Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.

Python 216 8 Updated Oct 12, 2025

A procedural Blender pipeline for photorealistic training image generation

Python 3,425 500 Updated Jan 20, 2026

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 2,949 213 Updated Jan 18, 2026

Official inference repo for FLUX.1 models

Python 25,252 1,858 Updated Jul 31, 2025

Enjoy the magic of Diffusion models!

Python 11,906 1,155 Updated Mar 4, 2026

[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Python 1,629 160 Updated Mar 27, 2025

CVPR 2024: Language Guided Generation of 3D Embodied AI Environments.

Python 526 61 Updated Apr 2, 2025

Extensible memoizing collections and decorators

Python 2,708 186 Updated Mar 2, 2026

GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)

Python 793 54 Updated Oct 26, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,724 757 Updated Feb 17, 2026

[ICCV 2025] Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models

Python 570 28 Updated Feb 12, 2026

[CVPR 2025] EnvGS: Modeling View-Dependent Appearance with Environment Gaussian

Python 237 16 Updated Dec 15, 2025

[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy

Python 919 48 Updated Feb 27, 2026

Code for "MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training", Arxiv 2025.

1,220 37 Updated Dec 26, 2025

4DHumans: Reconstructing and Tracking Humans with Transformers

Python 1,552 153 Updated Feb 7, 2026

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,511 1,352 Updated Mar 3, 2026

DeepStream SDK Python bindings and sample applications

Jupyter Notebook 1,795 531 Updated Oct 14, 2025

PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

Python 9,807 1,448 Updated Mar 2, 2026

Official repository for our work on micro-budget training of large-scale diffusion models.

Python 1,549 55 Updated Jan 12, 2025

[SIGGRAPH Asia 2023 (Technical Communications)] EasyVolcap: Accelerating Neural Volumetric Video Research

Python 1,545 97 Updated Jan 21, 2025

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 8,393 763 Updated May 31, 2024

High-resolution models for human tasks.

Python 5,295 311 Updated Nov 18, 2024

SkyReels-A2: Compose anything in video diffusion transformers

Python 704 63 Updated Jun 3, 2025
Next