suzhenghang

Focus on embedded, real-time systems such as mobile devices

123 followers · 1.2k following

Guang Zhou

Stars

davidhughhenrymack / party-parrot

Auto party rave lights

Python 25 6 Updated Dec 3, 2025

ace-step / ACE-Step-1.5

The most powerful local music generation model that outperforms most commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.

Python 7,386 822 Updated Mar 7, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 275,473 52,572 Updated Mar 7, 2026

ysharma3501 / MiraTTS

A high quality and fast TTS repository

Python 505 42 Updated Dec 22, 2025

facebookresearch / sam-audio

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,357 290 Updated Jan 5, 2026

MattIPv4 / PyDMXControl

A Python 3 module to control DMX using OpenDMX or uDMX - Featuring fixture profiles, built-in effects and a web control panel.

Python 139 24 Updated Feb 5, 2024

FunAudioLLM / Fun-ASR

Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.

Python 913 80 Updated Feb 25, 2026

ETCLabs / Sound2Light

UNOFFICIAL - A tool converting sound input to OSC trigger signals.

C++ 115 11 Updated Mar 20, 2019

fschmid56 / EfficientAT

This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.

Python 331 56 Updated Nov 20, 2024

LandryBulls / Oculizer

Intelligent, real-time, audio-responsive DMX light control.

Jupyter Notebook 10 Updated Feb 13, 2026

Tongyi-MAI / Z-Image

Python 10,405 683 Updated Feb 9, 2026

GiantAILab / YingMusic-SVC

Official implementation of YingMusic-SVC.

Python 121 12 Updated Dec 29, 2025

QwenLM / Qwen-Image

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,498 449 Updated Feb 10, 2026

zai-org / CogView4

CogView4, CogView3-Plus and CogView3(ECCV 2024)

Python 1,105 80 Updated Mar 29, 2025

supertone-inc / supertonic

Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.

C++ 2,685 238 Updated Jan 22, 2026

zibojia / MiniMax-Remover

This is the official implementation of our paper: "MiniMax-Remover: Taming Bad Noise Helps Video Object Removal"

Python 543 51 Updated Jul 27, 2025

YaoFANGUK / video-subtitle-remover

AI-based tool for removing hard-coded subtitles and text watermarks from images and videos, producing output files at the original resolution. Runs locally, with no third-party API required.

Python 9,700 1,211 Updated Dec 3, 2025

WEIFENG2333 / AsrTools

Python 3,104 291 Updated Nov 25, 2025

ASLP-lab / MeanVC

A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows

Python 228 18 Updated Jan 8, 2026

NEKOparapa / AiNiee

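A tool focused on AI translation: one-click automatic translation of complex long texts such as RPG and SLG games, Epub and TXT novels, PDF, Word, and MD documents, and Srt, Vtt, and Lrc subtitles.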

Python 5,281 336 Updated Feb 27, 2026

facebookresearch / omnilingual-asr

Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages

Python 2,710 242 Updated Dec 30, 2025

stepfun-ai / Step-Audio-EditX

A powerful 3B-parameter, LLM-based reinforcement learning audio editing model that excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech.

Python 875 58 Updated Feb 13, 2026

ZFTurbo / Music-Source-Separation-Training

Repository for training models for music source separation.

Python 1,192 180 Updated Feb 4, 2026

Soul-AILab / SoulX-Podcast

SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.

Python 3,202 417 Updated Dec 11, 2025

Paulescu / image-classification-with-local-vlms

Learn to build and deploy local Visual Language Models for Edge AI

Jupyter Notebook 371 44 Updated Oct 30, 2025

WangRongsheng / awesome-LLM-resources

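🧑‍🚀 Summary of the world's best LLM resources (multimodal generation, agents, coding assistance, AI paper review, data processing, model training, model inference, o1 models, MCP, small language models, vision-language models).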

7,646 742 Updated Mar 7, 2026

NVlabs / OmniVinci

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

Python 636 51 Updated Feb 26, 2026

mit-han-lab / streaming-vlm

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Python 902 60 Updated Oct 15, 2025

User-tian / Conan

Official Implementation of "Conan: A Chunkwise Online Network for Zero-Shot Adaptive Voice Conversion"

Python 24 6 Updated Nov 12, 2025

a43992899 / openl2s

Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.

Python 17 3 Updated May 9, 2025
