- University of Macau, Macau, China
- http://www.zdzheng.xyz
- https://orcid.org/0000-0002-2434-9050
- https://scholar.google.com/citations?user=XT17oUEAAAAJ
Stars
🔥🔥🔥🔥 The first plugin to connect OpenClaw to WeChat Work (企业微信) / interoperable with personal WeChat / bot supports streaming output / supports @mentions in group chats / whitelist-based access control / fully visual configuration in Chinese
Text-Instructed Generation and Refinement for Template-Free Hand-Object Interaction
[SIGGRAPH ASIA 2024 TCS] AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data
A curated list of awesome temporal action segmentation resources.
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
The official implementation of "Last-Meter Precision Navigation for UAVs: A Diffusion-Refined Aerial Visual Servoing Approach"
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
video-SALMONN 2 is a powerful audio-visual large language model (LLM) that generates high-quality audio-visual video captions, which is developed by the Department of Electronic Engineering at Tsin…
Training VLM agents with multi-turn reinforcement learning
"VideoAgent: All-in-One Agentic Framework for Video Understanding, Editing, and Remaking"
Official code for our paper: "SketchThinker-R1: Towards Efficient Sketch-Style Reasoning in Large Multimodal Models".
This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.
[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)
Repo for Chinese Medical ChatGLM: instruction fine-tuning of ChatGLM on Chinese medical knowledge
「TIP2023」Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments
Official Code for "CLIP-SR: Collaborative Linguistic and Image Processing for Super-Resolution"
[ICLR 2026 🔥 ] Official implementation of "UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing"
Native Multimodal Models are World Learners
Chinese medical dialogue dataset
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Models".
[IEEE TMI 2024] MultiEYE: Dataset and Benchmark for OCT-Enhanced Retinal Disease Recognition from Fundus Images
🎬 Kaka Subtitle Assistant | VideoCaptioner - an LLM-based intelligent subtitle assistant covering the full subtitle workflow: generation, sentence segmentation, correction, and translation - a powerful tool for easy and efficient video subtitling.


