Skip to content
View layumi's full-sized avatar

Block or report layumi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🔥🔥🔥🔥 首个把OpenClaw接入企业微信的插件 / 个人微信可互通 / BOT支持流式输出 / 支持群聊@ / 支持白名单控制 / 全中文可视化配置

JavaScript 301 44 Updated Mar 5, 2026

Text-Instructed Generation and Refinement for Template-Free Hand-Object Interaction

Python 1 Updated Feb 12, 2026

[SIGGRAPH ASIA 2024 TCS] AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data

Python 660 47 Updated Oct 22, 2024

A curated list of awesome temporal action segmentation resources.

243 20 Updated Apr 4, 2024

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 275,132 52,508 Updated Mar 7, 2026

The official implementation of "Last-Meter Precision Navigation for UAVs: A Diffusion-Refined Aerial Visual Servoing Approach"

Python 4 Updated Mar 5, 2026

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,140 357 Updated Nov 13, 2025

video-SALMONN 2 is a powerful audio-visual large language model (LLM) that generates high-quality audio-visual video captions, which is developed by the Department of Electronic Engineering at Tsin…

Python 164 18 Updated Feb 23, 2026

A Pragmatic VLA Foundation Model

Python 893 70 Updated Feb 28, 2026

Training VLM agents with multi-turn reinforcement learning

Python 422 47 Updated Mar 1, 2026

"VideoAgent: All-in-One Agentic Framework for Video Understanding, Editing, and Remaking"

Python 488 71 Updated Oct 17, 2025

Official code for our paper: "SketchThinker-R1: Towards Efficient Sketch-Style Reasoning in Large Multimodal Models".

Python 7 2 Updated Nov 3, 2025

CubiCasa5k floor plan dataset

Jupyter Notebook 469 136 Updated Feb 10, 2026

DUSt3R: Geometric 3D Vision Made Easy

Python 6,996 735 Updated Sep 24, 2025

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,405 71 Updated Aug 4, 2025

[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)

1,991 172 Updated Sep 27, 2025

Repo for Chinese Medical ChatGLM 基于中文医学知识的ChatGLM指令微调

Python 1,035 159 Updated May 19, 2023

明医 (MING):中文医疗问诊大模型

Python 1,107 140 Updated May 23, 2025

「TIP2023」Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments

Python 203 19 Updated Dec 12, 2025

Official Code for "CLIP-SR: Collaborative Linguistic and Image Processing for Super-Resolution"

Jupyter Notebook 2 Updated Jan 26, 2026

[ICLR 2026 🔥 ] Official implementation of "UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing"

Python 136 4 Updated Jan 26, 2026

Official implementation of BLIP3o-Series

Python 1,642 77 Updated Nov 29, 2025

Native Multimodal Models are World Learners

Python 1,469 59 Updated Dec 30, 2025

Chinese medical dialogue data 中文医疗对话数据集

Python 1,648 281 Updated Aug 18, 2023

[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

Python 297 15 Updated Mar 13, 2024

✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Models".

Python 18 3 Updated Mar 13, 2025

[IEEE TMI 2024] MultiEYE: Dataset and Benchmark for OCT-Enhanced Retinal Disease Recognition from Fundus Images

Python 43 6 Updated Dec 8, 2025

🎬 卡卡字幕助手 | VideoCaptioner - 基于 LLM 的智能字幕助手 - 视频字幕生成、断句、校正、字幕翻译全流程处理!- A powered tool for easy and efficient video subtitling.

Python 13,456 1,095 Updated Feb 26, 2026
Next