- Beijing Institute of Technology
- Beijing, China
Stars
[CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding
[NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving"
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
✔ (Completed) Extremely comprehensive deep learning notes [Tudui: PyTorch] [Mu Li: Dive into Deep Learning] [Andrew Ng: Deep Learning] [Dafei: LLM Agents]
Agentic IM chatbot infrastructure that integrates many IM platforms, LLMs, plugins, and AI features, and can serve as your openclaw alternative. ✨
🚀 One-stop solution for creating your AI twin from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. Create from chat history…
Materials for the Hugging Face Diffusion Models Course
[ICLR'23 Spotlight & ECCV'24 & IJCV'24] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction
GEDepth: Ground Embedding for Monocular Depth Estimation (ICCV 2023)
fcakyon / labelme2coco
Forked from XCRobert/Labelme2Coco
A lightweight package for converting your labelme annotations into COCO object detection format.
PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point Clouds (CVPR 2023)
Implementation for Panoptic-PolarNet (CVPR 2021)
Code for the Lovász-Softmax loss (CVPR 2018)
[NeurIPS2021] Code Release of K-Net: Towards Unified Image Segmentation
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".