Stars
Official code for the paper "Segment, Embed, and Align: A Universal Recipe for Aligning Subtitles to Signing"
Self-supervised video pretraining for sign language translation.
Automatic Evaluation for Pose Files
A cross-linguistic resource for sign language phonological inventories and features, modeled after PHOIBLE
A curated list of accessibility resources
MultimodalHugs is an extension of Hugging Face that offers a generalized framework for training, evaluating, and using multimodal AI models with minimal code differences, ensuring seamless compatib…
Subtitle alignment for sign language video
Multi-channel sign language translation metric.
Visualize .pose files with flexible controls right in VS Code
Code for "Optimizing Hand Area Detection in MediaPipe Holistic Full-Body Pose Estimation to Improve Accuracy and Prevent Downstream Errors"
WACV 2020 "Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison"
A web service offering HTML5 articles from arXiv.org as converted with latexml
This repository contains an extension of fairseq for pixel / visual representations of text for machine translation.
Code for the experiments in the paper "JWSign: A Highly Multilingual Corpus of Bible Translations for more Diversity in Sign Language Processing".
(ECCV 2024) SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark
Official code repo for the paper "Automatic Sound Event Detection and Classification of Great Ape Calls Using Neural Networks"
Effortless Real-Time Sign Language Translation
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
SLTUNET: A Simple Unified Model for Sign Language Translation (ICLR 2023)
Reading list for research topics in multimodal machine learning
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities





