Segment Sign Language Video

This code enables segmentation of sign language video into subtitle-units, i.e. segments of video approximately corresponding to a sentence or phrase in a subtitle. Details of the model can be found here: https://slrtp.com/papers/full_papers/SLRTP.FP.01.011.paper.pdf

If this code is of use to you, please cite the following article:

Bull, H., Gouiffès, M., Braffort, A.: Automatic Segmentation of Sign Language into Subtitle-Units. In: Proceedings of the European Conference on Computer Vision (ECCV), Sign Language Recognition, Translation and Production (SLRTP) Workshop (2020)

@article{bull2020automatic,
    author = {Bull, Hannah and Gouiffès, Michèle and Braffort, Annelies},
    journal = {Proceedings of the European Conference on Computer Vision (ECCV), Sign Language Recognition, Translation and Production (SLRTP) Workshop},
    month = {8},
    title = {{Automatic Segmentation of Sign Language into Subtitle-Units}},
    url = {https://slrtp.com/papers/full_papers/SLRTP.FP.01.011.paper.pdf},
    year = {2020}
}

Data used to train the model

The data used to train the model is MEDIAPI-SKEL, a 2D-skeleton database of French Sign Language video with aligned French subtitles, available on Ortolang for research purposes.

Bull, H., Braffort, A., Gouiffès, M.: MEDIAPI-SKEL - a 2D-skeleton video database of French Sign Language with aligned French subtitles. In: Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC’20). pp. 6063–6068. European Language Resource Association (ELRA), Marseille, France (May 2020)

@inproceedings{bull2020mediapiskel,
  title={{MEDIAPI}-{SKEL} - A 2{D}-Skeleton Video Database of French Sign Language With Aligned French Subtitles},
  author={Bull, Hannah and Braffort, Annelies and Gouiff\`es, Mich\`ele},
  booktitle={Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC'20)},
  year={2020},
  address = {Marseille, France},
  month = {May},
  pages ={6063--6068},
  publisher = {European Language Resource Association (ELRA)},
}

Input

The input is in the form of sequences of OpenPose 2D keypoints. To clean the OpenPose 2D keypoints and to extract the sequences of likely signers, use the code provided at:

https://github.com/hannahbull/clean_op_data_sl
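
For orientation, raw OpenPose writes one JSON file per frame, in which each detected person carries a flat list of keypoints ordered as x, y, confidence. The sketch below regroups that flat list into (x, y, confidence) triples. It only illustrates the raw OpenPose layout; the function name is hypothetical, and the .pkl sequences produced by clean_op_data_sl may be organised differently.

```python
import json


def keypoints_from_openpose_frame(frame_json, key="pose_keypoints_2d"):
    """Return, for each detected person, a list of (x, y, confidence) tuples.

    `frame_json` is the text of one OpenPose per-frame JSON file.
    This helper is illustrative and not part of this repository.
    """
    frame = json.loads(frame_json)
    people = []
    for person in frame.get("people", []):
        flat = person.get(key, [])
        # OpenPose stores keypoints as a flat list: x0, y0, c0, x1, y1, c1, ...
        usable = len(flat) - len(flat) % 3  # ignore an incomplete trailing triple
        triples = [tuple(flat[i:i + 3]) for i in range(0, usable, 3)]
        people.append(triples)
    return people


# A made-up frame with one person and two keypoints.
example = json.dumps(
    {"people": [{"pose_keypoints_2d": [10.0, 20.0, 0.9, 30.0, 40.0, 0.8]}]}
)
print(keypoints_from_openpose_frame(example))
```

Passing key="hand_left_keypoints_2d" or key="face_keypoints_2d" reads the other keypoint groups, provided OpenPose was run with the hand and face detectors enabled.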

Output

Frame-level probabilities of Subtitle-Units.

Subtitle file (.srt) with time tags corresponding to each detected sign language segment.
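
To show how the two outputs relate, the sketch below turns a list of frame-level probabilities into time-tagged .srt entries. It is a minimal illustration and not the code used in apply_model.py: the 0.5 threshold, the function names, and the placeholder subtitle text are all assumptions, and the repository may post-process its predictions differently.

```python
def probabilities_to_segments(probs, threshold=0.5):
    """Return (start_frame, end_frame) pairs, end exclusive, for runs of
    consecutive frames whose probability is at least `threshold`."""
    segments = []
    start = None
    for i, p in enumerate(probs):
        if p >= threshold and start is None:
            start = i  # a new segment begins
        elif p < threshold and start is not None:
            segments.append((start, i))  # the segment ended on the previous frame
            start = None
    if start is not None:
        segments.append((start, len(probs)))  # segment runs to the last frame
    return segments


def srt_timestamp(seconds):
    """Format a time in seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments, fps=25.0):
    """Build the text of an .srt file from frame-index segments."""
    blocks = []
    for n, (start, end) in enumerate(segments, start=1):
        times = f"{srt_timestamp(start / fps)} --> {srt_timestamp(end / fps)}"
        blocks.append(f"{n}\n{times}\nSubtitle-Unit {n}\n")  # placeholder text
    return "\n".join(blocks)


# Made-up probabilities for six frames.
probs = [0.1, 0.9, 0.8, 0.2, 0.7, 0.6]
print(segments_to_srt(probabilities_to_segments(probs), fps=25.0))
```

The fps value converts frame indices to seconds, so it should match the frame rate of the video the keypoints were extracted from.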

Example

An example of applying this model to extract the Subtitle-Units for a YouTube video is provided at this Google Colab link.

To produce .srt files with Subtitle-Units for sequences of OpenPose 2D keypoints, run:

python apply_model.py --input_folder 'data' --output_folder 'output' --which_keypoints 'body' --fps 25

To obtain predictions for the MEDIAPI-SKEL test set, run clean_op_data_sl on the OpenPose 2D keypoints and place the folders containing the .pkl files in mediapiskel_data/skeleton_sequences. Place the .vtt subtitles in the folder mediapiskel_data/subtitles. Run:

python reproduce_results_mediapiskel.py --input_folder 'mediapiskel_data/skeleton_sequences' --which_keypoints 'full' --video_information 'mediapiskel_data/video_information.csv' --subtitle_folder 'mediapiskel_data/subtitles'

References

OpenPose: https://github.com/CMU-Perceptual-Computing-Lab/openpose

mmskeleton: https://github.com/open-mmlab/mmskeleton
