Mask CoMER: Enhancing Handwritten Mathematical Expression Recognition with Masked Language Pretraining and Regularization
Handwritten Mathematical Expression Recognition (HMER) is crucial for enhancing human-machine interaction in domains such as digitized education and automated document processing. While existing transformer-based methods employ coverage mechanisms to capture context, they often still lose critical cues that a human could quickly correct. To address this, we propose Mask CoMER, a two-stage training approach that combines a novel Masked Language Model pretraining objective with stochastic depth regularization. This method enhances contextual understanding and mitigates information loss, leading to improved recognition accuracy. Experiments on the CROHME 2014, 2016, and 2019 datasets demonstrate that Mask CoMER achieves state-of-the-art ExpRates of 64.56%, 63.03%, and 65.22%, respectively. These results outperform the baseline CoMER by up to 5% and surpass the previous SOTA by nearly 2%.
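As a rough illustration of the two ingredients named above, the masked-token pretraining objective and stochastic depth can be sketched as follows. This is a minimal sketch, not the paper's implementation: the `[MASK]` token name, the mask rate, and the survival probability are all illustrative assumptions.

```python
import random

MASK_TOKEN = "[MASK]"  # hypothetical mask symbol, not taken from the paper

def mask_tokens(tokens, mask_prob=0.15, rng=random):
    """MLM-style corruption: replace a random subset of LaTeX tokens with
    [MASK]. `targets` holds the original token at masked positions and
    None elsewhere (positions the loss would ignore)."""
    masked, targets = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            masked.append(MASK_TOKEN)
            targets.append(tok)
        else:
            masked.append(tok)
            targets.append(None)
    return masked, targets

def residual_with_stochastic_depth(x, sublayer, survival_prob=0.9,
                                   training=True, rng=random):
    """Stochastic depth on one residual branch: during training the
    sublayer is skipped with probability 1 - survival_prob; at inference
    its output is scaled by survival_prob instead."""
    if training:
        return x + sublayer(x) if rng.random() < survival_prob else x
    return x + survival_prob * sublayer(x)

# Example: corrupt the token sequence for \frac{a}{b}
expr = ["\\frac", "{", "a", "}", "{", "b", "}"]
masked, targets = mask_tokens(expr, mask_prob=0.5)
```

In a real model the masked sequence would feed the decoder and the cross-entropy loss would be computed only at masked positions; here plain Python stands in for the tensor machinery.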
Create and activate a conda environment:

```
conda create -n maskcomer python=3.7 -y
conda activate maskcomer
```

Make sure to install the PyTorch version compatible with your CUDA toolkit. For example, with CUDA 11.1 (note that `cudatoolkit` is a conda package, not a pip package):

```
conda install pytorch=1.8.1 torchvision=0.2.2 cudatoolkit=11.1 -c pytorch -c conda-forge
```

Then install the remaining dependencies:

```
pip install -r requirements.txt
```

If you find this work useful in your research, please cite:
```bibtex
@InProceedings{10.1007/978-3-032-04624-6_22,
  author="Phan, Nam Van Hai
    and Nguyen, Khoa Minh
    and Nguyen, Trung Thanh
    and Pham, Trung Thanh
    and Tran, Phuong-Nam
    and Dang, Duc Ngoc Minh",
  editor="Yin, Xu-Cheng
    and Karatzas, Dimosthenis
    and Lopresti, Daniel",
  title="Mask CoMER: Enhancing Handwritten Mathematical Expression Recognition with Masked Language Pretraining and Regularization",
  booktitle="Document Analysis and Recognition -- ICDAR 2025",
  year="2026",
  publisher="Springer Nature Switzerland",
  address="Cham",
  pages="375--390",
  abstract="Handwritten Mathematical Expression Recognition (HMER) is crucial for enhancing human-machine interaction in domains such as digitized education and automated document processing. While existing transformer-based methods employ coverage mechanisms to capture context, they often still lose critical cues that a human could quickly correct. To address this, we propose Mask CoMER, a two-stage training approach that combines a novel Masked Language Model Pretraining objective with stochastic depth regularization. This method enhances contextual understanding and mitigates information loss, leading to improved recognition accuracy. Experiments on the CROHME 2014, 2016, and 2019 datasets demonstrate that Mask CoMER achieves state-of-the-art ExpRates of 64.56{\%}, 63.03{\%}, and 65.22{\%}, respectively. These results outperform the baseline CoMER by up to 5{\%} and surpass the previous SOTA by nearly 2{\%}. These results underscore the robustness and effectiveness of our approach for HMER tasks. (The code is available at GitHub.)",
  isbn="978-3-032-04624-6"
}
```