MetaHTR

Towards Writer-Adaptive Handwritten Text Recognition

Ayan Kumar Bhunia
Shuvozit Ghose*
Amandeep Kumar*
Pinaki Nath Chowdhury
Aneeshan Sain
Yi-Zhe Song

SketchX, Centre for Vision Speech and Signal Processing,
University of Surrey, United Kingdom
*Interned at SketchX

Published at CVPR 2021

[Paper]
[Talk]
[Slides]
[GitHub]



Handwritten Text Recognition (HTR) remains a challenging problem to date, largely due to the varying writing styles that exist amongst us. Prior works however generally operate with the assumption that there is a limited number of styles, most of which have already been captured by existing datasets. In this paper, we take a completely different perspective – we work on the assumption that there is always a new style that is drastically different, and that we will only have very limited data during testing to perform adaptation. This creates a commercially viable solution – being exposed to the new style, the model has the best shot at adaptation, and the few-sample nature makes it practical to implement. We achieve this via a novel meta-learning framework which exploits additional new-writer data via a support set, and outputs a writer-adapted model via single gradient step update, all during inference. We discover and leverage on the important insight that there exists few key characters per writer that exhibit relatively larger style discrepancies. For that, we additionally propose to meta-learn instance specific weights for a character-wise cross-entropy loss, which is specifically designed to work with the sequential nature of text data. Our writer-adaptive MetaHTR framework can be easily implemented on the top of most state-of-the-art HTR models. Experiments show an average performance gain of 5-7% can be obtained by observing very few new style data (≤ 16).


Algorithm and Quantitative Results





Ablative Studies





Short Presentation



Paper and Bibtex

[Paper]

Citation
 
MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition. In CVPR 2021.

[Bibtex]
@inproceedings{metahtr,
author = {Ayan Kumar Bhunia and Shuvozit Ghose, Amandeep Kumar and Pinaki Nath Chowdhury and Aneeshan Sain and Yi-Zhe Song},
title = {MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2021}
}
        

Acknowledgements


Website template from here and here.