Historic handwritten text recognition – from research to a useful application

  • 14 May 2024
    5:00 PM – 6:00 PM
  • Online

Abstract

Handwritten text recognition is a standard machine learning task: with enough annotated training data and some computing power, a capable sequence-to-sequence model can be trained. However, building a useful application is not that simple. In this talk, I will touch on several practical aspects of building the PERO-OCR application and the models behind it. I will give an overview of our research, focusing on the efficient utilization of available data through active learning, self-supervised learning, self-training, domain adaptation, and transcription style adaptation.

Format

The talk will be held online (Zoom link attached). The planned format is a 30-minute talk followed by a 30-minute open discussion session. The talk is open to the public, so please distribute the announcement to your teams.
