+ Transkribus – The Best Idea to Procrastinate I’ve Ever Had

Stefan Karcher, a graduate student at Heidelberg University has written a fascinating blog post explaining how he has been using Transkribus to process nineteenth-century German sermons.

Karcher took the opportunity to train his own Automated Text Recognition models.  He used around 30,000 transcribed words of training data to generate a model that can produce transcripts of his documents with a Character Error Rate of 8-10%.  The blog post notes that these transcripts are a useful and efficient basis for his research and includes a description of how these automated transcripts can be analysed with  Voyant Tools.

Do you want to train your own Automated Text Recognition model?

SHARE THIS ARTICLE

Recent Posts

November 17, 2022
Transkribus
We are thrilled to announce that yesterday, we hit 100,000 users on the Transkribus platform! Even with our years-long highly ...
August 12, 2022
Handwritten Text Recognition
Ever had trouble reading someone else’s handwriting?  Well, it may reassure you to know that it’s not only humans that ...
July 22, 2022
Uncategorized
The latest version of Transkribus Lite is here and brings a number of new features. Here are the most important ...