Transkribus – The Best Idea to Procrastinate I’ve Ever Had

Stefan Karcher, a graduate student at Heidelberg University, has written a fascinating blog post explaining how he has been using Transkribus to process nineteenth-century German sermons.

Karcher took the opportunity to train his own Automated Text Recognition models. He used around 30,000 transcribed words of training data to generate a model that produces transcripts of his documents with a Character Error Rate of 8-10%, meaning that roughly one character in every ten to twelve needs correcting. The blog post notes that these transcripts are a useful and efficient basis for his research, and it describes how the automated transcripts can be analysed with Voyant Tools.
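For readers unfamiliar with the metric, the sketch below shows how a Character Error Rate is commonly computed: the edit (Levenshtein) distance between a reference transcript and the automated transcript, divided by the length of the reference. This is a minimal illustration of the general definition, not Transkribus's own implementation, which may normalise text differently. The function names and the sample strings are assumptions made up for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions
    and substitutions needed to turn string a into string b."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string costs i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute ca with cb (or match)
            ))
        prev = curr
    return prev[-1]


def character_error_rate(reference: str, hypothesis: str) -> float:
    """Edit distance divided by the reference length (hypothetical helper)."""
    if not reference:
        raise ValueError("reference transcript must not be empty")
    return levenshtein(reference, hypothesis) / len(reference)


# Made-up example strings: one wrong character in a ten-character reference.
reference = "Die Gnade."
hypothesis = "Die Gnabe."
print(f"CER: {character_error_rate(reference, hypothesis):.0%}")
```

By this definition, a rate of 8-10% corresponds to roughly 8 to 10 character edits for every 100 characters of reference text.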

Do you want to train your own Automated Text Recognition model?

