+ Transkribus – The Best Idea to Procrastinate I’ve Ever Had

Stefan Karcher, a graduate student at Heidelberg University has written a fascinating blog post explaining how he has been using Transkribus to process nineteenth-century German sermons.

Karcher took the opportunity to train his own Automated Text Recognition models.  He used around 30,000 transcribed words of training data to generate a model that can produce transcripts of his documents with a Character Error Rate of 8-10%.  The blog post notes that these transcripts are a useful and efficient basis for his research and includes a description of how these automated transcripts can be analysed with  Voyant Tools.

Do you want to train your own Automated Text Recognition model?

SHARE THIS ARTICLE

Recent Posts

March 29, 2023
Uncategorized
The majority of Transkribus models are also trained to read just one language — after all, most historical documents are ...
March 23, 2023
Transkribus
Go to any history museum or read any history book and you’ll find that many of the stories and events ...
March 15, 2023
Uncategorized
By Fiona Park Not everyone who works with history is a professional historian. From hobby genealogists to volunteers in local ...