+ Handwritten Text Recognition success with Italian documents from Archivio Storico Ricordi

The Archivio Storico Ricordi is one of the most important private music collections in the world and it has started to work with Handwritten Text Recognition (HTR) to process some of its treasures.  Founded in Milan in 1808, the Casa Ricordi publishing house contains a wealth of letters and scores from noted composers like Verdi and Puccini.

Screenshot of a letter from Giuilio Ricordi and its HTR transcription in Transkribus 

The archive submitted around 88,000 words of transcribed material written by Giulio Ricordi, the general manger of the publishing house in the late nineteenth century.  This training data was used to generate a model that can produce automatic transcriptions of pages with an impressive Character Error Rate (CER) of 12.3%.

Our example document shows the results on a sample page from the collection – take a look to see how much the computer gets right!

SHARE THIS ARTICLE

Recent Posts

June 20, 2022
Transkribus
By Joe Nockels, University of Edinburgh As part of his PhD research at the University of Edinburgh and National Library ...
June 15, 2022
Transkribus, Webinars
We are excited to launch our new “Ask Us Anything” webinar series, where attendees can ask us about anything related ...
June 9, 2022
Uncategorized
The new sharing feature It might be a small button for a screen, but it is a great function for ...