+ New article on automated transcription and search of seventeenth-century document

The Pattern Recognition and Human Language Technology research group at the Universitat Politècnica de València (one of the READ project partners) have published an article exploring their work processing seventeenth-century botanical documents in the latest issue of Digital Scholarship in the Humanities.

The article explains how the team applied techniques of layout analysis, text line detection and automated transcription to a handwritten book by the Spanish botanist Bernardo de Cienfuegos.

The 1000 page work is now fully transcribed and searchable.  In addition to this impressive result, the article also points that the experiment has generated useful feedback on the interaction between human transcribers and Automated Text Recognition technology.

  • Alejandro H Toselli, Luis A Leiva, Isabel Bordes-Cabrera, Celio Hernández-Tornero, Vicent Bosch, Enrique Vidal, ‘Transcribing a 17th-century botanical manuscript: Longitudinal evaluation of document layout detection and interactive transcription’, Digital Scholarship in the Humanities, 33, 1, April 2018, 173-202 https://doi.org/10.1093/llc/fqw064
SHARE THIS ARTICLE

Recent Posts

July 3, 2024
News, Transkribus
Some Transkribus projects finish with a complete digitised collection in Transkribus. Some take that digitised source and use it to ...
June 12, 2024
News, Transkribus
When you think of Carolingian (or Caroline) minuscule, Charlemagne and his vast Carolingian empire likely come to mind. While the ...
May 14, 2024
Uncategorized
Understanding historical documents is key to understanding history. But understanding historical documents in Polish can be a challenge. Not only ...