Finding patterns in eighteenth-century weddings – new blog from Xerox

Xerox Research Centre Europe is one of the READ research partners, with responsibility for Document Understanding. Document Understanding is a crucial part of the process of training computers to recognise historical documents, as Hervé Déjean from the Xerox team explains in this blog.

Document Understanding involves analysing the layout of a document in order to extract human-understandable information about its content. Hervé’s blog presents a useful overview of the concept and offers specific details about how this method can be applied to historical documents.

Image from Passau Diocesan Archives

Hervé describes how he has been using sequential pattern mining techniques on eighteenth-century wedding registers provided by Passau Diocesan Archives, another partner in the READ project. Document Understanding helps to ensure that we can group information from a document into a meaningful sequence – in this case, ensuring the right groom is matched with the right bride on the right day!
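
To give a rough feel for what sequential pattern mining means here, the sketch below looks for runs of labels that recur across several pages. It is a simplified illustration, not the method described in Hervé's blog: the function name `frequent_patterns`, the labels (`date`, `groom`, `bride` and so on) and the sample pages are all hypothetical, and only contiguous patterns of a fixed length are considered.

```python
from collections import Counter


def frequent_patterns(sequences, length, min_support):
    """Return contiguous patterns of `length` found in at least `min_support` sequences."""
    support = Counter()
    for seq in sequences:
        # Collect each pattern once per sequence, so that support counts
        # the number of sequences containing the pattern.
        seen = {tuple(seq[i:i + length]) for i in range(len(seq) - length + 1)}
        support.update(seen)
    return {pattern: n for pattern, n in support.items() if n >= min_support}


# Hypothetical pages: each is a top-to-bottom sequence of labelled text lines.
pages = [
    ["date", "groom", "bride", "date", "groom", "bride"],
    ["date", "groom", "bride", "witness"],
    ["heading", "date", "groom", "bride"],
]

patterns = frequent_patterns(pages, length=3, min_support=3)
print(patterns)
```

With this toy data, the only three-label run that appears on every page is date, groom, bride, which is the kind of repeating structure that lets each wedding record be grouped correctly.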
