+ Transcribing Bentham with a computer

The Bentham Project at University College London, which is producing the scholarly edition of the writings of the British philosopher Jeremy Bentham, has become increasingly involved with digital humanities over the past decade. The project has digitised thousands of Bentham manuscripts and in 2010 launched one of the first academic crowdsourcing initiatives, Transcribe Bentham. Over the past few years it has also been experimenting with Handwritten Text Recognition (HTR).

A first HTR model was trained on around 900 pages of Bentham material. The resulting ‘English Writing M1’ model can recognise pages written in a relatively neat hand by Bentham and his secretaries with a Character Error Rate (CER) of 5-10%, meaning that roughly 90-95% of characters are transcribed correctly. This model is publicly available in Transkribus and can also be applied to other English handwriting from the 1800s and 1900s with good results.
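To make the CER figures concrete, here is a minimal sketch of how a character error rate is commonly calculated: the edit (Levenshtein) distance between the automatic transcript and a human reference transcript, divided by the length of the reference. The function names and sample strings are illustrative assumptions, and Transkribus may normalise text or count errors differently, so treat this as an approximation of the idea rather than the platform's own method.

```python
def levenshtein(reference: str, hypothesis: str) -> int:
    """Count the minimum single-character insertions, deletions and
    substitutions needed to turn `hypothesis` into `reference`."""
    # prev[j] holds the distance between the reference prefix processed
    # so far and the first j characters of the hypothesis.
    prev = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            substitution_cost = 0 if ref_char == hyp_char else 1
            curr.append(min(
                prev[j] + 1,                      # character missing from hypothesis
                curr[j - 1] + 1,                  # extra character in hypothesis
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def character_error_rate(reference: str, hypothesis: str) -> float:
    """Edit distance divided by reference length (a common CER definition)."""
    if not reference:
        raise ValueError("reference transcript must not be empty")
    return levenshtein(reference, hypothesis) / len(reference)


# Illustrative strings only, not taken from a real Bentham page.
reference = "the greatest happiness of the greatest number"
hypothesis = "the greatest hapiness of the greatest numher"
print(f"CER: {character_error_rate(reference, hypothesis):.1%}")
```

Under this definition, a CER of 5-10% corresponds to roughly one wrong character in every ten to twenty.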

The Bentham Project is now working to improve the automated recognition of Bentham’s most difficult handwriting, written when the philosopher was in his eighties and losing his sight. Early results show a promising CER of 26%, which is a very good basis for Keyword Spotting as a research tool for scholars interested in Bentham’s ideas.

Find out more at the Transcribe Bentham blog!

Screenshot from Transkribus with automatically generated transcript. Box 31, fol. 78, UCL Bentham Papers, Special Collections, University College London.
