+ Looking back on 2016…

January 20, 2017
News

January is always a time for reflection and at the READ project, we have a lot to reflect on! We’ve been busy over the past 12 months in our mission to use new technologies to make historical documents more accessible. We wanted to give a quick recap of our major milestones and our future plans.

Research

Our research teams have been exploring the fields of Handwritten Text Recognition, Layout Analysis, Document Understanding, Writer Identification, Language Models and more. Some of these technologies are already available in our Transkribus tool and more will be integrated over the coming months. Towards the end of 2016 we also started to prepare for the launch of our ScriptNet platform, a new collection of research competitions where computer scientists will experiment with huge amounts of data to improve their technologies.

Discussion topics at one of the READ project meetings [Image by Louise Seaward]

Services

The Transkribus tool has been maintained and improved across the year. Over 2000 new users registered for a Transkribus account in 2016 and they are now able to access new features such as full-text search and a table editing tool. We have also developed How to Guides to help people navigate the platform.

We are working with partners inside and outside of the project to develop bespoke Handwritten Text Recognition models capable of transcribing and searching specific collections of documents. Our most successful models so far relate to eighteenth- and nineteenth-century German and English handwriting. But we are working with many more languages, styles and time-frames – watch this space!

Demonstrating the Table Editing Tool in Transkribus [Image by Louise Seaward]

Dissemination

Dissemination is a key part of READ – we want to raise awareness about the technology that we are developing and ensure that it is used by the people who need it.

We have helped to organise four conferences in Germany, Austria and the United Kingdom for collection holders, researchers and computer scientists. We have also been travelling a lot – delivering 30 Transkribus workshops (at last count!) in different European cities. In these workshops, we teach people how to use Transkribus and explain the potential of Handwritten Text Recognition. If you are interested in organising a workshop at your institution, just send us an email!

018 — READ project members taking a break from their computers at a meeting in Passau, Germany [Image by Louise Seaward]

In terms of our research outputs, we are working to ensure that our project publications are Open Access, our research tools are Open Source via Github and our published research data is being made available in Zenodo.

We have had fun spreading the word about Transkribus on Twitter and will be branching out to YouTube and Facebook this year.

Collaboration

Our network grew steadily across 2016. Over 30 institutions have now signed a Memorandum of Understanding with READ, which brings them into the project network. To give just a couple of examples, we are working with the Belgrade University Library on training computers to understand Cyrillic text and receiving advice from the Institute for Documentology and Editing on the role of Transkribus in digital scholarly editing.

Cyrillic document from the University Library of Belgrade. — Cyrillic document from the Belgrade University Library.

What’s next?

All this work will continue into 2017 but there will also be some exciting new developments.

The project technologies are beginning to be integrated into new web tools which will be made available via the Transkribus website. An e-learning module, a platform for crowdsourced transcription and a mobile app for scanning documents are all in the works. We are also developing our business plan to ensure that we can sustain the services provided by Transkribus far into the future.

Want to find out more?

You can find more detailed summaries of the work that READ has completed in these different areas by taking at look at the latest reports (deliverables) that we have submitted to the European Commission.

SHARE THIS ARTICLE

Mapping the concerts of Beethoven and Haydn: the “Concert Life in Vienna” project

Some Transkribus projects finish with a complete digitised collection in Transkribus. Some take that digitised source and use it to ...

June 12, 2024

News, Transkribus

What is Carolingian Minuscule?

When you think of Carolingian (or Caroline) minuscule, Charlemagne and his vast Carolingian empire likely come to mind. While the ...

May 14, 2024

Uncategorized

AI models for reading Polish cursive and printed texts

Understanding historical documents is key to understanding history. But understanding historical documents in Polish can be a challenge. Not only ...

Cookie	Description	Duration
viewed_cookie_policy	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.	1 hour
PHPSESSID	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.	1 year

Cookie	Description	Duration
VISITOR_INFO1_LIVE	This cookie is set by Youtube. Used to track the information of the embedded YouTube videos on a website.	5 months
IDE	Used by Google DoubleClick and stores information about how the user uses the website and any other advertisement before visiting the website. This is used to present users with ads that are relevant to them according to the user profile.	2 years

Cookie	Description	Duration
GPS	This cookie is set by Youtube and registers a unique ID for tracking users based on their geographical location	30 minutes
tk_or	This cookie is set by JetPack plugin on sites using WooCommerce. This is a referral cookie used for analyzing referrer behavior for Jetpack	5 years
tk_r3d	The cookie is installed by JetPack. Used for the internal metrics fo user activities to improve user experience	3 days
tk_lr	This cookie is set by JetPack plugin on sites using WooCommerce. This is a referral cookie used for analyzing referrer behavior for Jetpack	1 year
_ga	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, camapign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assigns a randoly generated number to identify unique visitors.	2 years
_gid	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the wbsite is doing. The data collected including the number visitors, the source where they have come from, and the pages viisted in an anonymous form.	1 day
matomo	For statistical analysis, we use “Matomo” on this website. This is an open source tool for web analysis. Matomo does not transmit data to servers outside the control of the READ-COOP. Matomo is deactivated when you visit our website. Only if you actively consent will your usage behaviour be recorded anonymously.	1 year

Cookie	Description	Duration
YSC	This cookies is set by Youtube and is used to track the views of embedded videos.	1 year
_gat	This cookies is installed by Google Universal Analytics to throttle the request rate to limit the colllection of data on high traffic sites.	1 minute

+ Looking back on 2016…

Recent Posts

Mapping the concerts of Beethoven and Haydn: the “Concert Life in Vienna” project

What is Carolingian Minuscule?

AI models for reading Polish cursive and printed texts

The COOP

Products & Services

Useful information

Helpful resources

Community