Transkribus
Spanish print XVIII-XIX

Free Public AI Model for Handwritten Text Recognition with Transkribus

PyLaia model trained on Ground Truth data produced by transcribing and manually segmenting a sample of 193 pages of the Spanish press of the 18th and 19th centuries, in particular volumes of “Diario de Madrid 1788-1825” (https://hemerotecadigital.bne.es/hd/card?oid=0001510462).

This model was developed within the CLARA-HD project (https://clara-nlp.uned.es/home/dh), funded by the Spanish Ministry, and is suited to automatically transcribing similar Spanish prints of the same period. Manual segmentation is recommended, since newspapers usually contain tables and columns. The model achieves a character error rate (CER) of 1% on the validation set.

For more information or details please contact Eva Sánchez Salido at evasan@lsi.uned.es or Ana García Serrano at agarcia@lsi.uned.es.

Please cite this model as: Menta, A., Sánchez-Salido, E., & García-Serrano, A. (2022). Transcripción de periódicos históricos: Aproximación CLARA-HD. Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing 2022: Projects and Demonstrations (SEPLN-PD 2022).

Model Overview

Name:
Diario de Madrid 1788-1825

Creator:
Eva Sánchez Salido, Ana García Serrano

Model ID:
48440

Century:
18th, 19th

Languages:
Spanish

Script:
Latin alphabet

Engine:
PyLaia

Material:
Print

CER on validation set:
1.00 %
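For readers unfamiliar with the metric, the CER figure above can be read as roughly one wrong character per hundred. The following is a minimal sketch of the usual definition (edit distance divided by the length of the reference transcription). It is an illustration only: how Transkribus normalizes text or handles whitespace when it computes CER is not described on this page, and the function names `levenshtein` and `cer` are hypothetical.

```python
def levenshtein(ref: str, hyp: str) -> int:
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn `hyp` into `ref`."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty string
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character error rate: edit distance relative to the reference length."""
    if not ref:
        raise ValueError("reference transcription must be non-empty")
    return levenshtein(ref, hyp) / len(ref)


# One substituted character in a 16-character reference line.
reference = "Diario de Madrid"
recognized = "Diario de Madrld"
print(f"CER: {cer(reference, recognized):.2%}")  # CER: 6.25%
```

Under this definition, a CER of 1% corresponds to about one character edit per 100 characters of Ground Truth text.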


Diario de Madrid 1788-1825 is freely available to everyone

Get started with Transkribus and use it for your own Material

You can use this model to automatically transcribe Print documents with Handwritten Text Recognition in Transkribus. It can be used in the Transkribus Expert Client as well as in Transkribus Lite.

This AI model was trained to automatically convert text from images of historical Latin alphabet documents into editable and searchable text.
