Scientific Publications

Here you will find the main scientific output of the project so far: data, papers and presentations in the fields of digital humanities, computational linguistics and history. Below them, we also list the events hosted by the project to foster interdisciplinary discussion on digitised newspapers.



Ehrmann, Maud, Matteo Romanello, Simon Clematide, Phillip Benjamin Ströbel and Raphaël Barman. 2020. ‘Language Resources for Historical Newspapers: the Impresso Collection’. In Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020) (to appear).

Ehrmann, Maud, Matteo Romanello, Stefan Bircher and Simon Clematide. 2019. ‘Introducing the CLEF 2020 HIPE Shared Task: Named Entity Recognition and Linking on Historical Newspapers’. In Proceedings of the 42nd European Conference on IR Research, ECIR 2020. Publisher version, postprint, Zenodo record, slides.


Ehrmann, Maud, Estelle Bunout and Marten Düring. 2019. ‘Historical Newspaper User Interfaces: A Review’. In WLIC proceedings. Athens, Greece: IFLA. Related dataset: ‘Survey of Digitized Newspaper Interfaces’, available on Zenodo.

Amrhein, Chantal, and Simon Clematide. 2018. ‘Supervised OCR Error Detection and Correction Using Statistical and Neural Machine Translation Methods’. Journal for Language Technology and Computational Linguistics (JLCL) 33 (1): 49–76.


Clematide, Simon, Lenz Furrer, and Martin Volk. 2018. ‘Crowdsourcing the OCR Ground Truth of a German and French Cultural Heritage Corpus’. Journal for Language Technology and Computational Linguistics (JLCL) 33 (1): 25–47.

Makarov, Peter, and Simon Clematide. 2018a. ‘Imitation Learning for Neural Morphological String Transduction’. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing abs/1808.10701: 2877–82.

Makarov, Peter, and Simon Clematide. 2018b. ‘Neural Transition-Based String Transduction for Limited-Resource Setting in Morphology’. In Proceedings of the 27th International Conference on Computational Linguistics, 83–93.

Makarov, Peter, and Simon Clematide. 2018c. ‘UZH at CoNLL–SIGMORPHON 2018 Shared Task on Universal Morphological Reinflection’. In Proceedings of the CoNLL–SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection, 69–75. Brussels: Association for Computational Linguistics.