Nilo Pedrazzini


My DPhil thesis looks at competing finite and non-finite constructions in Early Slavic temporal subordination and their place within the typology of when-clauses. Theoretically, my research is largely set within formal discourse-representation frameworks. Methodologically, it is quantitative and computational, and draws from typological variation among modern and historical languages.

Besides my doctoral research, I am working full-time as a Turing Research Fellow at The Alan Turing Institute. Before that I had been splitting my time as a Research Software Engineer within the project Digital Scholarship @ Oxford (DiSC) and as a Research Associate in Corpus-Based Digital Humanities within the Living with Machines project.

Visit my website to read about my research in more detail.


I held General Linguistics (Paper VIII) tutorials at Oxford in 2020-2021 and was Hourly-Paid Lecturer in Digital Humanities at King’s College London between September 2021 and January 2022.


Stopponi, S., Pedrazzini, N. , Peels-Matthey, S., McGillivray, B., & Nissim, M. 2023. Evaluation of Distributional Semantic Models of Ancient Greek: Preliminary Results and a Road Map for Future Work. Proceedings of the Ancient Language Processing Workshop associated with RANLP-2023, 49–58.  Association for Computational Linguistics

Pedrazzini, N., & McGillivray, B. 2022. Machines in the media: semantic change in the lexical field of mechanization in 19th-century British newspapers.  In Proceedings of the 2nd International Workshop on Natural Language Processing for Digital Humanities, 85-95. Association for Computational Linguistics.

McGillivray, B., Marongiu, P., Pedrazzini, N., Wigdorowitz, M., & Zordan, E. 2022. Deep Impact: A study on the impact of data papers and datasets in the Humanities and Social Sciences.
Publications 10(4:39).

Pedrazzini, N. 2022. One question, different annotation depths: A case study in Early Slavic. Journal of Historical Syntax 6 (Special issue: “Annotating Historical Corpora”), 1–40. DOI:

Pedrazzini, N., & Eckhoff, H. M. 2021. OldSlavNet: A scalable Early Slavic dependency parser trained on modern language data. Software Impacts 100063. DOI:

Pedrazzini, N. 2020. Exploiting cross-dialectal gold syntax for low-resource historical languages: Towards a generic parser for pre-modern Slavic. In CEUR Workshop Proceedings (Proceedings of the Workshop on Computational Humanities Research (CHR 2020)), 237–247. Amsterdam, Netherlands. URL:



Stopponi, S., Pedrazzini, N., Peels-Matthey, S., McGillivray, B., & Nissim, M. Forthcoming. Natural language processing for Ancient Greek: Design, advantages, and challenges of language models. Diachronica.

Haug, D., & Pedrazzini N. Forthcoming. The semantic map of WHEN and its typological parallels. Frontiers in Communication (Special issue ‘The Evolution of Meaning: Challenges in Quantitative Lexical Typology’).

Marongiu, P., Pedrazzini, N., Ribary, M., & McGillivray, B. Forthcoming. Le Journal of Open Humanities Data ( JOHD): enjeux et défis dans la publication de data papers pour les sciences humaines.

Ridge, M., Tolfo, G., McGillivray, B., Pedrazzini, N., Westerling, K., & Last C. Forthcoming. Living with Machines: Libraries crowdsourcing in collaboration with data scientists and the public. In Short, H., Russel, I.G., Menon, N., & Siemens, R. (eds.), Libraries, Archives, and the Digital Humanities (Routledge Companions to the Digital Humanities).

Subscribe to Faculty of Linguistics, Philology and Phonetics