Join us for this free webinar about neologism use, and large historical text collections in digital humanities
Hosted by the Oxford English Dictionary team.
Dr Tanja Säily, tenure-track assistant professor in English Language at the University of Helsinki, and Dr Eetu Mäkelä, tenure-track assistant professor in Human Sciences–Computing Interaction at the University of Helsinki, will be presenting their research on neologism use in the Corpus of Early English Correspondence and how the OED, as well as large historical text collections, are semi-automatically consulted within the project.
Join Dr Säily and Dr Mäkelä for an overview of the project and also for a discussion around the possibilities of using the OED in digital humanities research in general.
This session will cover:
· An overview of the neologism research project, with examples of new vocabulary and its social embedding in the Corpus of Early English Correspondence
· Semi-automatic workflows for moving between the OED and historical text collections
· Building a pipeline from the corpus, through automated spelling normalization, lemmatization, filtering, comparison, and analysis to results
· The barriers faced with regard to historical spelling variation, noise from optical character recognition and material bias
· A discussion of the solutions applied, including statistical and neural machine translation, look-ups against comparison corpora and user interfaces for manual checking of results
· What the researchers wished they knew before they started
· What the future looks like for this and other projects using the OED
· Q&A session – bring your questions or send them to the panellists in advance: email@example.com
Who is this session for?
· Researchers working in similar projects – using the OED or not
· Anyone interested in historical sociolinguistics, historical lexicology, or historical lexicography
· People interested in the complexities of applying computational methods to historical data