You have probably seen a google scanned page before. It looks like this:
This is a sample page from the 13th century Old Occitan poem "Romance of Flamenca". This is one of the very first "modern novels" and a combination of comedy, parody, courtly love and historical epics. One day I will share with you its very intriguing plot. But you can probably guess where I am going. Yes, first, it is written in Old Occitan. How many of you can read it even with a dictionary? Second, a scanned format is useless for any search - the OCR recognition does not help much here... The solution is to build a digital annotated parallel corpus: 1) Annotated - because I am a linguist and interested in syntax and pragmatics, and 2) Parallel - with English translation - to make it available for a larger audience.
In my next episode, I will secretly tell you about who is Flamenca and what happened. Stay tuned!