HANDS-ON SESSION MONK POSTHUMUS NIJMWEGEN
Andreas Weber (UT) / Marti Huetink (Brill) / Lambert Schomaker (RUG)
In this hands-on session we will work with the so-called 'interrogations' of the digitized Prize papers. The Prize papers are a vast and valuable collection of judicial documents, private and official letters, shipping lists, and ledgers covering the period between 1652 and 1815. The 'interrogations' provide detailed information on ships, crew members, and cargo. Our session today shows how handwriting recognition services such as MONK can help to make collection such as the digitized Prize papers searchable.
Aim of this exercise is to get acquainted with MONK's word labelling mechanism. This helps to build an index and s search engine for a collection of digitized handwritten documents. I'll explain on the main screen how the labelling works.
Some basic transcription rules:
Here is a short recap how the labelling works:
prize26: tbd
prize27: tbd
prize28: tbd
prize29: tbd
prize30: tbd
MONK actively learns from user input. As soon as a word has been labeled several times, MONK starts looking for similarly looking word shapes. Words in green boxes have been labelled already. Words in red boxes can be labelled by you. If you spot other words in red boxes you can also label them.
Example: Account
prize01: Ammunition and anchor
prize02: answereth and attempted
prize03: Britannick and Board
prize10: Instruments and James
prize17: examinant and destroyed
prize26:
prize27:
prize28:
prize29:
prize30:
If you need additional examples check out one of these: https://monk.hpc.rug.nl/cgi-bin/monkweb?cmd=TrainedWords&annot=all&sortopt=sorted_name&sortorder=normal&trainedwordmethod=Workshop&db=99900001&dispmode=quick&prefix=&begin=0