HANDS-ON SESSION MONK

PRIZE PAPERS & HANDWRITING RECOGNITION

HANDS-ON SESSION MONK POSTHUMUS NIJMEGEN

Andreas Weber (UT) / Marti Huetink (Brill) / Lambert Schomaker (RUG)

In this hands-on session we will work with the so-called 'interrogations' of the digitized Prize papers. The Prize papers are a vast and valuable collection of judicial documents, private and official letters, shipping lists, and ledgers covering the period between 1652 and 1815. The 'interrogations' provide detailed information on ships, crew members, and cargo. Our session today shows how handwriting recognition services such as MONK can help to make collections such as the digitized Prize papers searchable.

1. Choose your login

3. Exercise I - (try to transcribe 10 or more lines)

The aim of this exercise is to get acquainted with MONK's word labelling mechanism. Labelling helps to build an index and a search engine for a collection of digitized handwritten documents. I'll explain on the main screen how the labelling works.

Some basic transcription rules:

    • Don't label what you can't read!
    • Label what you see, even if the spelling is wrong!
    • If there are two words in the grey box use an underscore: this_ship
    • Sometimes the first line will be invisible. If that is the case jump to the next line.
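
For those who like to see rules written down precisely, here is a minimal sketch of the first three rules in Python. The function name format_label is hypothetical and is not part of MONK; it only illustrates what a label typed into the interface should look like.

```python
from typing import Optional


def format_label(seen_text: str) -> Optional[str]:
    """Turn what you read in a grey box into a label string (hypothetical helper)."""
    words = seen_text.split()  # split on whitespace, ignoring extra spaces
    if not words:
        # Nothing readable: do not label at all.
        return None
    # Keep spelling and capitalisation exactly as seen; only join words with underscores.
    return "_".join(words)


print(format_label("this ship"))  # this_ship
print(format_label("answereth"))  # answereth (old spelling is kept)
print(format_label("   "))        # None (nothing to label)
```

Note that the sketch never corrects spelling: the label follows the manuscript, not the modern dictionary.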


Here is a short recap of how the labelling works:

  1. To start this exercise, click on the link provided below for your user name. Each user name has its own link.
  2. Click on one of the yellow [R] buttons.
  3. This brings you to the word labelling interface.
  4. Click with the left mouse button on the white space in front of a word. MONK will start searching for word zones (grey boxes).
  5. Find the right word zone and then click on the word.
  6. Fill in or correct the label.
  7. Push save label and continue.
  8. To speed up the labelling process, you can keep the CTRL key pressed and then use the mouse to segment words by adding multiple bars. Don't forget to place the last bar after the last word in a line. Basically you can't do anything wrong, so just try and play with it.
  • For Apple users: the CTRL key option won't work for you. I'll show on screen how it works on a Mac.


prize01: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&dispmode=web-grey&ipage=114&cmd=next_empty&key=*&pagemode=broken&mxp=block

prize02: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&ipage=_119&cmd=numbered_page_broken&pagemode=broken&dispmode=web-grey&sidebyside=&mxp=block

prize03: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&dispmode=web-grey&ipage=120&cmd=next_empty&key=*&pagemode=broken&mxp=block

prize04: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&ipage=_121&cmd=numbered_page_broken&pagemode=broken&dispmode=web-grey&sidebyside=&mxp=block

prize05: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&dispmode=web-grey&ipage=124&cmd=next_empty&key=*&pagemode=broken&mxp=block

prize06: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&dispmode=web-grey&ipage=125&cmd=next_empty&key=*&pagemode=broken&mxp=block

prize07: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&dispmode=web-grey&ipage=127&cmd=next_empty&key=*&pagemode=broken&mxp=block

prize08: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&dispmode=web-grey&ipage=128&cmd=next_empty&key=*&pagemode=broken&mxp=block

prize09: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&dispmode=web-grey&ipage=129&cmd=next_empty&key=*&pagemode=broken&mxp=block

prize10: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&dispmode=web-grey&ipage=130&cmd=next_empty&key=*&pagemode=broken&mxp=block

prize11: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&ipage=_131&cmd=numbered_page_broken&pagemode=broken&dispmode=web-grey&sidebyside=&mxp=block

prize12: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&ipage=_132&cmd=numbered_page_broken&pagemode=broken&dispmode=web-grey&sidebyside=&mxp=block

prize13: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&ipage=_133&cmd=numbered_page_broken&pagemode=broken&dispmode=web-grey&sidebyside=&mxp=block

prize14: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&ipage=_135&cmd=numbered_page_broken&pagemode=broken&dispmode=web-grey&sidebyside=&mxp=block

prize15: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&ipage=_136&cmd=numbered_page_broken&pagemode=broken&dispmode=web-grey&sidebyside=&mxp=block

prize16: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&ipage=_137&cmd=numbered_page_broken&pagemode=broken&dispmode=web-grey&sidebyside=&mxp=block

prize17: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&ipage=_139&cmd=numbered_page_broken&pagemode=broken&dispmode=web-grey&sidebyside=&mxp=block

prize18: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&dispmode=web-grey&ipage=141&cmd=next_empty&key=*&pagemode=broken&mxp=block

prize19: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&dispmode=web-grey&ipage=142&cmd=next_empty&key=*&pagemode=broken&mxp=block

prize20: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&ipage=_143&cmd=numbered_page_broken&pagemode=broken&dispmode=web-grey&sidebyside=&mxp=block

prize21: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&dispmode=web-grey&ipage=144&cmd=next_empty&key=*&pagemode=broken&mxp=block

prize22: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&dispmode=web-grey&ipage=145&cmd=next_empty&key=*&pagemode=broken&mxp=block

prize23: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&dispmode=web-grey&ipage=146&cmd=next_empty&key=*&pagemode=broken&mxp=block

prize24: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&ipage=_148&cmd=numbered_page_broken&pagemode=broken&dispmode=web-grey&sidebyside=&mxp=block

prize25: https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&ipage=_158&cmd=numbered_page_broken&pagemode=broken&dispmode=web-grey&sidebyside=&mxp=block

prize26: tbd

prize27: tbd

prize28: tbd

prize29: tbd

prize30: tbd
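
If you are curious which page your link points to, the query string of each link can be read with Python's standard library. The sketch below only reports the parameter names that appear in the links above (db, ipage, cmd). Treating "_119" and "119" as the same page number is an assumption made for readability, and the function name describe_monk_link is hypothetical.

```python
from urllib.parse import parse_qs, urlsplit


def describe_monk_link(url: str) -> dict:
    """Extract a few query parameters from a workshop link (hypothetical helper)."""
    # keep_blank_values=True keeps empty parameters such as "sidebyside=".
    query = parse_qs(urlsplit(url).query, keep_blank_values=True)

    def first(name: str) -> str:
        # parse_qs returns a list per parameter; take the first value or "".
        return query.get(name, [""])[0]

    return {
        "db": first("db"),
        # Assumption: a leading underscore in ipage is only a prefix.
        "page": first("ipage").lstrip("_"),
        "cmd": first("cmd"),
    }


prize01 = ("https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&dispmode=web-grey"
           "&ipage=114&cmd=next_empty&key=*&pagemode=broken&mxp=block")
prize02 = ("https://monk.hpc.rug.nl/cgi-bin/monkweb?db=99900001&ipage=_119"
           "&cmd=numbered_page_broken&pagemode=broken&dispmode=web-grey"
           "&sidebyside=&mxp=block")

print(describe_monk_link(prize01))
print(describe_monk_link(prize02))
```

All links share the same db value, so every participant works on the same collection but starts on a different page.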

Output

This results in a clickable index!
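
To give an idea of what an index is, here is a toy sketch in Python. It is not how MONK stores its index internally. The sample labels (word, page, line) are invented for illustration and the function name build_index is hypothetical. The idea is simply that every label you save becomes an entry pointing back to the place where the word occurs.

```python
from collections import defaultdict


def build_index(labels):
    """Group (word, page, line) labels into a word -> sorted locations mapping."""
    index = defaultdict(list)
    for word, page, line in labels:
        index[word].append((page, line))
    # Sort words alphabetically and locations by page, then line.
    return {word: sorted(places) for word, places in sorted(index.items())}


# Invented sample labels, for illustration only.
sample_labels = [
    ("ship", 114, 3),
    ("anchor", 114, 1),
    ("ship", 120, 7),
]

for word, places in build_index(sample_labels).items():
    print(word, places)
```

In a clickable index, each location would be a link that opens the page with the word highlighted.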


4. Exercise II: Train with hitlists

MONK actively learns from user input. As soon as a word has been labelled several times, MONK starts looking for similar-looking word shapes. Words in green boxes have already been labelled. Words in red boxes are still waiting for a label from you. Start with the two words listed below for your user name; if you spot other words in red boxes, you can label them as well.

Example: Account

prize01: Ammunition and anchor

prize02: answereth and attempted

prize03: Britannick and Board

prize04: Brief and been

prize05: Charles and Delaware

prize06: Eleventh and England

prize07: Eight and Families

prize08: Francois and Dunkirk

prize09: Freeman and Guns

prize10: Instruments and James

prize11: John and Lading

prize12: Mariner and November

prize13: Nantes and Ninth

prize14: Sloop and South

prize15: Saith and Sworn

prize16: Year and Zeeland

prize17: examinant and destroyed

prize18: goods and onboard

prize19: owners and part

prize20: thrown and thence

prize21: them and third

prize22: hired and one

prize23: day and France

prize24: commanded and called

prize25: attempted and West

prize26:

prize27:

prize28:

prize29:

prize30:


If you need additional examples, check out the list of trained words: https://monk.hpc.rug.nl/cgi-bin/monkweb?cmd=TrainedWords&annot=all&sortopt=sorted_name&sortorder=normal&trainedwordmethod=Workshop&db=99900001&dispmode=quick&prefix=&begin=0