This page describes the test corpora and metrics used to evaluate our automatic enjambment detection system. To see the results tables directly, visit [this page].
We evaluated the system on two test corpora that we manually annotated for enjambment:
In each corpus, we found approximately 275 enjambed line pairs.
We included a 20th-century corpus with natural syntax so that we could compare its results with those on sonnets from earlier periods. The earlier sonnets can contain archaic language and often use an elevated register, both of which are expected to be harder for an NLP pipeline to analyze correctly.
As regards inter-annotator agreement, we obtained annotations from two annotators for half of the reference set. The overlap between the two annotators' results was high.
We performed both a typed and an untyped enjambment detection evaluation:
Precision, recall and F1 were calculated against each of the test corpora. See the results tables.
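As an illustration of how precision, recall and F1 could be computed for the typed and untyped evaluations, here is a minimal Python sketch. It is not the project's actual evaluation code. It assumes that annotations are stored as a mapping from a line-pair index to an enjambment type label, that the untyped evaluation only checks whether an enjambment was detected at the right line pair, and that the typed evaluation additionally requires the type label to match. The function name `prf` and the type labels in the example are hypothetical.

```python
from typing import Dict, Tuple

# Hypothetical representation (an assumption, not the project's format):
# index of the first line in an enjambed line pair -> enjambment type label.
Annotations = Dict[int, str]


def prf(gold: Annotations, pred: Annotations, typed: bool) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) of predictions against the reference.

    typed=False: a prediction is correct if the line pair is enjambed in gold.
    typed=True:  the predicted type label must also equal the gold label.
    """
    if typed:
        true_pos = sum(1 for pair, label in pred.items() if gold.get(pair) == label)
    else:
        true_pos = sum(1 for pair in pred if pair in gold)

    precision = true_pos / len(pred) if pred else 0.0
    recall = true_pos / len(gold) if gold else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Toy data with made-up labels, for illustration only.
gold = {3: "adj_noun", 7: "subj_verb", 12: "verb_obj", 20: "adj_noun"}
pred = {3: "adj_noun", 7: "verb_obj", 12: "verb_obj", 15: "adj_noun"}

untyped_scores = prf(gold, pred, typed=False)  # pairs 3, 7, 12 count as correct
typed_scores = prf(gold, pred, typed=True)     # only pairs 3 and 12 count as correct
```

In this toy example the prediction at line pair 7 is counted as correct in the untyped evaluation but as an error in the typed one, which is why typed scores can never exceed untyped scores under these assumptions.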