Intertextuality and Visualization

Visualizing Intertextuality: Text reuse in unstructured corpora

Glenn Roe will address the identification and visualization of text reuse in unstructured corpora. Identifying text reuse is a specific case of the more general problem of sequence alignment; that is, the task of identifying regions of similarity shared by two strings or sequences, often thought of as the longest common substring problem. This technique is widely applied in the field of bioinformatics, where it is used to identify repeated genetic sequences. This talk will outline several different approaches to sequence alignment techniques for humanities research, as well as two recent projects aimed at visualising both the output of alignment comparisons between texts and the alignment process itself using visual analytics.