Research Process

I started with an extremely broad research question: "What sort of interesting patterns can be found by examining the changes in the text over time?" Understanding the type and nature of these patterns would be an iterative process, with initial findings leading to more detailed areas of inquiry. The most obvious measure for a corpus of the size and nature of PPP was word frequency, so that is what I focused on. After sampling several of the free tools available for text analysis, I settled on Voyant Tools, which I found to be the most scholarly in its range of functions. It counts the frequency of both single words and phrases, and it was quite fast given the size of my corpus.

I used the default stopword list provided by Voyant, with two additions: the words "president" and "administration", which were the first and second most common words in the corpus. I considered excluding a few other very frequent words, such as "united" and "states", but kept them because both might show up in phrases that I would be interested in.
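
For readers who want to see the effect of these stopword choices outside Voyant, the following is a minimal Python sketch, not a description of how Voyant works internally. The short DEFAULT_STOPWORDS set is only an illustrative stand-in for Voyant's much longer default list, and the function names, the sample sentence, and the rule of skipping any phrase that contains a stopword are all assumptions made for the example.

```python
import re
from collections import Counter

# Illustrative stand-in for a default stopword list (assumption: the real
# Voyant list is far longer and is not reproduced here).
DEFAULT_STOPWORDS = {"the", "of", "and", "a", "to", "in", "is", "that",
                     "for", "on", "with", "will"}

# The two corpus-specific additions described in the text.
CUSTOM_STOPWORDS = {"president", "administration"}

STOPWORDS = DEFAULT_STOPWORDS | CUSTOM_STOPWORDS


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def word_frequencies(text):
    """Count single words, ignoring anything in the stopword list."""
    return Counter(t for t in tokenize(text) if t not in STOPWORDS)


def phrase_frequencies(text, n=2):
    """Count n-word phrases, skipping phrases that contain a stopword."""
    tokens = tokenize(text)
    ngrams = zip(*(tokens[i:] for i in range(n)))
    return Counter(
        " ".join(gram)
        for gram in ngrams
        if not any(word in STOPWORDS for word in gram)
    )


# Hypothetical sample sentence, not drawn from the corpus.
SAMPLE = ("The President of the United States said the administration "
          "will work with the United States Congress.")

print(word_frequencies(SAMPLE).most_common(3))
print(phrase_frequencies(SAMPLE).most_common(3))
```

In this sample, "president" and "administration" drop out of the single-word counts, while "united states" is still counted twice as a phrase. Had "united" and "states" been added to the stopword list, that phrase would have disappeared from the phrase counts as well.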