Policy topic list - see excel spreadsheet of terms
64224 references extracted from 1000 policy articles via API
43471 references extracted from 1000 policy articles via Ref template
So nearly 20,000 references (around 1/3 aren't using a template.
Data cleaning
Author field from Author 6 removed - it went to 191 authors based on one outlier reference.
Title type field removed: Only content "-1"