"Quality is never an accident. It is always the result of intelligent effort." - John Ruskin
This lesson assumes you have completed the Intro to Stata lesson.
In this lesson we will cover why high quality data is important and what can be done to ensure data is of high quality
Bad data can lead to bad results, which lead to misguided recommendations and ultimately bad policy!
We're worried about data that can make our results unreliable--either because they are noisy or biased
We can make our data vulnerable to error before, during and after data collection. Fortunately, there are strategies to reduce the likelihood of (or at least mitigate the consequences of) error at each stage
This dataset was downloaded on February 3rd 2016 after 3 days of data collection. Do we see any problems?