Data science applications require data by definition. However, data is messy. It can appears in many different forms and it can be interpreted in many different ways. Before you can do any data science work, you must discover how to access the data in many forms.