Resources

In this section, I suggest some textbooks and other resources particularly valuable for performing data analysis, data manipulation, and statistics.

Important resources

Learning Python

Python has more less become mandatory in the current data science landscape. As a result, many scientists rely on python scripts and notebooks to perform their work. Via collaboration and through colleagues, everyone will at one point or another be exposed to Python, hence the need to know, at the least, the basic.

Fortunately many resources are available. I would like to highlight:


Learning FORTRAN

Many models and tools were developed with FORTRAN. For example, models such as the Soil and Water Assessment Tool (SWAT), H08, CaMaFlood, and more recently AquaCrop were all  developed with FORTRAN. A basic understanding of the language is therefore very useful for reading their source codes and eventually for improving them. The short tutorial is straight forward and very effective.


The Generic Mapping Tools (GMT)

Learning the Generic Mapping Tools (GMT) is probably not as critical as it used to be since Python, R, and other programming languages now feature comprehensive libraries for displaying global maps. However, I notice that some of these libraries are not completely mature yet and can be rather slow for displaying high resolution dataset. As a result, learning GMT may still valuable and I would like to showcase some resources I found online: 

Suggested textbooks

R for Data Science (2e)

This is an excellent textbook that both provides i) a general and gently introduction to R and ii) a robust explanation of data science. The topics introduced feature lot's of concrete examples with plenty of code to learn from. Furthermore, the topics covered are well selected as they built on one another. As a result, the reader proficiency with R clearly expend after each reading sessions. 


An Introduction to Statistical Learning (available for R and python) 

This textbook provides an excellent introduction or refresher on statistics. Importantly, proofs are rarely provided and the textbook rather focus on concretely applying concept to real data. Some sections are a bit lengthy but the fist chapters are definitely a must read.


Python for Data Analysis (3e)

Nowadays there are just so many textbooks available for learning Python. This one is particularly well written and presented while being fairy concise.  As its title suggest, only the basic and core fundamental of python are actually introduce as the book main focus is on data analysis using popular packages.