A selection of statistical graphics and a data visualisation I've created as a scientist
A selection of statistical graphics and a data visualisation I've created as a scientist
Taylor et al. Identity-by-descent with uncertainty characterises connectivity of Plasmodium falciparum populations on the Colombian-Pacific coast. PLoS Genetics. 2020 Nov 16;16(11)
Fractions of highly-related malaria parasite sample pairs grouped by collection site.
Malaria parasite population connectivity. Edge width is proportional to the fraction of highly-related malaria parasite sample pairs. Buenaventura and Tumaco, the two largest ports on the Colombian-Pacific coast, are connected by frequent marine traffic www.marinetraffic.com
Groups of two or more statistically indistinguishable malaria parasite samples (clonal components, CCs) and the average relatedness between them. CCs are numbered in order of the dates of the earliest sample collection per CC, and are coloured by sample collection city with size proportional to the total sample count per CC, ranging from two to 28. Edge transparency and weight is proportional to inter-CC average relatedness, ranging from zero to 0.84.
Taylor and Watson (joint first authors) et al. Resolving the cause of recurrent Plasmodium vivax malaria probabilistically. Nature Communications. 2019 Dec 6;10(1)
This plot was generated to visually explore malaria parasite (P. vivax) genetic data that feature in the above-cited article.
It depicts data on seven microsatellite markers (heat map columns) and 432 P. vivax malaria episodes (heat map rows), which are grouped by person ID (grayscale columns). Each marker has its own colour scheme (legend, respective column) that depicts the rank/s of the observed allele/s, ordered from low to high repeat length (not all repeat lengths were observed). Multiple colours within a cell indicate the detection of multiple alleles per episode per microsatellite, which is evidence of a polyclonal clonal infection or a genotyping error. White represents missing data.