After collecting data, ecologists use statistical tests to infer information about samples or populations. In this lab, we will cover how to use one of these tests by examining data from an experiment that explored if penguins were able to use scent to find fish (their primary source of prey).
This lab will also demonstrate an example of the complicated relationships that make up food webs. It is not always as simple as one organism consuming another; sometimes these tropic dynamics can be indirect and have multiple levels, such as the one shown here.
Students should be able to:
Understand the concept of statistical significance and interpret the meaning of a p-value
Use given biological information in addition to statistical evidence to aid in drawing conclusions about the differences between groups
Since the measurements ecologists collect exhibit variation due to both differences in the real world and sampling error, common questions focus on determining the true distribution of a parameter (a trait we are measuring) in a given population. Often times we are focused on the average value for a given trait, but understanding the spread and shape may also be important. For example, we know that there is likely some variation in "normal" human body temperature and that thermometers may not always be accurate, but using data collection and statistics we have come to find the average human body temperature is 98.6 degrees Fahrenheit under typical conditions. We also observe that most healthy people should be close (within 1-2 degrees) of that value.
Statistics is a broad field of data analysis methods that allow us to use data to make inferences about the distribution of a focal parameter. Using statistics, we can answer any number of questions about the natural world by looking for signals that are related the parameter of interest (like the average, or mean, value) that may be hidden in data among the noise, or variation we always observe. When the same traits are measured in different groups or under different conditions, ecologists can also use statistics to consider if groups actually differ in some way or if trait values are impacted by external factors.
Statistical analysis connects to experimental design through the focus on a a null hypothesis (H0). This is a hypothesis of no difference. Depending on what you are focused on, it may state no difference between two values, no difference among groups, or no relationship exists between two measured variables. While starting with a hypothesis of no difference may seem odd, it means we have to collect data to justify new accepted values, differences among groups, or important relationships.
For example, assume someone told you that the normal human body temperature was really 102 degrees Fahrenheit. Would you believe them? Probably not, but we could use data collection and statistics to actually test that assertion. In this case, our null hypothesis might be that the true average human body temperature is 98.6 degrees, since that is the generally accepted value. Notice we are testing the average temperature here! This is often the signal we are focused on. This means we realize human body temperature may vary among individuals, but we expect the average normal human body temperature is 98.6. An alternative hypothesis (HA ) is that the true average human body temperature is not equal to 98.6 degrees (for example, it might be 102!). Statistical analysis allows us to see whether or not we have the evidence to reject the null hypothesis.
The scientific process usually requires a fair amount of evidence to reject a null hypothesis. That's because we know variation exists in our measurements. For example, if you found the temperature of one person to be equal to 101 degrees, would you immediately decide the true average "normal" human body temperature isn't 98.6? Probably not, because you know if we measured the temperature of 100 people we might expect some different results (maybe ranging from 98 to 100 degrees) based on factors like activity or health level, base metabolism, and error introduced by the measurement device. However, if we collect many temperatures and most of them were closer to a number other than 98.6 (like 102!), we might begin to suspect our null hypothesis was not correct (or that our thermometer is broken!). Statistics offers a variety of approaches to use the data we collect as evidence to test the null hypothesis. The ability to test hypotheses is a key part of the scientific method.
We often describe the strength of the evidence against a null hypothesis using a p-value. The p-value considers both the signal and noise we observed in the collected data. It does this by considering how likely we are to observe the signal we saw in our sample if the null hypothesis is true and we sampled the noisy population multiple times. Since we are assuming the null hypothesis is true, the p-value is just the probability of observing the signal we found in the data by chance. As a result, a high p-value means we likely observed the signal we found in our data just by chance. However, if the p-value is low, it means we were unlikely to observe the signal we found by chance if the null hypothesis was true, thus we have evidence against the null hypothesis. This is what is known as significance in statistics.
Scientists typically consider a p-value of less than .05 (<0.05) to be significant evidence against the null hypothesis. This means there is less than a 5% chance that the signal we observed has arisen due to chance under the null hypothesis. If you run a statistical test and end up with a p-value that is <0.05, you can reject the null hypothesis.
One major source of questions in ecology is how groups may differ from one another, whether that may be between individuals, species, or geographic areas. Multiple tests exist to consider this question depending on the data type and number of groups. Each of these tests starts with a null hypothesis that states there is no difference among the groups. If the test returns a p-value of <0.05, you can reject the null hypothesis of no difference. This means there is a significant difference among the groups.
In this lab, you will analyze a dataset that compares the ability to sense a smell associated with food in two different age groups of king penguins (adults and chicks).
Read the background information below and answer the questions interspersed throughout as you go.
King penguins (Aptentodytes patagonicus) are a species of seabird that can be found at sea or on various islands surrounding the Antarctic. These birds can forage for food broadly, sometimes more than 400 km from the beaches where they raise their young (Bost et al., 2002). Adult penguins spend most of their time foraging at sea, but when breeding or feeding their young they will spend periods of time onshore.
King penguin chicks usually hatch between January and March. Once they have hatched, parents feed their young to prepare them for the coming winter. In the winter months (~June - Septembers since we are thinking about the southern hemisphere), chicks undergo what is known as a creching period, where they fast and cluster together on the beach to withstand the cold. Once this is over, chicks get fed by their parents again from September to December, reaching maturity in the creche. When they are ready, the older chicks will join their parents on their first foraging trips (see Descamps et al., 2002).
The prey of penguins is often difficult to find since the ocean is very large and resources are unevenly distributed. Penguins, like other predators, must seek out the productive patches where their prey is. These discrete patches of productivity arise due to variation in abiotic factors such as temperature, nutrient availability, and the patterns of the currents.
It is still not clear how penguins are able to find these patches where food is abundant, but one possible signal is the scent of an airborne gas, dimethyl sulfide (DMS). This gas is what is known as an infochemical, which means any chemical cue that tells organisms something about their environment. DMS is derived from dimethylsulfoniopropionate (DMSP), which is associated with the presence of phytoplankton (plant plankton) that make up the basis of the marine food web (Raina et al., 2013). DMSP gets converted into DMS when phytoplankton are being preyed upon by organisms such as fish, which may be a prey source for penguins. DMS releases into the air above the phytoplankton patch may thus serve as an airborne indicator of penguin prey location (Berresheim et al.,1989).
In this experiment, both adult penguins and their chicks were studied to see if they could detect and respond to DMS. Birds were studied when they were asleep to see if they would wake up if they smelled the signal that prey was near (DMS). Each age group (adult and chick) were divided into two treatments: one would be exposed to the scent of DMS (DMS dissolved in vegetable oil), and the other would be exposed to a control odor (vegetable oil alone). The birds' response to the odor was ranked on a scale of 0-2, with 0 being no response and 2 meaning that they were woken up.
You will now determine whether penguins respond the presence of DMS. We will evaluate chicks and adults separately (Note: if you've take another statistics course, you may recognize another approach could be used to assess this data).
First, complete the following steps for the adult data.
1. Given the biological information above, create a null hypothesis about the relationship between king penguins and DMS that you expect to be able to observe and test using the experimental data. Your hypothesis should focus on what you expect to see when comparing the responses of the two groups (the one exposed to DMS and the one exposed to the control) and consider the average response to DMS in the two groups.
2. Then create corresponding alternative hypotheses.
Now open the dataset below (titled "Sleeping") in either Excel or Google Sheets. Note is has two tabs, one with adult and with chick data.
3. First, visualize the data for each treatment group. What type of graph would you use to visualize this data (examples include scatterplots, bar graphs, etc.)? Explain your choice and make the chart(s) (hint: ask your instructor about having multiple histograms on one chart or check out the Data Summaries in Google Sheets page. ) You can make one chart for the adult data and one for the chick data.
4. Calculate the summary statistics for each treatment group (mean, sample size, standard deviation, and standard error). These are useful numerical summaries of the data.
Use MSExcel/Google Sheets formulas to do this. Use =AVERAGE() and=STDEV() for computing the mean and standard deviation respectively. To calculate your sample size (n), you can either count up your data points or use =COUNT().
MSExcel and Google Sheets do not have formulas for standard error (SE), so you will need to create your own. SE = SD√n, in MSExcel you can use =()^0.5 or =SQRT() to calculate square root, your formula then becomes SE = SD/(n^0.5) or SE=SD/SQRT(n). Enter your formula in the SE cell =SD/(n^0.5).
You can also do this ussing pivot tables! See Data Summaries in Google Sheets .
5. Compile a table in your lab report of the summary statistics of your data like so:
6. What do each of these summary statistics tell us about our data?
7. Next, you will graph the average of the DMS and Control treatments (hint: ask your instructor about creating bar charts or check out the Data Summaries in Google Sheets page. )
8. Look at your descriptive statistics and the figure you created. Do you think there is a significant difference visually between the Control and DMS treatments?
Now you will see if there is a numerically significant difference between the treatments by doing a statistical test.
Because the data above is comprised of ranking the extent of the response of each individual from 0-2, it does not meet the usual assumptions of normality required for tests that you may have encountered before (a t-test). Therefore, you will be using a special kind of test designed for this type of data known as a Mann-Whitney U-test to compare the means between groups.
The steps to do this are as follows:
You will be using a free online calculator of sorts for t-tests and other statistical analyses found here: http://vassarstats.net
When you get to the main page, click on the tab in the menu box on the left labeled "Ordinal Data". Then choose the option for "Mann-Whitney Test"
A pop-up window will appear for you to enter your sample size for each treatment. The Control treatment is Sample A and the DMS treatment is Sample B. Use the sample size (n) you calculated above.
You will copy and paste each column of data for the Control treatments in the box for Sample A and the data from the DMS treatments in the box for Sample B (the numbers pictured below are fake data to show you how the entry will look--do not use them! Only use the data in the Sleeping spreadsheet).
Once you have your data in the boxes, click “Import data into data cells”. This will autofill the “Raw Data for…” in the “Data Entry” heading.
Then, click the “Calculate from Raw Data” button at the bottom of your autofilled Data. It will give you a series of different outputs. The output we will be using in this exercise is the p(2) value; this corresponds a null hypothesis that states the two groups are equal to each other.
9. What is the p-value for the test? Is there a significant difference between the groups exposed to DMS and those exposed to the Control?
Now complete steps 1 - 9 for the chick data before you answer the following questions!
10. Based on your statistics, are adult King penguins able to detect DMS? What makes you think this?
11. Are baby King penguins able to detect DMS? How do you know?
12. Does your analysis suggest there is a difference between the responses of adults versus chicks to DMS? Why or why not?
13. Are there differences between adults and young penguins in how they respond to a scent indicating prey? Why do you think this might be?
This lab is adapted from:
Kaitlin M. Bonner and Gregory B. Cunningham. March 2018, posting date. The nose knows: How tri-trophic interactions and natural history shape bird foraging behavior. Teaching Issues and Experiments in Ecology, Vol. 13: Practice #8 [online]. http://tiee.esa.org/vol/v13/issues/data_sets/bonner/abstract.html with permission.
Berresheim, H., M.O. Andreae, G.P. Ayers, and R.W. Gillett. 1989. Distribution of biogenic sulfur-compounds in the remote southern-hemisphere. Pp. 352–356 in Saltzman, E. J. and Cooper, W. J. (eds.), Biogenic sulfur in the environment. American Chemical Society.
Bonner, K.M. and Cunningham, G. B.. March 2018, posting date. The nose knows: How tri-trophic interactions and natural history shape bird foraging behavior. Teaching Issues and Experiments in Ecology, Vol. 13: Practice #8 [online]. http://tiee.esa.org/vol/v13/issues/data_sets/bonner/abstract.html.
Bost, C.A., T. Zorn, Y. Le Maho, and G. Duhamel. 2002. Feeding of diving predators and diel vertical migration of prey: king penguins’ diet versus trawl sampling at Kerguelen Islands. Marine Ecology Progress Series 227:51–61.
Descamps, S., M. Gauthier-Clerc, J.-P. Gendner, and Y. L. Maho. 2002. The annual breeding cycle of unbanded king penguins Aptenodytes patagonicus on Possession Island (Crozet). Avian Science 2:87-98.
Raina, J.B., D.M. Tapiolas, S. Forêt, A. Lutz, D. Abrego, J. Ceh, F.O. Seneca, P.L. Clode, D.G. Bourne, B.L. Willis, and C.A. Motti. 2013. DMSP biosynthesis by an animal and its role in coral thermal stress response. Nature 502:677–680.