Demographic Labels:
A Worksheet for Researchers
A thinking guide for researchers and data curators
Written by the CHIRON Project Team
Published on May 9, 2024
A thinking guide for researchers and data curators
Written by the CHIRON Project Team
Published on May 9, 2024
Navigating competing conventions around labeling can be challenging. Ultimately, critical self-reflection can inform data curators and analysts:
For instance, perhaps patients in your dataset are grouped exclusively as male or female. Consider how this schema affects stigma or usability of health
research by gender expansive people.
Researchers using secondary data may feel constrained by the labeling schema used in the dataset(s), but there may be methodologically sound ways to work with the data in a way that is more consistent with updated recommendations.
For instance, there may be reasons to aggregate or disaggregate cis and trans women, depending on whether factors like social oppression, exogenous hormone use, or menarche are germane to your research question.
For instance, NASEM’s recent guidelines on population descriptors recommend that “Researchers conducting human genetics studies should directly evaluate the environmental factors or exposures that are of potential relevance to their studies, rather than rely on population descriptors as proxies” (emphasis added). If a more precise set of descriptors can be used rather than demographic categories, use those descriptors.
If you plan to publish your work, consider how you may thoughtfully articulate the constraints of your labeling schema, especially when working with secondary data.
Health research can provide the foundation for clinical care, drug development, public policy, and allocation of resources. Researchers should strive to categorize groups in ways that reflect those groups’ values and enable them to seek relevant care.