Consider these prompts to determine if data is reliable:
- Is this the original source of data? Whenever possible, use data from its original source.
- Is there information describing the structure and meaning of the data? Does the data meet the research need based on the information provided?
- Is there information describing the data collection and processing strategies?
- Were best practices followed for data collection? (such as number of subjects and how they were selected; how variables were measured; and data that is generalizable)
- Were best practices followed for data processing? (such as acceptable handling of outliers and missing data; and combining categories)
- Are the names and contact information of the data creators present?
- Is the data collection or analysis tied to a financial, political, or social interest? If so, can you identify measures taken to ensure ethical compliance.
- Does the data have a use license?
- If the data includes sensitive information, are proper protocols in place for viewing the data (use agreements)?
References
May, Kay P. (2025). Statistics & Data for the Social Sciences – Tips for Locating Data – Determining Data Trustworthiness. Tulane University. Retrieved April 10, 2025 from https://libguides.tulane.edu/socialsciencedata.
Liu, G. (2024). Data Quality Literacy: Knowledge Briefs. Institute of Museum and Library Services (IMLS) Grant Project. Retrieved April 10, 2025 from https://www.dataqualityliteracy.org/knowledge-briefs
__________
Also see the following videos (sign in with your Haverford credentials):
Assess the quality of data (5 min video)
Assess the generalizability of data (7 min video)
(both from LinkedIn Learning. Data Literacy: Exploring and Describing Data. Accessed Jan 29, 2025.)