Skip to main contentResearchEquals badge, showcasing an R and an equals sign.

Synthetic identifying information for 100,000 individuals: A pseudo-population

Chris Hartgerink, Richard Klein

This is a principal dataset of synthetic identifying information. We created this synthetic dataset to test the precision of later stage mechanisms to retrieve identifying information. The dataset contains 100,000 fake individuals, which can serve as a pseudo-population to sample from. For details on how this data is generated, please view the supporting documentation.