What is synthetic data, really? We probe what it means for data to be "real," "generated," or "made up," and how those distinctions shape knowledge claims.
Explore →Five interconnected themes.
Our research programme spans technical, social, and humanistic questions about how synthetic data is made, used, and understood.
Synthetic data inherits — and amplifies — the biases of the data and models that produce it. We study representation gaps and intersectional effects in generated datasets.
Explore →When generative models invent people, identities, or facts, who and what is being misrepresented? We map the systematic errors that fall along intersectional axes.
Explore →We design frameworks for documenting, verifying, and auditing synthetic datasets so downstream users can trust what they are working with.
Explore →From healthcare to public administration, synthetic data is being deployed at speed. We examine the practical, legal, and ethical stakes of those deployments.
Explore →