Research supervisor(s)
Anne-Sophie Charest
Start date
Title of the research project
Statistical Analysis of Synthetic Data Sets Satisfying Differential Privacy
Description

Data sharing is often limited by privacy concerns. This is particularly common for health datasets, given the inherent sensitivity of this type of data. When the original dataset cannot be shared, one option is to generate a synthetic dataset that retains as much of the original dataset's statistical information as possible but describes fictitious individuals, thereby protecting the confidentiality of respondents.

This project aims to rigorously measure the confidentiality protection offered by a synthetic dataset. We will carefully examine some of the measures proposed in the literature to understand their guarantees, as well as the differences and similarities between them, in order to identify the measure(s) most relevant to the sharing of synthetic data.
