Confidentiality guarantees of a new method to generate synthetic data

Student

Ariane Boivin

Research supervisor(s)

Anne-Sophie Charest

Start date

2022-09-12

Title of the research project

Confidentiality guarantees of a new method to generate synthetic data

Description

It is often difficult, and sometimes impossible, to share de-identified data between organisations and researchers because of ethical constraints on participant confidentiality. Synthetic datasets could facilitate data sharing. However, many current methods, which rely on multiple imputation (MI) techniques developed for missing data, reduce the analytical potential of the data and the quality of the results.

This project therefore aims to assess the confidentiality guarantees of a promising new data synthesis method. This method adds a data masking step to a multiple imputation technique, so that synthetic data are generated based on the risk of each observation. In particular, the project will test attribute disclosure risks, which refer to the disclosure of certain attributes based on other, known ones.
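
To make the notion of attribute disclosure concrete, here is a minimal sketch of one common way such a risk could be estimated. It is not the method studied in this project. It assumes an intruder who knows a few attributes of a person (the hypothetical `known_keys`), looks up synthetic records with the same values, and guesses the sensitive attribute (the hypothetical `target`) by majority vote. The function name, the column names and the toy records are all illustrative assumptions.

```python
from collections import Counter, defaultdict


def attribute_disclosure_rate(original, synthetic, known_keys, target):
    """Share of original records whose `target` value would be guessed
    correctly by majority vote among synthetic records that match on
    `known_keys`. Records with no synthetic match count as not disclosed."""
    # Tally the target values seen in the synthetic data for each key combination.
    votes = defaultdict(Counter)
    for row in synthetic:
        key = tuple(row[k] for k in known_keys)
        votes[key][row[target]] += 1

    correct = 0
    for row in original:
        key = tuple(row[k] for k in known_keys)
        if key in votes:
            guess, _ = votes[key].most_common(1)[0]
            if guess == row[target]:
                correct += 1
    return correct / len(original) if original else 0.0


# Toy, made-up records (illustrative only, not real data).
original = [
    {"age": "30-39", "region": "A", "income": "high"},
    {"age": "40-49", "region": "B", "income": "low"},
    {"age": "30-39", "region": "B", "income": "low"},
]
synthetic = [
    {"age": "30-39", "region": "A", "income": "high"},
    {"age": "30-39", "region": "A", "income": "high"},
    {"age": "40-49", "region": "B", "income": "high"},
]

rate = attribute_disclosure_rate(original, synthetic, ["age", "region"], "income")
```

A higher rate suggests that the synthetic data reveal more about the sensitive attribute. In practice the rate would need to be compared with a baseline, such as always guessing the most frequent value, before drawing any conclusion.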

The feasibility and quality of the results will be tested on a dataset provided by the Institut de la statistique du Québec.
