Statistical Analysis of Synthetic Data Sets Satisfying Differential Privacy

Student

Mathieu Baillargeon

Research supervisor(s)

Anne-Sophie Charest

Start date

2020-01-13

Title of the research project

Statistical Analysis of Synthetic Data Sets Satisfying Differential Privacy

Description

Data sharing is often limited by privacy concerns. This is particularly common for health datasets, given the inherent sensitivity of this type of data. When the original dataset cannot be shared, one option is to generate a synthetic dataset that preserves as much of the statistical information in the original dataset as possible but contains records of fictitious individuals, thereby protecting the confidentiality of respondents.

This project aims to rigorously measure the confidentiality protection offered by a synthetic dataset. We will carefully examine several measures proposed in the literature to understand their guarantees and the differences and similarities between them, in order to identify the measure(s) most relevant to the sharing of synthetic data.
