Big data preparation with Spark

Presented by
Online

Training activities

"Data Scientist" undoubtedly holds the title of hottest profession of the early 21st century. Machine learning, big data analytics, storytelling and data visualization are among the most sought-after skills in almost every industry, and all are "must-haves" for succeeding in this increasingly competitive sector of the job market. But in the glamorous world of data science, one key aspect of the job, the one where data scientists actually spend most of their time and effort, gets far less attention: preparing data so that the skills above can be put to use.

This workshop, given by Calcul Québec, reveals this hidden side of the profession and introduces participants to Apache Spark, one of the most widely used big data processing technologies across industries. The format is interactive: you will learn by doing!



Featured project

Student member: Gabriel Couture

This project aims to establish good practices in health data management and to build the software infrastructure needed to apply them.

We have developed pipelines that retrieve brachytherapy treatment data daily, then calculate the dosimetric indices and store them in a database dedicated to research. These indices are essential for planning radiotherapy treatments and for estimating their quality.
