The course « Initiation to R » aims to present the programming language R, in particular data import into the R environment, manipulation of data and data export. Data description using baseline statistical functions, most commonly used statistical tests and graphs will also be addressed
R is an open source software and a programming language that gives its users access to a large number of statistical analyses.
Participants must be familiar with Windows and Excel. However, no previous experience or knowledge in programming (either in R or in any other language) is necessary. A strong interest in script programming is compulsory.
At the end of the course, participants will be able to monitor their data (import, control consistency, create variables, export…) and explore their data (data description, statistical tests, and graphical representations). Basic statistical tests will be introduced to compare continuous variables (student t-test, analysis of variance, Wilcoxon non-parametric tests) and to compare discrete/categorical variables (chi-2 test). Participants will also gain knowledge to continue self-exploration of the R environment and read R programmes, in order to perform more complex statistical analyses, and to better interact with statisticians.
During this training, many practical sessions are organized on real databases in order to implement the concepts discussed.
Applications are evaluated by the Course Committee
Unité de Recherche et d'Expertise Epidémiologie des Maladies Emergentes
Groupe à 5 ans Génétique Statistique
Y. Madec (Unité d’Epidemiologie des Maladies Emergentes, Institut Pasteur),
S. Malot (Institut Pasteur),
V. Ponticelli (Institut Pasteur),
V. Rouilly (independent Data Scientist Consultant, Datactix founder.),
M. Sala (Institut Pasteur),
D. Tavel (Institut Pasteur),
H. Waxin (Institut Pasteur).