Setting up a reproducible data analysis project in R - featuring GitHub, {renv}, {targets} and more

Author(s)

Olivia Angelin-Bonnet

Excerpt

A walk-through of my setup for a typical analysis project in R, using best practices.

Date

August 20, 2024

Location

online

Event

Abstrat

This talk is a follow up to last year’s presentation on the relevance of software engineering best practices in statistics and data science. Here, I will go through how I setup a new data analysis project in R, following some of these best practices to ensure reproducibility and reusability. In particular, I will demonstrate how I use GitHub to version control the code, and how I organise a typical analysis directory. I will also showcase some very useful R packages, such as {renv} to document the computational environment and {targets} to turn the scripts into a reproducible analytical pipeline.

Posted on:
August 20, 2024
Length:
1 minute read, 100 words
Categories:
conference best-practices R
See Also: