Efficient data analysis with data.table

By Paola Corrales & Elio Campitelli in Workshop Education English

July 8, 2024

At its core, data.table provides an enhanced version of data.frames that are faster, more memory efficient and can be manipulated using a more concise syntax. It also provides a whole set of extra functions for reading from and writing to tabular files, reshaping data between long and wide formats, joining datasets and much more.

This three-hour tutorial will introduce participants to data.table’s basics. Through live coding sessions and hands-on exercises, we will learn how to use data.table as part of a data analysis pipeline; from reading data into memory to writing the results back, including exploration, data manipulation and joins. The tutorial will also lay the foundations for learning more advanced features, such as special symbols and combined operations.

The website includes webr code blokes to play around with the exercises. The packages and the data set are preloaded. You can also check the sandbox.