Block 2: Task 2 -- Dataset Types #6

mbannert · 2020-10-21T12:12:42Z

Find an example for each of the following type of datasets:

time series
cross section
panel data
nested data

and come up with an representation for each dataset in memory, i.e., in an R object AND on disk, i.e., written to a file.

Find = Look it up online. Take it from the FSO, the KOF website or public data providers you work with.
You may also want to simulate / draw data like suggested below:

set.seed(123)
d <- rnorm(1000)

Keep the data in memory (in your R session) and find a suitable format to store them on disk. Play around reading and writing data. Discuss advantages / disadvantages in your group in order to evaluate them in class together.

Make also sure to run some experiments with the ".RData" format (created with save()). What could be the disadvantage of such a flexible format ?

hints: see ts(), xts() from the xts package, data.frame, tibbles (from tidyverse) and data.table as well as lists and the jsonlite package.

hint2: The fivethirtyeight package from Nate Silver's blog 538 is pretty cool. It contains lots of datasets behind 538 stories.

The text was updated successfully, but these errors were encountered:

mbannert added the task student task / exercise label Oct 21, 2020

mbannert changed the title ~~Block 2: Task 1~~ Block 2: Task 2 Oct 19, 2022

mbannert changed the title ~~Block 2: Task 2~~ Block 2: Task 2 -- Dataset Types Oct 19, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Block 2: Task 2 -- Dataset Types #6

Block 2: Task 2 -- Dataset Types #6

mbannert commented Oct 21, 2020 •

edited

Block 2: Task 2 -- Dataset Types #6

Block 2: Task 2 -- Dataset Types #6

Comments

mbannert commented Oct 21, 2020 • edited

mbannert commented Oct 21, 2020 •

edited