Dubious value added of the 'freq' variable #288

rideofyourlife · 2024-01-15T12:06:38Z

Many datasets, which contain only one frequency available (like namq_10_gdp, sts_inpr_m etc.), were awarded a new variable "freq". I generally understand the idea behind it, but while working on the package it has only proven to often be an unnecessary step of %>% select (-freq) in majority of the code I write.

Does anyone else have similar thoughts?

The text was updated successfully, but these errors were encountered:

pitkant · 2024-01-15T16:50:54Z

This is intentional behaviour but if many people find this annoying you can report it under this issue and we can reconsider.

antaldaniel · 2024-01-15T22:19:57Z

Statistical agencies worldwide have similar standards treating metadata, and metadata in this case is there to avoid unforeseen logical errors when joining or linking data; I think that the freq variable is present when there are similar statistical products or datasets available with the same variables but different frequencies. In that case a joining without frequency adjustment results in a hard to find logical error. The freq variable is the same as the unit variable, you really want to avoid unknowingly divide euros with thousand euros, or multiply annual values in a chain with quarterly values.

rideofyourlife · 2024-01-18T09:31:55Z

Statistical agencies worldwide have similar standards treating metadata, and metadata in this case is there to avoid unforeseen logical errors when joining or linking data;

Well, we are all aware. At least I hope so it is the case.

In that case a joining without frequency adjustment results in a hard to find logical error. The freq variable is the same as the unit variable, you really want to avoid unknowingly divide euros with thousand euros, or multiply annual values in a chain with quarterly values.

This would assume users are somewhat unaware of what they are doing. It seems to me that implementation of this technique is triumph of form over content.

pitkant · 2024-04-29T14:10:40Z

@rideofyourlife I have uploaded some WIP code in v4.1 branch. It enables users to make queries the same way as before but adds an additional parameter legacy.data.output to get_* functions that transforms dimensions names such as TIME_PERIOD and OBS_VALUE to time and values that were used before and removes extra columns such as freq, DATAFLOW and LAST UPDATE altogether.

If you could test this and give some feedback on what you think it would be great!

rideofyourlife · 2024-05-20T13:11:12Z

I have already laboriously replaced "time" with "TIME_PERIOD" in all my codes, so having "time" back is not as essential now as it had been before the recent change. Despite that, where do I use this legacy.data.output? In which function?

pitkant · 2024-05-20T13:28:28Z

I'm sorry for the laborious process. In version 4.1 legacy.data.output = TRUE parameter in get_eurostat() function should return a similar data.frame / tibble as it returned in version 3.8.3 and before.

rideofyourlife · 2024-05-20T13:35:24Z

Ah, yes: it works. It is just not suggested by R Studio while writing for some reason.

pitkant added this to In progress in eurostat 4.1.0 Feb 7, 2024

pitkant mentioned this issue Apr 29, 2024

TIME_PERIOD instead of time #285

Open

pitkant moved this from In progress to Done in eurostat 4.1.0 Apr 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dubious value added of the 'freq' variable #288

Dubious value added of the 'freq' variable #288

rideofyourlife commented Jan 15, 2024 •

edited

pitkant commented Jan 15, 2024

antaldaniel commented Jan 15, 2024

rideofyourlife commented Jan 18, 2024

pitkant commented Apr 29, 2024

rideofyourlife commented May 20, 2024

pitkant commented May 20, 2024

rideofyourlife commented May 20, 2024

Dubious value added of the 'freq' variable #288

Dubious value added of the 'freq' variable #288

Comments

rideofyourlife commented Jan 15, 2024 • edited

pitkant commented Jan 15, 2024

antaldaniel commented Jan 15, 2024

rideofyourlife commented Jan 18, 2024

pitkant commented Apr 29, 2024

rideofyourlife commented May 20, 2024

pitkant commented May 20, 2024

rideofyourlife commented May 20, 2024

rideofyourlife commented Jan 15, 2024 •

edited