Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

csv / xls(x) reading for more formats #36

Closed
JGuetschow opened this issue Apr 7, 2021 · 3 comments
Closed

csv / xls(x) reading for more formats #36

JGuetschow opened this issue Apr 7, 2021 · 3 comments
Assignees
Labels
enhancement New feature or request priority: medium medium priority issue. Good to solve soon if possible

Comments

@JGuetschow
Copy link
Contributor

Is your feature request related to a problem? Please describe.

Currently reading from wide format csv files is implemented (long format under development). However, some datasets use different formats, e.g. a table with gas and category as dimensions where each year has an individual sheet or file.

Describe the solution you'd like

Function that reads csv files from a folder or sheets from an xls(x) file into the interchange format.

Describe alternatives you've considered

Processing using external tools like the primap1 data_converter helper scripts

Additional context

For an example see e.g. the CEADs data for china. www.ceads.net/

@JGuetschow JGuetschow added the enhancement New feature or request label Apr 7, 2021
@JGuetschow JGuetschow self-assigned this May 19, 2021
@JGuetschow
Copy link
Contributor Author

Currently working on a set of function to help reading from different xls and csv formats which need custom code.

@JGuetschow JGuetschow added the priority: medium medium priority issue. Good to solve soon if possible label Mar 27, 2023
@JGuetschow
Copy link
Contributor Author

The current way to tackle data reading is to use custom code to convert to a long or wide pandas DataFrame and then convert that to interchange format. It might not be worth the effort to further generalize the data reading functions as functions that can read from more format can become overly complex in use and might not be preferable over custom code per dataset.

@mikapfl
Copy link
Member

mikapfl commented Mar 29, 2023

Yeah, I'd say let's start with custom code per dataset, and if we notice we are writing the same code over and over, we generalize and include more into primap2.

@mikapfl mikapfl closed this as completed Sep 13, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request priority: medium medium priority issue. Good to solve soon if possible
Projects
None yet
Development

No branches or pull requests

2 participants