Extend Excel write/read to handle large data beyond the row limits #309

behnam-zakeri · 2020-04-10T10:04:01Z

In response to issue #292, this PR extends the code in ixmp.backend.io.py to write and read large data, i.e., dataframes and series longer than the maximum row number of Excel.

How to review

You can test this branch as it is. The tests are improved to reflect the new changes. ALternatively, you can export one of the global scenarios and check if parameter land_output is completely transferred to Excel in two sheets.

PR checklist

Tests improved to reflect the enhancements.
Documentation is not needed, minor enhancement.
Release notes updated by adding a short note.

codecov · 2020-04-10T11:33:26Z

Codecov Report

Merging #309 into master will increase coverage by 0.10%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #309      +/-   ##
==========================================
+ Coverage   97.31%   97.41%   +0.10%     
==========================================
  Files          39       41       +2     
  Lines        4090     4251     +161     
==========================================
+ Hits         3980     4141     +161     
  Misses        110      110

Impacted Files	Coverage Δ
ixmp/backend/base.py	`98.65% <100.00%> (ø)`
ixmp/backend/io.py	`98.55% <100.00%> (+0.11%)`	⬆️
ixmp/cli.py	`98.67% <100.00%> (+<0.01%)`	⬆️
ixmp/core.py	`95.80% <100.00%> (-0.01%)`	⬇️
ixmp/tests/test_cli.py	`100.00% <100.00%> (ø)`
ixmp/reporting/key.py	`100.00% <0.00%> (ø)`
ixmp/reporting/quantity.py	`95.00% <0.00%> (ø)`
ixmp/tests/reporting/__init__.py	`100.00% <0.00%> (ø)`
... and 6 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ea17bab...96528d8. Read the comment docs.

…-rowlimit

zikolach

I left some comments

ixmp/backend/io.py

zikolach · 2020-04-16T07:26:34Z

ixmp/backend/io.py

+                if i > 1:
+                    suffix = '({})'.format(i)
+                else:
+                    suffix = ''


Wouldn't it better to always add numeric prefix if number of elements is more than 1 sheet?

Now, the sheets will be saved as foo, foo(1), foo(2), ... . I think it's better to see the sheets starting with the name of the items (in case the sheet name is not fully expanded).

ixmp/backend/io.py

ixmp/cli.py

khaeru · 2020-04-17T15:35:25Z

In addition to the points raised by @zikolach, the documentation (current version shown here: https://message.iiasa.ac.at/projects/ixmp/en/master/file-io.html#scenario-model-data) needs to be adjusted to describe the new format.

behnam-zakeri · 2020-04-17T16:31:32Z

In addition to the points ... needs to be adjusted to describe the new format.

Thanks @khaeru, I extended the doc to reflect this. I also noticed a redundant sentence and deleted that.

khaeru

Excellent @behnam-zakeri, thank you!

I used the Python built-in functions range(), zip(), and enumerate(), plus the technique of defining a function-within-a-function, to streamline the code a little, but your improvements were already very tidy, and the addition of tests is a solid example.

Will merge once tests pass.

behnam-zakeri · 2020-04-21T15:01:22Z

Thanks @zikolach for useful comments and @khaeru for very nice improvements of the code.

behnam-zakeri added 4 commits April 10, 2020 11:49

improve s_write_excel for writing data beyond maximum allowed rows

bfc87dc

improving s_read_excel to read data from multiple sheets for one item

6c866cc

release notes updated

4df8d9b

making max row integer

cc45562

behnam-zakeri added 6 commits April 10, 2020 15:28

extended cli and test_cli to test max row limit

d3bb359

extended core.py for input argument max_row for to_excel()

b967e08

corrected tests for io

6806892

corrected tests for io

469641e

Merge remote-tracking branch 'origin/excel-io-rowlimit' into excel-io…

c589858

…-rowlimit

reducing max_row in tests

bce03c4

behnam-zakeri force-pushed the excel-io-rowlimit branch from 2b1be35 to bce03c4 Compare April 15, 2020 08:16

behnam-zakeri added 2 commits April 15, 2020 12:41

io improved for a default max row and checking users input for that

9f276ab

reducing number of rows in tests to cover sets

82c9ff5

behnam-zakeri force-pushed the excel-io-rowlimit branch from ec0c31a to 82c9ff5 Compare April 15, 2020 13:52

behnam-zakeri requested review from khaeru and zikolach April 15, 2020 15:41

correcting append of data

a27804f

zikolach reviewed Apr 16, 2020

View reviewed changes

khaeru assigned behnam-zakeri Apr 17, 2020

khaeru linked an issue Apr 17, 2020 that may be closed by this pull request

Handle items with >10⁶ elements in Excel data I/O #292

Closed

doc updated

4cbf3d0

behnam-zakeri and others added 6 commits April 17, 2020 18:32

io and cli updated to address Nikolay's comments

6df61d1

small change in text

80a55db

extra if clause removed

5d774d4

Add iiasa#309 to release notes

f255217

Expand docstring and docs for Scenario.to_excel(..., max_row=)

e2de768

Simplify handling of max_row argument in io.s_write_excel()

9db66b4

khaeru added 3 commits April 21, 2020 14:54

Use a convenience function to read items from multiple Excel sheets

aaac697

Use Python built-ins to simplify io.s_write_excel()

0a06f59

Use Path.with_name() in test_excel_io()

96528d8

khaeru approved these changes Apr 21, 2020

View reviewed changes

zikolach approved these changes Apr 21, 2020

View reviewed changes

khaeru merged commit 56355b8 into iiasa:master Apr 21, 2020

khaeru mentioned this pull request Sep 14, 2020

Add Scenario.to_csv() method #300

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extend Excel write/read to handle large data beyond the row limits #309

Extend Excel write/read to handle large data beyond the row limits #309

behnam-zakeri commented Apr 10, 2020 •

edited

Loading

codecov bot commented Apr 10, 2020 •

edited

Loading

zikolach left a comment

zikolach Apr 16, 2020

behnam-zakeri Apr 17, 2020

khaeru commented Apr 17, 2020

behnam-zakeri commented Apr 17, 2020

khaeru left a comment

behnam-zakeri commented Apr 21, 2020

Extend Excel write/read to handle large data beyond the row limits #309

Extend Excel write/read to handle large data beyond the row limits #309

Conversation

behnam-zakeri commented Apr 10, 2020 • edited Loading

How to review

PR checklist

codecov bot commented Apr 10, 2020 • edited Loading

Codecov Report

zikolach left a comment

Choose a reason for hiding this comment

zikolach Apr 16, 2020

Choose a reason for hiding this comment

behnam-zakeri Apr 17, 2020

Choose a reason for hiding this comment

khaeru commented Apr 17, 2020

behnam-zakeri commented Apr 17, 2020

khaeru left a comment

Choose a reason for hiding this comment

behnam-zakeri commented Apr 21, 2020

behnam-zakeri commented Apr 10, 2020 •

edited

Loading

codecov bot commented Apr 10, 2020 •

edited

Loading