Read xlsx #94

tralsos · 2020-12-21T13:01:50Z

Changes pd.read_excel to use the openpyxl engine, since newer versions of xlrd do not support '.xlsx' files. Closes issue #88. Requirements are updated to use pandas>1.1 and xlrd>2.0, but this should also work with the previous requirements if we don't want to change these yet.

tralsos · 2020-12-21T13:25:03Z

@jcrivenaes should we change the requirements as suggested here, or are there reasons to keep the previous requirements?

berland · 2020-12-21T16:52:35Z

We should not enforce versions that are not yet in komodo stable. Pandas 1.1 is on its way in now, but xlrd might take another month or so.

berland · 2020-12-21T17:08:35Z

src/fmu/tools/sensitivities/_designsummary.py

        )
-
+    print(dgn.loc[0]["SENSCASE"])


Is this a debugging statement?

berland · 2020-12-21T17:10:17Z

src/fmu/tools/sensitivities/_designsummary.py

-            "Design matrix filename should be on Excel or csv format"
-            " and end with .xlsx or .csv"
+            "Design matrix should be on Excel or csv format"
+            " and filename should end with .xlsx or .csv"


"should -> must" is perhaps more to the point (.XLSX would even fail here, but we can live with that.)
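
For reference, a minimal sketch of a case-insensitive suffix check that would also accept `.XLSX`. This is not what the PR implements; the helper name `design_matrix_kind` and its return values are hypothetical, chosen only for illustration.

```python
from pathlib import Path


def design_matrix_kind(filename):
    """Return "excel" or "csv" based on the file suffix, ignoring case.

    Hypothetical helper, not part of fmu-tools.
    """
    # Lower-casing the suffix makes ".XLSX" and ".Csv" acceptable as well.
    suffix = Path(filename).suffix.lower()
    if suffix == ".xlsx":
        return "excel"
    if suffix == ".csv":
        return "csv"
    raise ValueError(
        "Design matrix must be on Excel or csv format"
        " and filename must end with .xlsx or .csv"
    )
```

With this, `design_matrix_kind("design.XLSX")` takes the Excel branch, while a name such as `design.xls` still raises `ValueError`.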

berland · 2020-12-21T17:11:53Z

src/fmu/tools/sensitivities/_designsummary.py

@@ -53,15 +53,17 @@ def summarize_design(filename, sheetname="DesignSheet01"):

    # Read design matrix and find realisation numbers for each sensitivity
    if filename.endswith(".xlsx"):
-        dgn = pd.read_excel(filename, sheetname)
+        dgn = pd.read_excel(filename, sheetname, engine="openpyxl")
+        dgn.dropna(axis=0, how="all", inplace=True)


Is this due to things like "coloured cells"? If so, maybe nice to add a comment about that (to explain why the line of code is there, otherwise it could disappear in later refactorings)

Have added this
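
A small sketch of what `dropna(axis=0, how="all")` does, using an in-memory frame as a stand-in for a sheet that comes back with trailing empty rows. The thread does not confirm why such rows appear, and the column values below are made up for illustration.

```python
import pandas as pd

# Hypothetical stand-in for a design sheet: the last two rows are
# entirely empty, the third row is only partially filled.
dgn = pd.DataFrame(
    {
        "REAL": [0, 1, 2, None, None],
        "SENSNAME": ["rms_seed", "faults", None, None, None],
        "SENSCASE": ["p10_p90", "low", "high", None, None],
    }
)

# how="all" drops a row only when every cell in it is missing,
# so the partially filled third row is kept.
cleaned = dgn.dropna(axis=0, how="all")
```

Only the fully empty rows are removed; a row with at least one value survives, which is why `how="all"` is safer here than the default `how="any"`.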

berland · 2021-01-07T15:50:30Z

Tests pass on combinations of xlrd 1.2.0/2.0.1 and pandas versions 0.25 up to 1.2.0, except pandas 1.1.1, but that particular version is not in use.

Trine Alsos and others added 5 commits December 21, 2020 13:40

changes to switch pd.read_excel to openpyxl engine

6bb725a

formating

d71ae62

changes to switch pd.read_excel to openpyxl engine

d08703a

formating

f3fb5a2

Merge branch 'master' into read_xlsx

f78706e

tralsos requested a review from berland December 21, 2020 13:23

berland reviewed Dec 21, 2020

berland reviewed Dec 21, 2020

Trine Alsos added 3 commits December 22, 2020 07:44

Removed print statement. Improved commenting

b383f52

removed print

585deea

Changed requirements back again

c9b1ae8

tralsos mentioned this pull request Jan 4, 2021

Crash with pandas 1.2.0 #95

Closed

berland approved these changes Jan 7, 2021

berland merged commit 6fdf3fd into equinor:master Jan 7, 2021

berland mentioned this pull request Dec 2, 2021

Unpin xlrd equinor/semeio#381

Closed
