refactor: refactor CLI part of excel2xml (DEV-2190) #384

jnussbaum · 2023-05-25T09:06:47Z

linear · 2023-05-25T09:06:49Z

BalduinLandolt

Looks good!

One small stylistic thin: I like to be able to "read code from top to bottom" meaning that the public method should be art the beginning, and the private methods after that, ideally in the order they are used in the public method. I definitely wouldn't make this a "hard rule", and there are many edge cases where this cannot be done reasonably anyways. But as a general rule of thumb, how do you feel about this?

Additionally, it came to my mind that the changes you introduced here, should have a big impact on unit testing: Before you could only do an "end to end" test of the huge function; now you have separate units that allow testing much more fine grained and systematically (including all happy and unhappy paths). I don't think this has to be in scope for the current task at hand, but it's definitely something worth considering.

BalduinLandolt · 2023-05-25T11:32:45Z

src/dsp_tools/excel2xml.py

+        True if everything went well, False otherwise
+    """
+    # read and prepare the input file
+    success = True


why is this needed? it's not actually modified

it's modified on (new) line 2244

Ah... github stopped displaying me the file at line 2243 :)
In that case I would suggest moving it down there, or even change the logic to something like this (pseudo code):

if len(warnings) > 0: print("Finished with warnings") return False print("finished...") return True

or if you want to keep the flow you currently have

with warnings ... as w: write(file) success: bool = len(w) == 0 print(finished...) return success

jnussbaum · 2023-05-25T13:51:04Z

I like to be able to "read code from top to bottom" meaning that the public method should be art the beginning, and the private methods after that

Interesting, I had been thinking about this point, too. But I had always thought that the ideal would be the other way round: Define called functions above their caller. This is already the case in many DSP-TOOLS modules, and perhaps that makes sense for Python, because in "bare" Python code outside a main() function, a function cannot be called if it is defined further down.
It's also interesting what Bing's AI chat has to contribute to this question:

I'm especially struck by the last sentence: "Ideally, your modules should be small/short enough such that the ordering of the functions doesn’t really matter." Aouch, here I have some homework to do...

BalduinLandolt · 2023-05-25T15:59:21Z

I like to be able to "read code from top to bottom" meaning that the public method should be art the beginning, and the private methods after that

Interesting, I had been thinking about this point, too. But I had always thought that the ideal would be the other way round: Define called functions above their caller. This is already the case in many DSP-TOOLS modules, and perhaps that makes sense for Python, because in "bare" Python code outside a main() function, a function cannot be called if it is defined further down. It's also interesting what Bing's AI chat has to contribute to this question: I'm especially struck by the last sentence: "Ideally, your modules should be small/short enough such that the ordering of the functions doesn’t really matter." Aouch, here I have some homework to do...

Technically, it's not just outside a main() function, it has to be outside of any function at all. And this can never happen for DSP-TOOLS, I think.
But I agree with the AI that it's a matter of taste, so if you prefer it that way, sure.

Regarding the last point, I mostly agree with the AI: modules should definitely be very short! (to me, 200-300 lines max seems sane in many cases, but it's hard to put a number on it. - This, by the way, is also why I'm not a fan of over-documenting code... this is a "Ballance-Akt" to document enough but not too much, otherwise code gets harder to read again.)
But I'm also an advocate of many, short functions, which in turn makes ordering more important. So even for 200 lines of code, if you have short functions, it's very helpful to have a understandable ordering.

jnussbaum added 5 commits May 24, 2023 17:08

edit

7016577

wip

668643f

continue

8328e90

edit

f2371d9

improve error messages

43cdd0a

jnussbaum self-assigned this May 25, 2023

jnussbaum requested a review from BalduinLandolt May 25, 2023 10:54

BalduinLandolt approved these changes May 25, 2023

View reviewed changes

jnussbaum merged commit cd9cbb7 into main May 25, 2023
11 checks passed

jnussbaum deleted the wip/dev-2190-refactor-excel2xml branch May 25, 2023 13:52

daschbot mentioned this pull request May 25, 2023

chore: release 2.3.2 #383

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: refactor CLI part of excel2xml (DEV-2190) #384

refactor: refactor CLI part of excel2xml (DEV-2190) #384

jnussbaum commented May 25, 2023

linear bot commented May 25, 2023

BalduinLandolt left a comment

BalduinLandolt May 25, 2023

jnussbaum May 25, 2023

BalduinLandolt May 25, 2023

jnussbaum commented May 25, 2023

BalduinLandolt commented May 25, 2023

refactor: refactor CLI part of excel2xml (DEV-2190) #384

refactor: refactor CLI part of excel2xml (DEV-2190) #384

Conversation

jnussbaum commented May 25, 2023

linear bot commented May 25, 2023

BalduinLandolt left a comment

Choose a reason for hiding this comment

BalduinLandolt May 25, 2023

Choose a reason for hiding this comment

jnussbaum May 25, 2023

Choose a reason for hiding this comment

BalduinLandolt May 25, 2023

Choose a reason for hiding this comment

jnussbaum commented May 25, 2023

BalduinLandolt commented May 25, 2023