Fix for string tests failed due to bad str_overload #94

kozlov-alexey · 2019-07-17T13:02:39Z

No description provided.

fschlimb · 2019-07-17T13:13:54Z

This looks good.
Please provide a sentence or two per loop in a comment telling what it does/how it works.

shssf · 2019-07-17T14:54:06Z

hpat/str_ext.py

+# for strip methods define overloads to call Numba implementation
+# forwarding arg1 as a necessary 'chars' argument
+for method in str2str_1arg:
+    func_text = "def str_overload(in_str, arg1):\n"


Do you think it is necessary to implement several function via auto generated source?
I would vote to make them as regular functions with required decorators.
@fschlimb What do you think about it? I mean, it might be better to stay away from such technique of programming.

I usually try to avoid code duplication as much as possible. There will be cases where such code generation is required. So I tend to think it is not too bad.
Even though I am not sure I would now to do this with only decorators, such simpler cases can be handled with higher lever functions which call @overload. Here is a small POC:

from numba import njit, objmode, types from numba.extending import overload # dummy default python funcs def x(a): return a+a def y(a): return a*a def z(a): return a**a # a function generically overloading a given function with one arg def ovl(func): @overload(func) def _f(a): def _ff(a): with objmode(r='int32'): r = func(a) print(':) -> ', func(a)) return r return _ff # use the above overloader to overload x and y ovl(x) ovl(y) # we can now use them in a jitted function @njit def z(): return x(4) + y(4) r = z() print('24 ?=', r)

I am not sure which one I like better. I am torn.

@fschlimb I agree, code duplication is not good approach but autogeneration is also not good (I would say it is worse than code duplication). I would vote for any implementation that avoid code generation.

fine with me as long as we are not duplicating code.
Just out of curiosity: why do you think code generation is bad? HPAT is a code-generator, numba is, and even python is. So this is not really unusual in this context.

@fschlimb Because it non debuggable, non performance assessment, and etc. It only looks like a good approach but if something goes wrong it will be a real headache.

@fschlimb We have one more option - generate the code at build stage, but avoid doing this in runtime.

Frank, Sergey, thank you both for provided input!
I will try to re-write it using provided example, as to me it looks better than calling exec (both due to debugging and security considerations).

[BUG] Fixed problems with generation parquet files (IntelPython#93)

shssf · 2019-07-19T13:52:29Z

@kozlov-alexey Do you plan to make changes in this PR?

kozlov-alexey · 2019-07-19T13:54:04Z

@shssf Yes, I'm about to push it)

Fixing issue with named series handling in fillna (IntelPython#95)

kozlov-alexey · 2019-07-19T15:46:47Z

As per suggestion above I've rewritten it using decorators and getitem.
Two issues are observed:

Test now runs a bit slower, 4.6-4.7 seconds vs 4.2-4.3 seconds before (not sure why it takes more time), that needs to be investigated probably.
Wasn't able to verify on master due to import error from this line in test_strings.py:
"from .gen_test_data import ParquetGenerator" (even after installing pyspark). I was able to verify only based on the commit prior to [BUG] Fixed problems with generation parquet files #93.

shssf · 2019-07-19T22:20:40Z

Wasn't able to verify on master due to import error from this line in test_strings.py:
"from .gen_test_data import ParquetGenerator" (even after installing pyspark). I was able to verify only based on the commit prior to [BUG] Fixed problems with generation parquet files #93.

PR #96 might help

Fix for string tests failed due to bad str_overload

kozlov-alexey force-pushed the feature/fix_strip_overload branch from 0501371 to 7c6ea14 Compare July 17, 2019 14:07

shssf reviewed Jul 17, 2019

View reviewed changes

shssf approved these changes Jul 17, 2019

View reviewed changes

Merge pull request #1 from IntelPython/master

793d66e

[BUG] Fixed problems with generation parquet files (IntelPython#93)

kozlov-alexey and others added 3 commits July 19, 2019 18:09

Merge pull request #2 from IntelPython/master

20b9b6f

Fixing issue with named series handling in fillna (IntelPython#95)

Fix for string tests failed due to bad str_overload

cd8f253

Applying comments, use decorators for str_overload

fa5114f

kozlov-alexey force-pushed the feature/fix_strip_overload branch from 9f34326 to fa5114f Compare July 19, 2019 15:33

Merge branch 'master' into feature/fix_strip_overload

07c1c0a

fschlimb approved these changes Jul 22, 2019

View reviewed changes

fschlimb merged commit db2736f into IntelPython:master Jul 22, 2019

kozlov-alexey deleted the feature/fix_strip_overload branch August 8, 2019 10:36

kozlov-alexey added a commit to kozlov-alexey/sdc that referenced this pull request Oct 4, 2019

Fix for string tests failed due to bad str_overload (IntelPython#94)

bf901ba

Fix for string tests failed due to bad str_overload

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix for string tests failed due to bad str_overload #94

Fix for string tests failed due to bad str_overload #94

kozlov-alexey commented Jul 17, 2019

fschlimb commented Jul 17, 2019 •

edited

Loading

shssf Jul 17, 2019

fschlimb Jul 17, 2019

shssf Jul 17, 2019 •

edited

Loading

fschlimb Jul 17, 2019

shssf Jul 17, 2019

shssf Jul 17, 2019

kozlov-alexey Jul 17, 2019

shssf commented Jul 19, 2019

kozlov-alexey commented Jul 19, 2019

kozlov-alexey commented Jul 19, 2019 •

edited

Loading

shssf commented Jul 19, 2019

Fix for string tests failed due to bad str_overload #94

Fix for string tests failed due to bad str_overload #94

Conversation

kozlov-alexey commented Jul 17, 2019

fschlimb commented Jul 17, 2019 • edited Loading

shssf Jul 17, 2019

Choose a reason for hiding this comment

fschlimb Jul 17, 2019

Choose a reason for hiding this comment

shssf Jul 17, 2019 • edited Loading

Choose a reason for hiding this comment

fschlimb Jul 17, 2019

Choose a reason for hiding this comment

shssf Jul 17, 2019

Choose a reason for hiding this comment

shssf Jul 17, 2019

Choose a reason for hiding this comment

kozlov-alexey Jul 17, 2019

Choose a reason for hiding this comment

shssf commented Jul 19, 2019

kozlov-alexey commented Jul 19, 2019

kozlov-alexey commented Jul 19, 2019 • edited Loading

shssf commented Jul 19, 2019

fschlimb commented Jul 17, 2019 •

edited

Loading

shssf Jul 17, 2019 •

edited

Loading

kozlov-alexey commented Jul 19, 2019 •

edited

Loading