-
Notifications
You must be signed in to change notification settings - Fork 40
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Parso library update #10
Comments
Just to give some pointers in case someone wants to help. Here you may find the total diff I made on the original 1.2.1 parso library to support splitting: https://www.diffchecker.com/V6tGyqNC It looks like the library has been refactored quite a bit in 2.0 so correspondingly touch points need to be found and updated. |
@saurfang could you please give us some guidance how to update saurfang/spark-sas7bdat to latest parso library? I see https://github.com/saurfang/spark-sas7bdat/blob/master/src/main/java/com/ggasoftware/parso/SasFileParser.java We will try to get this resolved. There are a few bugs we ran into today and both of them seems to be could be resolved by getting to latest praso library. |
@Tagar thank you for taking the interests. Like I outlined in my comment above, the whole reason that we have our own copy of To see #19 to be successful, I would like to see a minimal test data that replicates the failure on 1.2.1. Only then will we able to confidently say that any changes we lay on top of it to upgrade to 2.0 actually solve the problems. That is not to say #19 is incorrect but I just do not have the capacity to perform detailed code review and has to rely on unit tests to convince myself that the change is actually doing and fixing what it claims. |
Thank you @saurfang Can we ask why that diff in |
@saurfang that diff is already in upstream Parso library.
Please let us know if that's okay with you to create a new PR for this work ? |
Yes please! I forked the implementation originally because their library development went dormant. Now that they are back on Github I am more than happy to push/pull changes to the upstream library. |
@Tagar as you are working on the PR to update parso to the latest version, could you please update link to parso in README.md in the Related Work section? It should be https://github.com/epam/parso instead of scitouch site. @saurfang hope you are ok with the link change. |
@saurfang our manager asked to bounce our contribution at legal department first as it's first time we contribute back to an open-source project. I don't expect this will be an issue, but will just take some time. We delivered this change internally a week ago. Thanks. @printsev sure - will do. Thanks. |
With earlier version of parso, was getting error while reading compressed files. So I have updated parso 2.0.7 and done relevant changes, now able to read compressed file. :) @Tagar Seems like you have successfully implemented. Have you faced this issue? If yes, can you please share your idea/implementation. Please. @saurfang Any suggestion or inputs, would like to provide on this. |
After doing some analysis on #9 with updated parso library 2.0.7. I have found that issue is with below method.
When it try to access bytes from previous page, it won't able read subheader. @saurfang Can you please elabarote this method, on what basis we are calculating partial page length. Any supporting blog/document. |
If I remember correctly, this is based on the reverse engineered http://www2.uaem.mx/r-mirror/web/packages/sas7bdat/vignettes/sas7bdat.pdf @spatil6 |
@spatil6 |
Thanks all for the discussion and contribution. Kudos to @mulya to pull everything through. |
FYI - the updated Parso library is now available:
https://github.com/epam/parso/releases
The text was updated successfully, but these errors were encountered: