-
Notifications
You must be signed in to change notification settings - Fork 187
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
pyjlwrap support for Pickle Serialization #863
Comments
In other words, you want to serialize Julia objects (wrapped in Python objects) via Pickle. I guess we could do this by embedding the Julia serialization format (via the Serialization stdlib) in pickle? |
Exactly @stevengj . How we can accomplish this? Could you provide some guidance, please? |
I think it involves overloading |
Or rather, we probably want the lower-level |
Great, @stevengj if we can overcome this, would be a huge step for the Julia community and would be glad to publish an article showing this new awesome feature! |
PyCall works nicely for many use cases between Python and Julia. In particular, there is one that could be improved and very important for Data Scientist community. For example, I tried to use it for PySpark library and works very well for the basic use case. But, if the user needs to create a UDF (User Defined Functions), the user will have trouble to serialize the functions.
The UDFs, in this case, would help to many DSs reuse Julia code and call spark to do the heavy work. Have this enabled, would improve the usage of Julia in different scenarios.
To solve the current issues with UDF, PyObject needs to be serializable with Pickle. I don't have much idea how to solve this, but I have a simple use case that if we fix would improve towards this functionality:
Example:
Error:
Reference to UDF in Python: https://docs.databricks.com/spark/latest/spark-sql/udf-python.html
The text was updated successfully, but these errors were encountered: