Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Serialization Issue with org.apache.lucene.facet.FacetsConfig #234

Open
yeikel opened this issue Feb 25, 2020 · 4 comments
Open

Serialization Issue with org.apache.lucene.facet.FacetsConfig #234

yeikel opened this issue Feb 25, 2020 · 4 comments

Comments

@yeikel
Copy link
Contributor

yeikel commented Feb 25, 2020

I am facing the following serialization issue :

Job aborted due to stage failure: Task 144.0 in stage 25.0 (TID 2122) had a not serializable result: org.apache.lucene.facet.FacetsConfig
Serialization stack:
	- object not serializable (class: org.apache.lucene.facet.FacetsConfig, value: org.apache.lucene.facet.FacetsConfig@53a75ca4)
	- field (class: org.zouzias.spark.lucenerdd.partition.LuceneRDDPartition, name: FacetsConfig, type: class org.apache.lucene.facet.FacetsConfig)
	- object (class org.zouzias.spark.lucenerdd.partition.LuceneRDDPartition, org.zouzias.spark.lucenerdd.partition.LuceneRDDPartition@30e83579)

It is hard to replicate and I am not sure what is triggering it as index works just fine sometimes.

Do you have any idea? @zouzias

Example of jobs. Some failed , some succeed(same code) :

image

@zouzias
Copy link
Owner

zouzias commented Feb 26, 2020

Hi,

do you use faceted search at all? I would like to remove the faceted search feature since DataFrames with parquet files as a backend are superior.

See: #171

@yeikel
Copy link
Contributor Author

yeikel commented Feb 26, 2020

I am not using that feature but indexing seems to be calling the FacetsConfig

@zouzias
Copy link
Owner

zouzias commented Feb 26, 2020

This looks very suspicious. Can you share some code to help you reproduce the error?

	- object (class org.zouzias.spark.lucenerdd.partition.LuceneRDDPartition, org.zouzias.spark.lucenerdd.partition.LuceneRDDPartition@30e83579)

It seems that the LuceneRDDPartition object is being serialized which it should never happen. Are you using the cartesianlinker method?

@yeikel
Copy link
Contributor Author

yeikel commented Feb 27, 2020

Yes , I am using the cartesianlinker

I am not sure what triggers it , but I will update the issue if I can find a repeatable sample

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants