New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-5380][GraphX] Solve an ArrayIndexOutOfBoundsException when build graph with a file format error #4176
Conversation
…if the format of the source file is wrong There will be an ArrayIndexOutOfBoundsException if the format of the source file is wrong
Can one of the admins verify this patch? |
The argument I can see against it is simply that you want to fail fast on bad input, not sort of continue. If all of the file is like this you'll just log a zillion warnings instead of just stopping. How about changing the warning to throw a standard exception like |
I wonder if stopping the process is the best solution. An other idea is that illegal entries are silently re-directed into a file or something. |
Since this was a very simple change, I went ahead and merged it as my first test-drive of the commit bit. Looks like it worked! |
I've also merged this into |
…ld graph with a file format error When I build a graph with a file format error, there will be an ArrayIndexOutOfBoundsException Author: Leolh <leosandylh@gmail.com> Closes #4176 from Leolh/patch-1 and squashes the following commits: 94f6d22 [Leolh] Update GraphLoader.scala 23767f1 [Leolh] [SPARK-3650][GraphX] There will be an ArrayIndexOutOfBoundsException if the format of the source file is wrong
Ah sounds like I should have done more back-porting. I was conservative and only put these several small fixes in master. Go ahead and I'll recalibrate a little accordingly. |
@srowen From now until we cut a 1.3.0 preview, I'd merge all bug fixes into at least master and |
When I build a graph with a file format error, there will be an ArrayIndexOutOfBoundsException