Support UTF-8 charset in ShapefileReader #192

Merged
jiayuasu merged 1 commit into DataSystemsLab:master on Feb 20, 2018


@jiayuasu
Member

commented Feb 20, 2018

This PR fixes the unreadable characters produced by the shapefile reader. See Issue #190.

If your ESRI shapefile contains Arabic, Chinese, or Korean text, you need to force the GeoSpark shapefile reader to use the UTF-8 charset.

Add the following line before calling the GeoSpark ShapefileReader:

System.setProperty("geospark.global.charset","utf8")
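To illustrate why the property matters, here is a minimal, self-contained sketch in plain Java. It does not use GeoSpark itself: `CharsetDemo`, `resolveCharset`, and `decode` are hypothetical names, and the ISO-8859-1 fallback is an assumption made for the demo, not necessarily what the reader does internally. It only shows how the same UTF-8 bytes decode differently depending on whether `geospark.global.charset` is set.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class CharsetDemo {
    // Hypothetical helper (not GeoSpark code): pick a charset from the
    // system property, falling back to ISO-8859-1 when it is not set.
    // The fallback is an assumption made for this illustration.
    public static Charset resolveCharset() {
        String name = System.getProperty("geospark.global.charset");
        return name == null ? StandardCharsets.ISO_8859_1 : Charset.forName(name);
    }

    // Decode raw attribute bytes with whatever charset is currently configured.
    public static String decode(byte[] raw) {
        return new String(raw, resolveCharset());
    }

    public static void main(String[] args) {
        // Two Chinese characters, written as escapes so the source file
        // encoding does not matter; each takes three bytes in UTF-8.
        byte[] raw = "\u5317\u4eac".getBytes(StandardCharsets.UTF_8);

        // Without the property, the six bytes are read as six unrelated characters.
        System.clearProperty("geospark.global.charset");
        System.out.println("unset: " + decode(raw).length() + " chars");

        // With the property set, the original two characters are recovered.
        System.setProperty("geospark.global.charset", "utf8");
        System.out.println("utf8:  " + decode(raw).length() + " chars");
    }
}
```

Note that a system property set this way applies only to the JVM in which the call runs.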

@jiayuasu jiayuasu merged commit 3c6a3a1 into DataSystemsLab:master Feb 20, 2018

1 check was pending

continuous-integration/travis-ci/pr The Travis CI build is in progress
@pedromorfeu

commented Jun 12, 2019

@jiayuasu, does it need to be set in each executor? I think it does. I had to add:

--conf "spark.executor.extraJavaOptions=-Dgeospark.global.charset=utf8"