Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CARBONDATA-2912] Support CSV table load csv data with spark2.2 #2691

Closed

Conversation

xubo245
Copy link
Contributor

@xubo245 xubo245 commented Sep 4, 2018

In branch-1.3, CSV table cann't load csv data with spark2.2 because: If using low commons-lang version, the default for the timestampFormat is yyyy-MM-dd'T'HH:mm:ss.SSSXXX which is an illegal argument and can not be recognized after upgrade spark from 2.1 to 2.2. It needs to be set when you are writing the dataframe out.
Carbon need upgrade commons-lang3 version

CSV table is "create table ... using csv options..."

Be sure to do all of the following checklist to help us incorporate
your contribution quickly and easily:

  • Any interfaces changed?
    No
  • Any backward compatibility impacted?
    No
  • Document update required?
    No
  • Testing done
    add some test case
  • For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.
    No

@xubo245 xubo245 changed the base branch from master to branch-1.3 September 4, 2018 08:47
@CarbonDataQA
Copy link

Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/8288/

@CarbonDataQA
Copy link

Build Failed with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/219/

@xuchuanyin
Copy link
Contributor

xuchuanyin commented Sep 4, 2018

@xubo245 What does 'CSV table' mean in the title?

Can you explain why the version upgrading for common-langs is needed? Does it improve something? (Just curious)

@xuchuanyin
Copy link
Contributor

xuchuanyin commented Sep 4, 2018

Carbon need upgrade commons-lang3 'vision' -- > 'version'

@xubo245
Copy link
Contributor Author

xubo245 commented Sep 5, 2018

@xuchuanyin CSV table is "create table ... using csv options...". In branch-1.3, CSV table cann't load csv data with spark2.2 because: If using low commons-lang version, the default for the timestampFormat is yyyy-MM-dd'T'HH:mm:ss.SSSXXX which is an illegal argument and can not be recognized after upgrade spark from 2.1 to 2.2. It needs to be set when you are writing the dataframe out.

@xubo245
Copy link
Contributor Author

xubo245 commented Sep 5, 2018

retest this please

@xuchuanyin
Copy link
Contributor

Oh, I didn't know this grammar before. Is CSV table a carbon table or spark table?

@xubo245
Copy link
Contributor Author

xubo245 commented Sep 5, 2018

spark,it support using parquet too

@CarbonDataQA
Copy link

Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/8313/

@CarbonDataQA
Copy link

Build Failed with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/243/

@CarbonDataQA
Copy link

Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/3/

@xubo245
Copy link
Contributor Author

xubo245 commented Sep 6, 2018

retest this please

@CarbonDataQA
Copy link

Build Failed with Spark 2.3.1, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.3/8340/

@CarbonDataQA
Copy link

Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/102/

@xubo245
Copy link
Contributor Author

xubo245 commented Sep 6, 2018

retest this please

@CarbonDataQA
Copy link

Build Failed with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/270/

@CarbonDataQA
Copy link

Build Failed with Spark 2.3.1, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.3/8343/

@CarbonDataQA
Copy link

Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/105/

@CarbonDataQA
Copy link

Build Success with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/273/

@xubo245
Copy link
Contributor Author

xubo245 commented Sep 6, 2018

@jackylk Please review it. master has updated, but branch-1.3 didn't fix it.

@@ -0,0 +1,4 @@
name,age
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can you reuse the current csv file, no need to add new one.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ok, done

    1.upgrade commons-lang3 version
    2.add some test case
@xubo245 xubo245 force-pushed the CARBONDATA-2912_twoInsert1.3.2 branch from ae4fa80 to 8445a94 Compare September 13, 2018 11:36
@CarbonDataQA
Copy link

Build Failed with Spark 2.3.1, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.3/8512/

@CarbonDataQA
Copy link

Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/273/

@CarbonDataQA
Copy link

Build Failed with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/442/

@xubo245
Copy link
Contributor Author

xubo245 commented Sep 15, 2018

retest this please

@CarbonDataQA
Copy link

Build Failed with Spark 2.3.1, Please check CI http://136.243.101.176:8080/job/carbondataprbuilder2.3/8536/

@CarbonDataQA
Copy link

Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/291/

@CarbonDataQA
Copy link

Build Failed with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/466/

@xubo245 xubo245 closed this Sep 15, 2018
@xubo245 xubo245 reopened this Sep 15, 2018
@xubo245
Copy link
Contributor Author

xubo245 commented Sep 15, 2018

retest this please

@CarbonDataQA
Copy link

Build Failed with Spark 2.3.1, Please check CI http://136.243.101.176:8080/job/carbondataprbuilder2.3/8539/

@CarbonDataQA
Copy link

Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/294/

@CarbonDataQA
Copy link

Build Failed with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/469/

@xubo245
Copy link
Contributor Author

xubo245 commented Sep 17, 2018

retest this please

@CarbonDataQA
Copy link

Build Failed with Spark 2.3.1, Please check CI http://136.243.101.176:8080/job/carbondataprbuilder2.3/8545/

@CarbonDataQA
Copy link

Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/300/

@CarbonDataQA
Copy link

Build Success with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/475/

@xubo245
Copy link
Contributor Author

xubo245 commented Sep 18, 2018

Branch-1.3 don't support spark2.3, so CI will not work. Please review it. @chenliang613 @xuchuanyin @jackylk

@jackylk
Copy link
Contributor

jackylk commented Oct 9, 2018

LGTM

asfgit pushed a commit that referenced this pull request Oct 9, 2018
    1.upgrade commons-lang3 version
    2.add some test case

This closes #2691
@xubo245 xubo245 closed this Oct 9, 2018
@xubo245
Copy link
Contributor Author

xubo245 commented Oct 11, 2018

merged to branch-1.3

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

5 participants