10 minutes to Pandas tutorial using Koala + Spark, incomplete #34

Closed
wants to merge 5 commits into from

thunterdb
Contributor

This is the start of the translation of the basic tutorial on pandas called 10 minutes to pandas, available here:

This tutorial is extensive, so I think it should focus on the 'easy' sections that are already implemented, and add the 'hard' sections later as they get implemented.

@thunterdb
Contributor Author

This notebook is currently too big, and does not even render on GitHub. I am going to break it into multiple sections to make contributions easier.

@garawalid
Contributor

@thunterdb I'll join you!

@thunterdb
Contributor Author

@garawalid please be my guest. Let us focus on the first part, up to and including 'Viewing the data'. I hope that this is small enough to render easily in GitHub and Sphinx.

This was written before the refactoring in #90, so there are a few things to watch for:

  • it should now follow the import best practice: import databricks.koalas as ks
  • it does not need to refer to Spark as prominently, since most of the calls to the Spark context are automated now
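
As a rough sketch of what those two points could look like in the first part of the notebook: the column names and values below are made up for illustration, and the pandas fallback is only an assumption so the snippet runs where Koalas is not installed. It is not part of this PR.

```python
try:
    # Import convention named in the list above.
    import databricks.koalas as ks
except ImportError:
    # Assumption for this sketch only: fall back to pandas so the example
    # still runs without Koalas/Spark. The calls used below exist in both APIs.
    import pandas as ks

# No SparkContext or SparkSession is created explicitly here.
df = ks.DataFrame({"a": [1, 2, 3, 4], "b": [10.0, 20.0, 30.0, 40.0]})

# A few 'Viewing the data' style calls.
first_rows = df.head(2)        # first two rows
n_rows = len(df)               # number of rows
col_names = list(df.columns)   # column labels
mean_b = df["b"].mean()        # mean of column "b"

print(first_rows)
print(n_rows, col_names, mean_b)
```

With Koalas installed, the same code runs on Spark without the notebook ever touching the Spark context directly, which is the point of the second item.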

There will probably be some other comments, so I suggest starting small and building up from there. You should branch off from here and open a separate pull request.

Thanks a lot!

@dennyglee
Contributor

Glad you're helping out @garawalid - glad to help with it as well. Please do not hesitate to ping me and I can work with your branch to update the tutorial. HTH!

@garawalid
Contributor

@dennyglee
With pleasure! Just give me three days. I need to finish the data sanitization work for isna and fillna, and then we can start!

@dennyglee
Contributor

Cool stuff - looking forward to it, eh?!

@shril
Contributor

shril commented May 3, 2019

@thunterdb, @garawalid Can I also join this?

@garawalid
Contributor

@shril
Sure. We are working on this in PR #196. You can open a PR in this repo tutorial branch.
We will update #196 later.

@HyukjinKwon
Member

It's merged at d8c7f73

@HyukjinKwon HyukjinKwon closed this Oct 1, 2019
5 participants