JupyterHub setup #3

Open
emiliom opened this issue Oct 24, 2017 · 15 comments

@emiliom
Member

emiliom commented Oct 24, 2017

Dumping some JupyterHub setup info here, for convenient access. For now, mainly from emails from Don:

Accessing our JupyterHub

Other setup info

I got the authentication to work with GitHub. I'm not sure whether I should offer other options: if we want to encourage people to contribute, they should have a GitHub account anyway. But it's your call. Here are your options: https://github.com/jupyterhub/oauthenticator

Right now I am working on getting a personal environment set up after a user logs in; it gets created and set up automatically from a Docker image. :) (note from Emilio: I think this is done?!)

I was thinking of this setup (Emilio said yes to Don's questions):

  • /home/user/tutorial_contents : this is where the GitHub repo resides. We should instruct users not to modify it too much, because local changes might clash with new changes when they pull again. Users can create folders and upload their own stuff to the home directory if they like. Do you think 10GB per user is enough, or should I add more space for each user?
  • To upload notebooks, data, etc., simply clone the repo, put stuff in the appropriate folders and push. Users can then pull from within their Jupyter notebook in the cloud. Another question is how much RAM a user will need. I don’t think there will be any gigantic datasets, so maybe 2GB of RAM per user is enough?

After authentication, "check whether you have tutorial_contents folder and have the odm2client conda environment".
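
For reference, a minimal sketch of that post-login check. It assumes the repo is cloned at ~/tutorial_contents and that `conda` is on the PATH; the helper names are hypothetical and not part of the hub setup.

```python
import json
import subprocess
from pathlib import Path


def has_tutorial_contents(home: Path) -> bool:
    """Return True if the tutorial_contents folder exists under `home`."""
    return (home / "tutorial_contents").is_dir()


def env_exists(env_paths, name: str = "odm2client") -> bool:
    """Return True if any conda environment path ends in `name`."""
    return any(Path(p).name == name for p in env_paths)


def list_conda_envs():
    """Ask conda for its environment paths (assumes `conda` is on the PATH)."""
    result = subprocess.run(
        ["conda", "env", "list", "--json"],
        capture_output=True, text=True, check=True,
    )
    return json.loads(result.stdout)["envs"]


if __name__ == "__main__":
    print("tutorial_contents present:", has_tutorial_contents(Path.home()))
    try:
        print("odm2client env present:", env_exists(list_conda_envs()))
    except (OSError, subprocess.CalledProcessError) as err:
        # conda missing or failing: report instead of crashing
        print("could not query conda:", err)
```

Run from a terminal or a notebook cell inside the hub, this prints two lines saying whether each piece is in place.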

@emiliom
Member Author

emiliom commented Oct 24, 2017

Note to self, from Don: "You were way too quick to register into the jupyterhub, it was still in process when you logged in". So, I may need to redo things? Maybe that's why my user name is "jovyan" and not "emiliom"?

@emiliom
Member Author

emiliom commented Oct 24, 2017

Pasting these TO-DO's from @lsetiawan's email, so I don't have to track it via email 😜 :

  • "I will need to add people to the whitelist, so I’ll need the core group’s github usernames"
  • "could you please give me a total count of the people that will have access to the jupyterhub, so I can figure out how many instances I need?"
  • setting up jupyterhub.bigcz.org subdomain (via Anthony/Stroud)

@ocefpaf
Member

ocefpaf commented Oct 24, 2017

jovyan is the default user on all Jupyter docker images

Jo·vy·an
/ˈjōvēən/
noun – an inhabitant of Jupyter

@emiliom
Member Author

emiliom commented Oct 24, 2017

jovyan is the default user on all Jupyter docker images

Thanks. I figured as much.

@lsetiawan
Member

lsetiawan commented Oct 25, 2017 via email

@emiliom
Member Author

emiliom commented Oct 25, 2017

Yes, Jovyan will be everyone's username

Ok. But they're still different users, right? It's not that everyone will share one account and /home file system, and screw each other up??

This is why I need to know the exact number of people that will use it, so I can allocate resources to scale up as more and more people register.

Like I said over email, we won't have the exact number until next week. 20-25 people is a reasonable estimate. But I assume there's no rush to allocate resources right away, or is there an actual reason to figure that out ASAP?

It's a better and cheaper setup than the geohackweek one, which used a very large machine that cost a lot, was not very scalable, and was much harder to configure. Amanda found a better way and told me to try it :) And it worked!

That all sounds great!

@ocefpaf
Member

ocefpaf commented Oct 25, 2017

Ok. But they're still different users, right? It's not that everyone will share one account and /home file system, and screw each other up??

I am not sure how user spaces work with Jupyter, but I guess that the "docker" user is always jovyan and only jovyan. I don't think the docker instances are the same, though, and users' files should not clash as far as I know.

@emiliom
Member Author

emiliom commented Oct 25, 2017

I am not sure how user spaces work with Jupyter, but I guess that the "docker" user is always jovyan and only jovyan. I don't think the docker instances are the same, though, and users' files should not clash as far as I know.

That's what I'm guessing too. Just want to be 100% sure.

@lsetiawan
Member

Ok. But they're still different users, right? It's not that everyone will share one account and /home file system, and screw each other up??

Yes, each person gets a separate persistent user space of 10GB within the machine itself. Kubernetes manages all of that; I'm not sure exactly how the mechanism works. In Docker terms, it's basically creating a volume for each user, something like -v /home/ocefpaf:/home/jovyan, I think.
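
To make the Docker analogy concrete, here is a small sketch of how a per-user mount argument could be built. It only illustrates the idea above: the function name and the /home host root are assumptions, and on the actual hub Kubernetes handles the storage instead.

```python
def home_volume_arg(username: str, host_root: str = "/home") -> list:
    """Build a `-v` argument mapping a per-user host dir onto /home/jovyan.

    Every container sees the same in-container path (/home/jovyan),
    but each one is backed by a different host directory.
    """
    return ["-v", f"{host_root}/{username}:/home/jovyan"]


if __name__ == "__main__":
    for user in ("emiliom", "ocefpaf"):
        print(user, home_volume_arg(user))
```

So everyone is "jovyan" inside their own container, while the files behind /home/jovyan differ per GitHub user.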

Like I said over email, we won't have the exact number until next week. 20-25 people is a reasonable estimate. But I assume there's no rush to allocate resources right away, or is there an actual reason to figure that out ASAP?

No worries. Based on your email I thought you wanted this ASAP, especially the "self-registration". I just want an approximate number for now, at least the core people, so I can adjust resources and nobody has trouble logging in. Thanks!
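
A rough back-of-the-envelope sketch of the sizing, using the numbers floated in this thread (10GB disk and 2GB RAM per user, 20-25 people). It assumes the worst case of everyone being active at once; the class and function names are made up for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PerUserQuota:
    disk_gb: int = 10  # persistent storage per user, from this thread
    ram_gb: int = 2    # RAM per user, as proposed in this thread


def total_resources(n_users: int, quota: PerUserQuota = PerUserQuota()) -> dict:
    """Return aggregate disk and RAM (GB) if all users are active at once."""
    return {
        "disk_gb": n_users * quota.disk_gb,
        "ram_gb": n_users * quota.ram_gb,
    }


if __name__ == "__main__":
    for n in (20, 25):
        print(n, "users:", total_resources(n))
```

For 25 users this comes to 250GB of disk and 50GB of RAM in total, before any overhead for the hub itself.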

@emiliom
Member Author

emiliom commented Oct 26, 2017

Thanks. I sent you the github profiles of the core-core people last night. But it's not a problem at all if you can't get to it this week. Having @ocefpaf in the system in addition to me is already a big help.

@emiliom
Member Author

emiliom commented Nov 8, 2017

@lsetiawan FYI, the JupyterHub server went down around 10:20pm. The AWS instance is unreachable ("can’t find the server at a34cfd457bf5611e7b16c022cb2dbee7-216768378.us-west-2.elb.amazonaws.com"). It may come back on its own (I hope/assume). But I wanted to note that it went down, so you can investigate why and we can have more confidence that it won't happen again. Thanks.
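
A "can't find the server" message usually means the hostname did not resolve. A minimal sketch of a check one could run to tell that apart from other failures; the function name is hypothetical and it only tests DNS resolution, not whether the hub itself is up.

```python
import socket


def resolves(hostname: str, port: int = 443) -> bool:
    """Return True if DNS can resolve `hostname`, False otherwise."""
    try:
        socket.getaddrinfo(hostname, port)
        return True
    except socket.gaierror:
        # Name lookup failed: the host is unknown to DNS right now
        return False
```

Calling `resolves(...)` with the load balancer hostname quoted above (or with jupyterhub.bigcz.org once that subdomain exists) would show whether the name is currently resolvable.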

@emiliom
Member Author

emiliom commented Nov 8, 2017

Just for the record: It's 1am and the JupyterHub AWS instance is back. I don't know when exactly it came back, though.

@lsetiawan
Member

lsetiawan commented Nov 8, 2017 via email

@emiliom
Member Author

emiliom commented Nov 8, 2017

It went down, because I took it down. Was working on it last night.

Ah, that's a relief.

Next time you plan to bring it down, please ping me to let me know. Remember I often work in the evening, and the workshop is getting very close. If it happens during the day, we should let others know as well.

@lsetiawan
Member

JupyterHub is currently down again; I'm testing the load. Thanks.


3 participants