-
Notifications
You must be signed in to change notification settings - Fork 36
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Demography Workshop Datahub Request #5643
Comments
@wrathofquan Thanks, Responding to the time sensitive request first - Yes, students can use the workshop hub to do their work. We can enable shared-readwrite directories so that they can store their datasets. Is it possible to create a dummy bcourses site, add all the participants of the data challenge to that site and share the bcourses id? You might need to assign Teacher/TA role for the folks who will have read/write access and student role for the folks who need read access to the shared drive. We can enable access to the drive using the shared bcourses id in the workshop hub. |
Thank you @balajialg. bcourses id: 1534506 |
@wrathofquan Hi Josh, changes are merged to staging hub. and should be deployed within the next 60 minutes. Can you ask your students added to the bcourses site to check whether they can see a) shared-readwrite directory and b) RAM increase to 4 GB in the staging hub? |
@wrathofquan I just updated the configs in Datahub via this PR. Changes were merged to prod 10 mins ago. You should be able to test this in https://datahub.berkeley.edu/ in an hour |
actually, it's live on datahub now!
…On Thu, Mar 28, 2024 at 11:32 AM Balaji Alwar ***@***.***> wrote:
@wrathofquan <https://github.com/wrathofquan> I just updated the configs
in Datahub via this PR
<#5646>. Changes are
merged to prod 10 mins ago. You should be able to test this in
https://datahub.berkeley.edu/ in an hour
—
Reply to this email directly, view it on GitHub
<#5643 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAMIHLEK5NP3N2UNQMZWXFDY2RO47AVCNFSM6AAAAABFLOB6H6VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDAMRVHA3DENBUGY>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
|
@balajialg Would it be possible to increase RAM to 8GB? We received the assignment datasets today and they are larger than expected. Few users have reported their sessions crashing when reading in data and I can confirm that reading in a 300MB csv eats almost half of available RAM Thank you for considering! |
@wrathofquan Yes, we should be able to increase the RAM to 8 GB. Here is the PR Having said that, I also want to highlight that an increase in RAM is correlated with an increase in cloud costs at our end. So, any further request to increase RAM would be something @shaneknapp and I might need to review in future. |
@wrathofquan I have increased the Workshop Hub RAM to 4 GB. Please inform me if you encounter any discrepancies. When do you recommend reverting the RAM increase? I will schedule it to be reduced accordingly. |
Thank you @balajialg! Everything looks great. Can we drop it back down on Monday June 10? |
Sounds good, thanks! |
Summary
We have two workshop events coming up in the Demography/Population sciences department and are looking at using the workshop hub or potentially another dedicated hub.
The first event is a workshop/hacakthon type event that actually will run for the entire summer and involves Berkeley faculty, graduate students and post docs. It's a data challenge called the predicting fertility data challenge. It begins April 1 and expect about 5-6 users training ML models in python and R. Since this is a longer running event (continues through summer), I'm not sure if the workshop hub is suitable but wanted to check in to see if you have suggestions or alternatives.
This year the Demography department will again host a week-long workshop, June 3-7 on statistical methods in June with researchers visiting from all over the world. We used workshop hub last year with great success and hope to use it again! Compute needs will be same as last year - 4GB of RAM per user. We expect 20-40 users.
User Stories
Workshop instructors and faculty can have all of their instructional materials in a Datahub that is consistent across users.
Attendees won't have to worry about managing their own compute environments.
Acceptance criteria
For the data challenge event, users with a calnet can navigate to datahub and use python/r to train ml models. Users will be using same datasets so a
shared-read-write
directory would be required.For the June workshop event, users can navigate to something like: workshop.datahub.berkeley.edu, authenticate without calnet (and potentially manage their own credentials), access the workshop files in RStudio or native R kernel.
Important information
Data challenge
shared-read-write
to work on shared datasets that exceed storage capacity in github.Demography workshop
Tasks to complete
The text was updated successfully, but these errors were encountered: