Skip to content

UCSB-CS-Using-GitHub-In-Courses/github-acad-scripts

Repository files navigation

github-acad-scripts

This repo containts scripts for github academic workflow, developed at UC Santa Barbara. Team Leader: Phill Conrad. Further Acknowledgements and list of authors/contributors at the bottom of this README.

Overview

The main workflow that is currently supported is:

  • For each assignment, for each student (or pair, or team): ** Set up a private repo where that student (or pair, or team) has push/pull access to the repo, as does the instructional staff (Instructor, TAs, Graders), but no other student (or pair, or team) in the course does. The repo can be pre-populated with "starting point" materials provided by the instructor.

The repos can be set up "just in time"---you don't have to set them all up at the beginning of the course.

Getting Started

(1) Before anything else, create an organization for your course.

You do that by visiting:

https://github.com/edu

and clicking on "I'm a teacher or student organization".

Indicate how many students you expect to have, and how many repositories each student or pair will need (i.e. how many assignments you expect to have for each student or pair.)

Note: John Britton of github strongly encourages "one repo per project" rather than "one repo per course (with subdirectories in that repo for assignments.)". His advice: Don't think of repos as a "scarce resource" that needs to be conserved---github is happy to contribute as many repos as you need for academic use.

So my suggestion is to overestimate by 10 to 20 percent, and then if/when you exhaust 80% of your private repos, ask for more.

  1. You will need some source of information that lists the github ids of your students, and maps those to real names.

The scripts can work either from a .csv file, or from a Google Docs Spreadsheet.

The columns can be in any order, but the first row should have the following field names:

"first" for first name "last" for last name "github" for github userid "email" for email address

You may have other fields if you like---those will be ignored. NOTE: THIS IS NOT YET TRUE, BUT WHAT I WANT TO BE TRUE IN THE F13 VERSIONS OF THE SCRIPTS.

Create a form similar to the one shown here---I suggest doing this using Google Drive (I have a sample script that grabs the data from the form) to collect github ids from your students and tie those back to "real names" and student ids.

https://docs.google.com/forms/d/1icypz0MC67pUn9ZiXZV6tMCR6_P8Ejy2jBEZCuO_8Yg/viewform

The scripts can use a .CSV file or a Google Doc

  1. Ask your students to create their github account and fill out the form as soon as you can reach them---preferably before the quarter starts.

Here's the text I used in Spring 2013 for CMPSC 56 at UC Santa Barbara:

Welcome to CMPSC56!

CMPSC56 is one of my favorite courses to teach, because, like CMPSC48,
it is where we take you from "toy programming"---where you are coding
just for the sake of learning how to code---towards "software
development" as it is practiced in industry.

As part of that process, we will be using a version control tool
called "git", and a cloud-based repository service called "github.com"
throughout the course.

Therefore it is important for you to CREATE A GitHub.com ACCOUNT ASAP!
It will save us SO much time if you can do this NOW, and it will only
take 5 or 10 minutes at most.  So, please take two steps.  BOTH are
important:

(1) Please visit http://github.com and create an account. (If you
    already have one, that's great--skip this step.)

(2) Visit the following page and let us know what your github.com
    username is.

http://bit.ly/cs56-s12-githubform

That's it!

If you want, there is a third optional step: you can go to the
folllowing link, and spend fifteen minutes to learn a little bit about
git.  You'll be assigned this as "homework" during your first week of
the quarter, so if you don't do it now, you'll have to do it later
anyway.  Why not get it out of the way?

Got 15 minutes and want to learn git? 
  http://try.github.com/levels/1/challenges/1

(4) The next step is where you clone the repository that contains the scripts for working with github. You should this step and the ones the follow on a machine that has Python 3, and the bash shell---I've made it work on both Linux and Mac.

First, I like to create a directory called ~/github, and then I always cd into that directory before I do "git clone "---that way, I know that all my github repos are organized under that directory. That's optional though--how you organize your disk space is entirely up to you.

In whatever directory where you want the repo clone to live, do this:

git clone https://github.com/UCSB-CS-Using-GitHub-In-Courses/github-acad-scripts.git

That should clone the repository that contains the scripts. There is one more thing to do though---the repo contains a "sub-repository" for PyGithub---that's like a "symbolic link" or a "pointer" to a separate repo for the software that the scripts are based on. The next step makes sure that subdirectory gets populated with the latest version of that software:

(5) To populate the PyGithub subdirectory, do this:

git submodule init git submodule update

That should populate the PyGithub subdirectory with contents. That will allow the scripts to load PyGithub as needed.

Set up Environment Variables

First create an env variable for your organization

export GHA_GITHUB_ORG=UCSB-CS56-W14

Next, one for the spreadsheet that has your student info

export GHA_STUDENT_LIST_URL=https://docs.google.com/spreadsheet/ccc?key=xxxxx

This should be the URL for the "publish to web" link in Google Drive, without the &output=html part, since the script will append &output=csv to it automatically.

Next, one for the spreadsheet that has your staff info

export GHA_STAFF_LIST_URL=https://docs.google.com/spreadsheet/ccc?key=xxxxx

This is similar to the spreadsheet for students, but it has ONLY the urls for TAs, instructors, etc. that get "advance copies" of the lab for testing.

Next, the spreadsheet that has your pair teams

export GHA_PAIR_LIST_URL=https://docs.google.com/spreadsheet/ccc?key=xxxxx

Same thing: this should be the 'publish to web' link.

Next, the directory where you get your starting point files for assignments.
The scripts will look under that directory for another directory with the starting point for each assignment. That directory might be a github repo, or a regular directory.

export GHA_STARTPOINT_DIR=~/courses/cs56/W14/labStartingPoints

Finally, create a scratch directory for working files. This needs to be a writeable directory. It is where various files will be stored that the scripts need to do their job.

export GHA_WORKDIR=~/cs56-W14-gha-work

Here are those again:

export GHA_GITHUB_ORG=UCSB-CS56-W14 export GHA_STUDENT_LIST_URL=https://docs.google.com/spreadsheet/ccc?key=xxxxx export GHA_PAIR_LIST_URL=https://docs.google.com/spreadsheet/ccc?key=xxxxx export GHA_STARTPOINT_DIR=/courses/cs56/W14/labStartingPoints export GHA_WORKDIR=/cs56-W14-gha-work

Creating Student Teams

The first step is to set up singleton pull/pull teams within the Organization, one for each student.

Our naming convention for these is:

Student_githubid

e.g.

Student_csmith89 Student_jennyjones23 etc.

We do this via the script addStudentsToTeams.py--instructions below.

Why do I need do set up singleton teams?

It may seem counterintuitive to create "teams" that have just one member for each student.

However, you NEED TO DO THIS. The reason is that:

  • the way that github supplies "private repos" at no charge for academic courses is via an "Organization"
  • at present (as of Fall 2013) within "Organizations", the only way to manage private repo access is via "teams", not via individual users. (If this changes, setting up these "singleton teams" might no longer be necessary).

So,these singleton teams are the most straightforward way we've found to set up private repos that:

  • are accessible to the individual student, and to the instructional staff (instructor/TAs/graders), but NOT to anyone else
  • don't count against a students 'educataional quota' of five private github accounts, but against the "course" for which github provided free accounts.

If there is another work flow that is simpler and accomplishes the same goals, we are interested to know! Email us, or post a comment in our github repo.

(1) Set environment variables as shown above.

(2) Run the script

./addStudentsToTeams.py

By default, the script tried to use a github user id that matches your current unix userid. If that's not the one you want to use, pass a different one on the command line:

./addStudentsToTeams.py -u mygithubid

Be sure that this github id has admin privileges to the organization that GHA_GITHUB_ORG refers to.

You'll be prompted for the github password that matches this organization.

You should see messages indicating that the teams are being created.

Creating Repositories for An Assignment--FIRST ASSIGNMENT OF COURSE

You should create student team (and if applicable, pair teams and/or group teams) before doing this step.

The first time you create repositories for an assignment in a given course, there is a bit of extra set up. Subsequent assignment setup will be more straightforward.

(1) Create a directory structure for your starting point files

Create a directory for your starting point files. I suggest making this a github repo also---public or private, inside the organization, or not---it doesn't matter. The important thing is that you have a top level directory, for example:

  • ~/courses/CS1/CS1_F13_Labs

and that under that, you have subdirectories with this naming convention:

  • ~/courses/CS1/CS1_F13_Labs/lab00
  • ~/courses/CS1/CS1_F13_Labs/lab01
  • ~/courses/CS1/CS1_F13_Labs/lab02
  • etc.

The prefix "lab00", "lab01", "lab02", etc. can be anything you like---use whatever naming convention makes sense for your course.

Note that if the top level directory (e.g. CS1_F13_Labs in this example) is a github repo, only THAT directory will have git plumbing (i.e., the .git subdirectory) stored in it. That's important, because we don't want to confuse git when our script copies files from that directory into each of the student private repos in a later step.

(2) Into lab00, put the files you want every student to have in their lab00 repo when they first clone it.

(3) Setup an environment variable called GHA_STARTPOINT_DIR (GHA=github academic workflow) that points to this starting point directory, e.g. in bash:

export GHA_STARTPOINT_DIR=../CS56-W14-Labs

The scripts will look in that directory for a subdirectory with the same name as your assignment, e.g.

../CS56-W14-Labs/lab00 ../CS56-W14-Labs/lab01 ../CS56-W14-Labs/lab02 etc.

(4) Run the script

./updateForLab.py lab00

That should create private repos for each student named lab00_student1, lab00_student2, etc, where student1, student2, etc. are the github userids from the CSV file.

Acknowledgements

These scripts were developed by Phill Conrad at UC Santa Barbara.

Assistance was provided by:

  • the instructional staff of CS56 from Spring 2013: Scott Bishop, Aaron Dodon, Leif Dreizler, Jasen Hall, Alex Hamstra, Carina Rammelkamp,

  • Christine Alvarado at UC San Diego, and her instructional support staff for CSE100 F13, especially Arden Liao, who helped to test the scripts and work flows, and provide valuable suggestions on how they could be improved.

About

New Refactored Version of github-acadwf-scripts.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published