Skip to content

Commit

Permalink
Update README with data link
Browse files Browse the repository at this point in the history
  • Loading branch information
acrule committed Mar 7, 2018
1 parent 641dbcf commit f1a03c0
Showing 1 changed file with 4 additions and 2 deletions.
6 changes: 4 additions & 2 deletions readme.md
Original file line number Diff line number Diff line change
@@ -1,8 +1,10 @@
# Jupyter Notebooks on Github
In July 2017 we searched for, downloaded, and analyzed the approximately 1.3
million public Jupyter Notebooks on Github at the time. This repository includes
the notebooks used to query and analyze that dataset. We are working on making the
raw notebook data (about 600 Gb) available for public use.
the scripts used to query and analyze that dataset. The full dataset is now
[available online](https://library.ucsd.edu/dc/collection/bb6931851t) thanks to
hosting provided by the UC San Diego Library. The full dataset is nearly 600GB
so we have created a smaller 5GB sampler dataset for you to get started.

In our analysis, we looked primarily at how notebooks employ narrative (operationalized as
markdown text). Our main finding was that many notebooks (~27%) include no
Expand Down

0 comments on commit f1a03c0

Please sign in to comment.