Merge branch 'release/0.01-01'

2 parents 6c89139 + cbf6136 commit 8256fe44da8423f874d3aa720fc2737b83a74548 @gaurav committed Mar 31, 2012
4 CHANGELOG.pod
@@ -2,6 +2,10 @@
=over 4
+=item Version 0.01-01 (March 31, 2012)
+
+Updated README.md.
+
=item Version 0.01 (March 30, 2012)
First release.
49 README.md
@@ -0,0 +1,49 @@
+The Junius Henderson Field Note Project
+=======================================
+
+Who was Junius Henderson?
+-------------------------
+
+Junius Henderson was the first curator of the University of Colorado
+Museum of Natural History. Between 1905 and 1931, he kept 13 notebooks
+(1,672 pages in total) detailing his travels across the Southern Rocky
+Mountains of North America and elsewhere. These notebooks were scanned
+by the National Snow and Ice Data Center (NSIDC).
+
+You can read more about him [on Wikipedia](http://en.wikipedia.org/wiki/Junius_Henderson);
+we have uploaded all his notebooks (and some of his photographs)
+[to the Wikimedia Commons](http://commons.wikimedia.org/wiki/Category:Junius_Henderson).
+
+Workflow
+--------
+
+1. Install the `WWW::Wikisource` module (from the `WWW-Wikisource` directory).
+
+2. Run `wikisource2xml.pl 'Index:Name of Index on Wikisource.djvu' > download.xml` to download an XML version of the Wikisource document identified by the given Index. Installing the module should have placed `wikisource2xml.pl` on your path.
+
+3. In the `scripts` directory:
+
+ 1. Run `perl concat.pl download.xml > download_concat.txt`; this will create a "concat" file which combines multiple pages so that entries are divided by `{{new-entry}}` tags.
+
+ 2. Run `perl results.pl download.xml` to calculate per-page statistics for the annotations on each page. Remember to use the `--skip` command line option to skip front matter.
+
+ 3. Similarly, `perl results_concat.pl < download_concat.txt` will generate per-*entry* statistics for annotations. Remember to use the `--skip` command line option to skip entries which cover front matter.
+
+ 4. Finally, run `perl concat2stuff.pl dwc < download_concat.txt > download_dwc.csv` to write out a CSV file using DarwinCore headers.
+
+ 5. You can use `list.pl` and `list_concat.pl` to generate a list of all annotations detected in XML and "concat" files respectively.
+
+External links
+--------------
+
+For more details, please read the following blog posts:
+
+* [An Ode to Founders and a Field Notes Challenge: Part 1](http://soyouthinkyoucandigitize.wordpress.com/2011/11/28/an-ode-to-founders-and-a-field-notes-challenge-part-1/)
+
+* Field Note Challenge Part 2: [Veni, Vidi, Wiki](http://soyouthinkyoucandigitize.wordpress.com/2011/12/05/field-note-challenge-part-2-veni-vidi-wiki/)
+
+* Field Notes Challenge Part 3: [New Year's Digital Resolutions](http://soyouthinkyoucandigitize.wordpress.com/2012/01/06/field-notes-challenge-part-3-new-years-digital-resolutions/)
+
+* Field Notes Challenge Part 4: [Help, 'Cause We Need Somebod(y/ies)](http://soyouthinkyoucandigitize.wordpress.com/2012/01/12/field-notes-challenge-part-4-help-cause-we-need-somebodies/)
+
+* Field Notes Challenge Part 4.5: [JHFNP, Post 4.5](http://soyouthinkyoucandigitize.wordpress.com/2012/01/23/jhfnp-post-4-5/)
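The workflow in the README.md added by this commit can be sketched as a small shell helper. This is a minimal sketch, not part of the commit: the function name `plan` and the optional output prefix are hypothetical, and it only prints the commands from steps 2 and 3 rather than running them. The `--skip` option is left out because the README does not show its argument form.

```shell
# plan INDEX [PREFIX]
# Print the README workflow commands, one per line, without running them.
# INDEX is the Wikisource index title; PREFIX (hypothetical, defaults to
# "download") names the intermediate files.
# Assumption: INDEX contains no single quotes, so it can be wrapped in them.
plan() {
    index=$1
    prefix=${2:-download}

    # Step 2: download the XML version of the Wikisource document.
    printf "wikisource2xml.pl '%s' > %s.xml\n" "$index" "$prefix"

    # Step 3.1: build the "concat" file, with entries split at {{new-entry}}.
    printf 'perl concat.pl %s.xml > %s_concat.txt\n' "$prefix" "$prefix"

    # Steps 3.2 and 3.3: per-page and per-entry annotation statistics.
    # (Add --skip here as the README advises; its syntax is not shown there.)
    printf 'perl results.pl %s.xml\n' "$prefix"
    printf 'perl results_concat.pl < %s_concat.txt\n' "$prefix"

    # Step 3.4: export a CSV file using DarwinCore headers.
    printf 'perl concat2stuff.pl dwc < %s_concat.txt > %s_dwc.csv\n' "$prefix" "$prefix"
}
```

Calling `plan 'Index:Name of Index on Wikisource.djvu' notebook1` prints the five commands for review, using file names that match the `notebook1.xml`, `notebook1_concat.txt`, and `notebook1_dwc.csv` files moved in this commit. Printing rather than executing keeps the sketch safe to try without the Perl scripts installed.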
35 README.pod
@@ -1,35 +0,0 @@
-The Junius Henderson Field Note Project
-
-=over 4
-
-=item Who was Junius Henderson?
-
-L<http://en.wikipedia.org/wiki/Junius_Henderson>
-
-=item What did he write?
-
-L<http://en.wikisource.org/wiki/Author:Junius_Henderson>
-
-=back
-
-For more details, please read the following blog posts:
-
-=over 4
-
-=item An Ode to Founders and a Field Notes Challenge: Part 1
-
-L<http://soyouthinkyoucandigitize.wordpress.com/2011/11/28/an-ode-to-founders-and-a-field-notes-challenge-part-1/>
-
-=item Field Note Challenge Part 2: Veni, Vidi, Wiki
-
-L<http://soyouthinkyoucandigitize.wordpress.com/2011/12/05/field-note-challenge-part-2-veni-vidi-wiki/>
-
-=item Field Notes Challenge Part 3: New Year's Digital Resolutions
-
-L<http://soyouthinkyoucandigitize.wordpress.com/2012/01/06/field-notes-challenge-part-3-new-years-digital-resolutions/>
-
-=item Field Notes Challenge Part 4: Help, 'Cause We Need Somebod(y/ies)
-
-L<http://soyouthinkyoucandigitize.wordpress.com/2012/01/12/field-notes-challenge-part-4-help-cause-we-need-somebodies/>
-
-=back
0 henderson/concat.pl → scripts/concat.pl
File renamed without changes.
0 henderson/concat2stuff.pl → scripts/concat2stuff.pl
File renamed without changes.
0 henderson/list.pl → scripts/list.pl
File renamed without changes.
0 henderson/list_concat.pl → scripts/list_concat.pl
File renamed without changes.
0 henderson/n1_editors.txt → scripts/n1_editors.txt
File renamed without changes.
0 henderson/n1_editors_sorted.txt → scripts/n1_editors_sorted.txt
File renamed without changes.
0 henderson/n1_taxa.csv → scripts/n1_taxa.csv
File renamed without changes.
0 henderson/n1_trail.csv → scripts/n1_trail.csv
File renamed without changes.
0 henderson/n2_editors.txt → scripts/n2_editors.txt
File renamed without changes.
0 henderson/n2_editors_sorted.txt → scripts/n2_editors_sorted.txt
File renamed without changes.
0 henderson/n3_editors.txt → scripts/n3_editors.txt
File renamed without changes.
0 henderson/n3_editors_sorted.txt → scripts/n3_editors_sorted.txt
File renamed without changes.
0 henderson/notebook1.xml → scripts/notebook1.xml
File renamed without changes.
0 henderson/notebook1_concat.txt → scripts/notebook1_concat.txt
File renamed without changes.
0 henderson/notebook1_dwc.csv → scripts/notebook1_dwc.csv
File renamed without changes.
0 henderson/notebook2.xml → scripts/notebook2.xml
File renamed without changes.
0 henderson/notebook2_concat.txt → scripts/notebook2_concat.txt
File renamed without changes.
0 henderson/notebook2_dwc.csv → scripts/notebook2_dwc.csv
File renamed without changes.
0 henderson/notebook3.xml → scripts/notebook3.xml
File renamed without changes.
0 henderson/notebook3_concat.txt → scripts/notebook3_concat.txt
File renamed without changes.
0 henderson/notebook3_dwc.csv → scripts/notebook3_dwc.csv
File renamed without changes.
0 henderson/results.pl → scripts/results.pl
File renamed without changes.
0 henderson/results_concat.pl → scripts/results_concat.pl
File renamed without changes.