Merge pull request #33 from sigmavirus24/master

Just a minor pet peeve, fix readme.rst
commit fea571425f7d0944478fd0a0a77981f3647eda5d 2 parents b0b4f36 + 3680d72
@sethwoodworth authored
Showing with 31 additions and 19 deletions.
  1. +31 −19 README.rst
50 README.rst
@@ -1,56 +1,67 @@
Project Gutenberg Stats
=======================
+
Estimated 1.6 million files
Reported 650 GB total
~40,000 + books
`link to issues`_
-.. _link to issues: ./issues
+.. _link to issues: https://github.com/sethwoodworth/GITenberg/issues
How are we getting the files?
=============================
-rsync -rvhz --progress --partial ftp...
+
+::
+
+ rsync -rvhz --progress --partial ftp...
Each repo should...
===================
- + metadata.yml
- + author
- + title
- + publishing info
- + provinence
- + book_name.{rst|tei|txt}
- + book text in a master source format
- + license.txt
- + PG license information
- + transcriber, converter credits
- + README.rst
- + generic GITenburg info
- + generic PG info
- + book specific info
- + desc and links to toolchains
- + desc and links to generated versions for ebook readers
+
++ metadata.yml
+ + author
+ + title
+ + publishing info
+ + provinence
++ book_name.{rst|tei|txt}
+ + book text in a master source format
++ license.txt
+ + PG license information
+ + transcriber, converter credits
++ README.rst
+ + generic GITenburg info
+ + generic PG info
+ + book specific info
+ + desc and links to toolchains
+ + desc and links to generated versions for ebook readers
Smart comments:
===============
+
Convert all files to UTF-8
https://groups.google.com/forum/?fromgroups#!topic/prj-alexandria/VhKbMyH9kcA
File formats:
=============
+
A list of file formats and their freqency is in the docs folder, generated via:
+::
+
find -type f|rev|cut -d\. -f1|grep -v "/" |rev|sort -f|uniq -c|sort -nr
.tei
~~~~
+
a master format
http://www.tei-c.org/Tools/Stylesheets/
http://code.google.com/p/hrit/source/browse/rst2xml-tei.py?repo=tei-rest
.rst
~~~~
+
a master format
Research toolchain for rst >> whatever
@@ -58,4 +69,5 @@ dp rst manual http://pgrst.pglaf.org/publish/181/181-h.html
Future
------
- + http://armypubs.army.mil/doctrine/
+
++ http://armypubs.army.mil/doctrine/
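
The shell pipeline quoted in the README's "File formats" section (``find -type f|rev|cut ...|sort -nr``) tallies files by the text after the last dot in their name and lists the most common extensions first. For readers who would rather not decode the ``rev``/``cut`` trick, here is a rough Python sketch of the same tally. The function name ``extension_counts`` is hypothetical and not part of the repository, and unlike ``find -type f`` this version does not filter out symlinks or other non-regular files.

```python
import os
from collections import Counter


def extension_counts(root):
    """Count files under `root` by extension, most frequent first.

    Hypothetical helper that approximates the README's shell pipeline:
    files whose name contains no dot are skipped (the pipeline drops
    them with `grep -v "/"`), and extensions are counted
    case-sensitively, as `uniq -c` does.
    """
    counts = Counter()
    for _dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if "." not in name:
                continue  # no extension to count
            # Text after the last dot, e.g. "txt" for "book.tar.txt".
            ext = name.rsplit(".", 1)[1]
            counts[ext] += 1
    # List of (extension, count) pairs, highest count first.
    return counts.most_common()
```

Calling ``extension_counts(".")`` from the root of a local mirror returns a list of ``(extension, count)`` pairs that can be printed or written to the docs folder. Ties may come out in a different order than in the shell version.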