Permalink
Browse files

Revert "tidying"

This reverts commit 94885f8.
  • Loading branch information...
1 parent 94885f8 commit dddf33fb550388881ead465f1b2fce7651b165d6 @sebastianrahtz sebastianrahtz committed May 10, 2015
Showing with 472,353 additions and 0 deletions.
  1. +49 −0 .gitignore
  2. +27 −0 Makefile
  3. +62 −0 README.md
  4. BIN TEISimplespreadsheet.xlsx
  5. BIN advisory/NU.png
  6. BIN advisory/TEI.png
  7. +120 −0 advisory/UNL-logos.svg
  8. BIN advisory/UNL.png
  9. BIN advisory/advisory1-agenda.docx
  10. BIN advisory/elements.png
  11. BIN advisory/mellon.png
  12. +38 −0 advisory/mellon.svg
  13. BIN advisory/ox.png
  14. BIN advisory/report-2015-03-26.pdf
  15. +540 −0 advisory/report-2015-03-26.xml
  16. BIN advisory/report-teimm-2014.pdf
  17. +983 −0 advisory/report-teimm-2014.xml
  18. +142 −0 advisory/report1.xml
  19. BIN advisory/simplebasic.png
  20. BIN advisory/simpleworkflow.png
  21. BIN advisory/simplexsltpe-1.png
  22. BIN advisory/simplexsltpe.png
  23. +40 −0 anthead.xml
  24. +2,306 −0 basisformat.odd
  25. +4,887 −0 basisformat.rng
  26. +283 −0 basisformat.sch
  27. +258 −0 build.xml
  28. +130 −0 count.csv
  29. +194 −0 count.sh
  30. +60,284 −0 count.xml
  31. +42 −0 count.xsl
  32. +104 −0 dta.odd
  33. +2,010 −0 elementsummary.xml
  34. +173 −0 generate.xsl
  35. +3 −0 go.sh
  36. +13 −0 headeronly.xml
  37. +75 −0 isoschematron/ExtractSchFromRNG-2.xsl
  38. +77 −0 isoschematron/ExtractSchFromXSD-2.xsl
  39. +297 −0 isoschematron/iso_abstract_expand.xsl
  40. +1,508 −0 isoschematron/iso_dsdl_include.xsl
  41. +55 −0 isoschematron/iso_schematron_message_xslt2.xsl
  42. +2,299 −0 isoschematron/iso_schematron_skeleton_for_saxon.xsl
  43. +684 −0 isoschematron/iso_svrl_for_xslt2.xsl
  44. +100 −0 isoschematron/readme.txt
  45. +56 −0 isoschematron/sch-messages-cs.xhtml
  46. +57 −0 isoschematron/sch-messages-de.xhtml
  47. +57 −0 isoschematron/sch-messages-en.xhtml
  48. +54 −0 isoschematron/sch-messages-fr.xhtml
  49. +58 −0 isoschematron/sch-messages-nl.xhtml
  50. BIN lib/ant-schematron-2010-04-14.jar
  51. BIN lib/jing.jar
  52. BIN lib/saxon9he.jar
  53. BIN lib/trang.jar
  54. +13,661 −0 listOfClosedAttributeValueListsFromODDs.xml
  55. +103 −0 mapatts.xsl
  56. +2,434 −0 oddanalysis/alldta.odd
  57. +3,661 −0 oddanalysis/allota.odd
  58. +134 −0 oddanalysis/generatedCESR.odd
  59. +98 −0 oddanalysis/generatedOBVIL.odd
  60. +782 −0 oddbyexampleORIGINAL.xsl
  61. +114 −0 ota.odd
  62. +14,082 −0 polygon/AbelLeibmedicus/abel.html
  63. +852 −0 polygon/AbelLeibmedicus/abel.odd
  64. +4 −0 polygon/AbelLeibmedicus/abel.xsl
  65. +13,228 −0 polygon/AbelLeibmedicus/abel_leibmedicus_1699.TEI-P5.xml
  66. BIN polygon/Presentation/presentation.pdf
  67. +411 −0 polygon/Presentation/presentation.xml
  68. +134 −0 polygon/README.md
  69. +6,470 −0 polygon/RomeoJuliet/5721.xml
  70. +9,453 −0 polygon/RomeoJuliet/rj.html
  71. +578 −0 polygon/RomeoJuliet/rj.odd
  72. +3 −0 polygon/RomeoJuliet/rj.xsl
  73. +5,372 −0 polygon/TreasureIsland/5730.xml
  74. +7,825 −0 polygon/TreasureIsland/ti.html
  75. +205 −0 polygon/TreasureIsland/ti.odd
  76. +74 −0 polygon/TreasureIsland/ti.xsl
  77. +759 −0 polygon/html_functions.xsl
  78. +857 −0 polygon/latex_functions.xsl
  79. +347 −0 polygon/polygon_lib.xsl
  80. +43 −0 polygon/simpleoddtoxsl-latex.xsl
  81. +43 −0 polygon/simpleoddtoxsl.xsl
  82. +176 −0 rendnotes
  83. +198 −0 rhb.xml
  84. +723 −0 schematron-skeleton-api.htm
  85. +616 −0 simple-oddbyexample.xsl
  86. +54 −0 simple/pdf/to.xsl
  87. BIN simple/slides/logo.png
  88. +144 −0 simple/slides/to.xsl
  89. +110 −0 simpleelements.xml
  90. +129 −0 tei-pm.html
  91. +228 −0 tei-pm.isosch
  92. +35 −0 tei-pm.nvdl
  93. +881 −0 tei-pm.odd
  94. +174 −0 tei-pm.odd.html
  95. +9,080 −0 tei-pm.rng
  96. +16,919 −0 tei_all.rng
  97. +580 −0 teisimple.html
  98. +189 −0 teisimple.isosch
  99. +748 −0 teisimple.isosch.xsl
  100. +12 −0 teisimple.nvdl
  101. +1,317 −0 teisimple.odd
  102. +4,578 −0 teisimple.odd.html
  103. +4,711 −0 teisimple.rnc
  104. +7,473 −0 teisimple.rng
  105. +327 −0 teisimple.xml
  106. +7,714 −0 teisimple.xsd
  107. +6,470 −0 tests/5721.xml
  108. +5,372 −0 tests/5730.xml
  109. +50,559 −0 tests/A10231.html
  110. +34,895 −0 tests/A10231.xml
  111. BIN tests/nature1.jpg
  112. BIN tests/nature2.jpg
  113. BIN tests/nature3.jpg
  114. BIN tests/portrait.jpg
  115. +10,231 −0 tests/romeojuliet.html
  116. +147 −0 tests/simple.css
  117. +116 −0 tests/simple.js
  118. +33 −0 tests/testrend.xml
  119. +857 −0 tests/testsimple.html
  120. +1,523 −0 tests/testsimple.xml
  121. +233 −0 tests/testsimple2.html
  122. +250 −0 tests/testsimple2.xml
  123. +4,545 −0 tests/testsimple3.html
  124. +5,305 −0 tests/testsimple3.xml
  125. +1,042 −0 tests/testsimple4.html
  126. +1,239 −0 tests/testsimple4.xml
  127. +203 −0 tests/testsimple5.html
  128. +157 −0 tests/testsimple5.xml
  129. +6,118 −0 tests/testsimple6.html
  130. +5,372 −0 tests/testsimple6.xml
  131. +7,978 −0 tests/treasureisland.html
  132. +6,690 −0 texts/docSouth/allen.xml
  133. +9,335 −0 texts/docSouth/baldwin.xml
  134. +3,015 −0 texts/docSouth/bartram.xml
  135. +16,089 −0 texts/docSouth/battlegr.xml
  136. +9,234 −0 texts/docSouth/bayou.xml
  137. +7,347 −0 texts/docSouth/beverley.xml
  138. +5,027 −0 texts/docSouth/bonner.xml
  139. +9,958 −0 texts/docSouth/bookerwashingtonautobiog.xml
  140. +50 −0 texts/french/cesrGrabber.sh
  141. +126 −0 texts/french/obvilGrabber.sh
  142. +6 −0 texts/french/readme.txt
  143. +4,537 −0 texts/wright/wr20001.xml
  144. +7,106 −0 texts/wright/wr20002.xml
  145. +7,781 −0 texts/wright/wr20004.xml
  146. +10,537 −0 texts/wright/wr20012.xml
  147. +12,775 −0 texts/wright/wr20025.xml
  148. +3,123 −0 texts/wright/wr20030.xml
  149. +26 −0 validate.xml
  150. +38 −0 xml.xsd
View
@@ -0,0 +1,49 @@
+
+mybudget2.xlsx
+
+budget.xml
+
+teisimple.xml
+
+TEISimpleProposalNarrative.docx
+
+teisimple.docx
+
+budget.xlsx
+
+budget2.xlsx
+
+d
+
+m
+
+merge.xsl
+
+TEISimplespreadsheet.xml
+
+teisimple.odd.processedodd
+
+~$TEISimplespreadsheet.xlsx
+
+o
+
+temp.xml
+
+v.xml
+
+LOG
+
+report1.pdf
+
+teisimple.odd.compiled
+
+A*.xml
+
+result.xml
+
+r
+
+3002.xml
+
+null.xml
+
View
@@ -0,0 +1,27 @@
+default: teisimple valid pm
+TEXTS=/Users/rahtz/GDrive/Simple/
+XSL=../Stylesheets
+ANT_OPTS="-Xss2m -Xmx752m"
+docx:
+ teitodocx --profile=tei teisimple.xml teisimple.docx
+
+teisimple:
+ ANT_OPTS=${ANT_OPTS} ant -lib lib/saxon9he.jar:lib/jing.jar -DXSL=$(XSL)
+
+pm:
+ ANT_OPTS=${ANT_OPTS} ant -lib lib/saxon9he.jar:lib/jing.jar -DXSL=$(XSL) pm
+
+valid:
+ xmllint --xinclude teisimple.odd > xsimple.odd
+ java -jar lib/jing.jar tei-pm.rng xsimple.odd
+ rm xsimple.odd
+
+validate:
+ cat anthead.xml> v.xml
+ find "$(TEXTS)" -name "*.xml" | perl -p -e 's:(.*)/([A-z0-9_\-\.]+).xml:<dojob file="\2.xml" name="\1/\2.xml"/>:' >> v.xml
+ echo "</target></project>" >> v.xml
+ ANT_OPTS=${ANT_OPTS} ant -lib lib/saxon9he.jar:lib/jing.jar -Dxsl=${XSL} -Dbasedir=`pwd` -f v.xml
+
+
+#ANT_OPTS="-Xss2m -Xmx752m" ant -lib lib/saxon9he.jar:lib/jing.jar -Dbasedir=`pwd` -f v.xml
+
View
@@ -0,0 +1,62 @@
+TEI-Simple
+==========
+TEI Simple aims to define a new _highly-constrained_ and _prescriptive_ subset of the Text Encoding Initiative (TEI) Guidelines suited to the representation of early modern and modern books, a formally-defined set of processing rules which permit modern web applications to easily present and analyze the encoded texts, mapping to other ontologies, and processes to describe the encoding status and richness of a TEI digital text.
+
+# Useful links
+
+Documented ODD for Simple: http://htmlpreview.github.io/?https://github.com/TEIC/TEI-Simple/blob/master/teisimple.html for differences
+and http://htmlpreview.github.io/?https://github.com/TEIC/TEI-Simple/blob/master/teisimple.odd.html for full version
+
+Documented extensions for processing: http://htmlpreview.github.io/?https://github.com/TEIC/TEI-Simple/blob/master/tei-pm.html for differences and http://htmlpreview.github.io/?https://github.com/TEIC/TEI-Simple/blob/master/tei-pm.odd.html for full version
+
+TEI Simple schema (RELAX NG): https://raw.githubusercontent.com/TEIC/TEI-Simple/master/teisimple.rng
+
+Test file https://github.com/TEIC/TEI-Simple/blob/master/tests/testsimple.xml (before) and http://htmlpreview.github.io/?https://github.com/TEIC/TEI-Simple/blob/master/tests/testsimple.html (after)
+
+# Background
+
+The Text Encoding Initiative (TEI) has developed over 20 years into a key technology in text-centric humanities disciplines, with an extremely wide range of applications, from diplomatic editions to dictionaries, from prosopography to speech transcription and linguistic analysis. It has been able to achieve its range of use by adopting a descriptive rather than prescriptive approach , by recommending customization to suit particular projects, and by eschewing any attempt to dictate how the digital texts should be rendered or exchanged. However, this flexibility has come at the cost of relatively limited success in interoperability. In our view there is a distinct set of uses (primarily in the area of digitized ‘European’-style books) that would benefit from a prescriptive recipe for digital text; this will sit alongside other domain-specific, constrained TEI customizations, such as the very successful Epidoc in the epigraphic community. TEI-Simple may become a prototype for a new family of constrained customizations. For instance, a TEI Simple MS for manuscript based work could be built on top of the ENRICH project, drawing on many of the lessons and some of the code for TEI Simple.
+
+The TEI has long maintained an introductory subset (TEI Lite) and a constrained customization for use in outsourcing production to commercial vendors (TEI Tite), but the former permits enormous variation and the latter neglects anything requiring expertise in the source material. More importantly, neither addresses processing of the encoded text. The present project can be viewed in some ways as a revision of TEI Lite, re-examining the basis of the choices therein, focusing it for a more specific area, and adding a "cradle to grave" processing model that associates the TEI Simple schema with explicit and standardized options for displaying and querying texts. This means being able to specify what a programmer should do with particular TEI elements when they are encountered, allowing programmers to build stylesheets that work for everybody and to query a corpus of documents reliably.
+
+This proposal, TEI Simple, will focus on interoperability, machine generation, and low-cost integration. The TEI architecture facilitates customizations of many kinds; TEI Simple aims to produce a complete 'out of the box' customization which meets the needs of the many users for whom the task of creating a customization is daunting or seems irrelevant. TEI Simple in no way intends to constrain the expressive liberty of encoders who do not think that it is either possible or desirable to follow this path. It does, however, promise to make life easier for those who think there is some virtue in travelling that path as far as it will take you, which for quite a few projects will be far enough. Some users will never feel the need to move beyond it, others will outgrow it, and when they do they will have learned enough to do so.
+
+‘Comparability and interoperability with other resources’ are an increasingly important topic on various Digital Humanities agendas. Echoes of it are found in a recent ‘work set construction’ Mellon grant to the Hathi Trust Research Centre. Under the heading ‘Wissenschaftliche Sammlungen’ it is a major part of an ambitious DARIAH project anchored at the SUB Göttingen. Progress towards it may be slow, tedious, and partial, but ‘simplicity, interoperability, broad use and reuse’, and ‘comparability and interoperability with other resources’ are important goals to keep in mind for many purposes. For a lot of current and future users of the TEI the really important benefits come from the simple stuff, and beyond some level of complexity they begin to feel some sympathy with Andrew Prescott's not very kind phrase about ‘angels dancing on angle brackets.’ [http://ahh.sagepub.com/content/early/2011/11/30/1474022211428215]
+
+A major driver for this project is the texts created by phase 1 of the EEBO-TCP project, which will be placed in the public domain on 1 January 2015. Another 45,000 texts will join over the following five years, creating by 2020 an archive of 70,000 consistently encoded books published in England from 1475 to 1700, including works of literature, philosophy, politics, religion, geography, science and all other areas of human endeavor. When we compare the query potential of the EEBO TCP texts in their current and quite simple encoding with flat file versions of those text, it is clear that the difference in query potential is very high, especially if you add to that coarse encoding simple forms of linguistic annotation or named entity tagging that can be added in a largely algorithmic fashion. During 2012 and 2013 extensive work has been undertaken at Northwestern, Michigan and Oxford to enrich these texts and bring them into line with the current TEI Guidelines (where necessary working with the TEI to modify the Guidelines). TEI Simple will use this corpus as a point of departure and will provide its users with a friendlier environment for manipulating EEBO texts in various projects. But TEI Simple should not be understood as an EEBO specific project. We believe that, given the extraordinary degree of internal diversity in the EEBO source files, a project that starts from them can, with appropriate modifications, accommodate a wide range of printed texts differing in language, genre, or time and place of origin.
+
+# Objectives
+
+TEI Simple has the following high-level objectives:
+
+1. Definition of a new highly constrained and prescriptive subset of the Text Encoding Initiative (TEI) Guidelines suited to the representation of early modern and modern books. The degree of detail supported will be sufficient to encompass, at a minimum, the current practices of the TCP's EEBO, ECCO, and Evans collections plus those of other major European initiatives such as Text Grid or the DTA in Germany, and the Consortium Cahier in France.
+
+1. Creation of a notation (as an extension to TEI's ODD metalanguage) for specifying processing rules for TEI encoded texts, referencing web standards such as XPath, CSS and XSL FO.
+The paper on “Documenter des “attentes applicatives” (processing expectations)” by Frédéric Glorieux and Vincent Jolivet at TEI Members Meeting 2013 (http://digilab2.let.uniroma1.it/teiconf2013/program/papers/abstracts-paper/ ) also addresses this area.
+
+1. Reference implementations of processing rules defined for this TEI subset.
+
+1. Formal mapping of the TEI elements used by Simple to the CIDOC CRM, allowing for full interoperability with the Europeana Data Model, in order to facilitate the participation of projects in the Europeana repositories.
+
+1. Definition and implementation of machine-readable descriptions of the encoding status and richness of TEI texts, providing a “TEI Performance Indicators” indicating to a user what they can expect to use the text for.
+
+The aim is to lower the access barriers to working with TEI-encoded texts in various web environments. Programmers familiar with a particular web environment, whether Django, Drupal, eXist, Ruby on Rails, or others will be able to integrate TEI Simple-based projects into their environment with moderate effort and with no more than their usual tools and skills.
+
+The project will adhere to the following principles:
+
+* As little overlap as possible, and as much compatibility as possible, with existing repository projects
+
+* At least as prescriptive as level 3 of the Best Practices for TEI in Libraries
+
+* Encompassing I18N principles at all times
+
+* Useable implementations of all features
+
+Outcomes from TEI Simple, consisting of a documented definition in ODD of the TEI subset, a set of processing rules, and extensions to the TEI ODD language to record processing expectations, will be fully integrated into the TEI infrastructure with ongoing maintenance by the TEI Technical Council.
+
+TEI Simple is intended to be complementary to community projects like the TAPAS project, and to the established work of TextGrid, the German Text Archive (the DTA ‘base format’, which shares many of the goals of TEI Simple) and other national projects.
+
+
+# Trello Board
+
+![Trello Board](https://trello.com/b/FYYJk2FC.png "Trello Board")
Binary file not shown.
View
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
View
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Oops, something went wrong.

0 comments on commit dddf33f

Please sign in to comment.