XML files generated from LangSci books

Structured data from LangSci books

About

This repository provides linguistic example sentences in XML format that were extracted from open access books published with Language Science Press. There are both glossed and unglossed sentences, depending on the original source.

The actual data files are located in the data directory. Their structure is described by the RELAX NG schema in schemas/LinguisticExamples.rnc.

Documentation

For a short and gentle introduction to linguistic example sentences and their XML representation in this repository, have a look at our user guide.

In addition, we provide some Python demo code that illustrates how the XML files can be parsed and used. The user guide also contains a short description of the demo code.
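As a rough idea of what such parsing can look like, here is a minimal sketch using only the Python standard library. It is not the repository's demo code. The element and attribute names (`examples`, `example`, `word`, `form`, `gloss`, `text`, `translation`, `id`) and the sample sentences are hypothetical and chosen for illustration only; the actual structure is defined in schemas/LinguisticExamples.rnc and may differ.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document. The real element names are defined in
# schemas/LinguisticExamples.rnc and may differ from the ones used here.
SAMPLE = """\
<examples>
  <example id="ex1">
    <words>
      <word><form>Hund-e</form><gloss>dog-PL</gloss></word>
      <word><form>bellen</form><gloss>bark</gloss></word>
    </words>
    <translation>Dogs bark.</translation>
  </example>
  <example id="ex2">
    <text>Es regnet.</text>
    <translation>It is raining.</translation>
  </example>
</examples>
"""


def read_examples(xml_text):
    """Return one dict per <example> element found in xml_text."""
    root = ET.fromstring(xml_text)
    examples = []
    for ex in root.iter("example"):
        # Pair each word form with its gloss; unglossed examples yield no pairs.
        words = [
            (w.findtext("form", default=""), w.findtext("gloss", default=""))
            for w in ex.iter("word")
        ]
        examples.append(
            {
                "id": ex.get("id"),
                "glossed": bool(words),
                "words": words,
                "translation": ex.findtext("translation", default=""),
            }
        )
    return examples


examples = read_examples(SAMPLE)
for ex in examples:
    kind = "glossed" if ex["glossed"] else "unglossed"
    print(ex["id"], kind, ex["translation"])
```

To read one of the files in the data directory instead of the inline sample, `ET.parse(path).getroot()` can replace `ET.fromstring(xml_text)`, with the element names adjusted to match the schema.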

Data sources

The files in the data directory are named after the main author or editor of the book from which the examples were extracted. In particular:

License

Copyright: (c) Language Science Press 2014-2015.

All data, code and documentation in this repository are published under the Creative Commons Attribution 4.0 Licence (CC BY 4.0).