Skip to content

UCREL/CL-metaheaders

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CL-MetaHeaders

An open standard for metadata interchange via hidden headers for computational and corpus linguistics.

Corpus researchers, along with many other disciplines in science are being put under continual pressure to show accountability and reproducibility in their work. This is unsurprisingly difficult when the researcher is faced with a wide array of methods and tools through which to do their work; simply tracking the operations done can be problematic, especially when toolchains are often configured by the developers, but left largely as a black box to the user. Here we present a scheme for encoding this `meta data' inside the corpus files themselves in a structured data format, along with a proof-of-concept tool to record the operations performed on a file.

We are currently working towards completing version 1.0.x, but the draft documentation is available here in LaTeX format in the Active Draft/ folder.

Website and Validation Tool

About

Metadata header format for computational and corpus linguistics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published