Skip to content

rgriscom/LingMetaX

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

50 Commits
 
 
 
 
 
 
 
 

Repository files navigation

LingMetaX

Author: Richard Griscom (Leiden University Centre for Linguistics)

Description: This script converts linguistic metadata from a .csv format produced by KoBoToolbox to the XML format used by the Lameta linguistic metadata editor. This is useful for linguists who are using the ODK Metadata Method to create linguistic metadata during fieldwork.

The script is available in two versions:

  • Google Colab Notebook - This interactive online version can be run without any installation or setup.
  • Python Script - This version must be downloaded and run on your own computer. You must have Python and the pandas library installed.

Citation: Griscom, Richard T. 2020. LingMetaX. https://github.com/rgriscom/LingMetaX/

Dependencies: pandas

Data Preparation: The script accepts two .csv files as input:

  • Participants CSV - Contains metadata about the participants who have contributed to the creation of linguistic data.
  • Sessions CSV - Contains metadata about recording sessions involving participants.

Metadata must be formatted according to the provided templates provided in this project. See the CSV Template Descriptions for more information. If you have used one of the XLS form templates from the ODK Metadata Method and exported data from KoBoToolbox, then the data will already be in the correct format.

Using the downloadable script: The downloadable Python script version also has the capability of automatically organizing resource files together with metadata. For this feature to work correctly, the following must hold true:

  1. Filenames for all resource files should be identical to the corresponding session ID in the session metadata
  2. Resource files for consent should consist of the participant’s full name followed by “_Consent” (e.g. Mariamu Anyawire_Consent.wav)
  3. Photos of participants should consist of the participant’s full name followed by “_Photo” (e.g. Mariamu Anyawire_Photo.jpg).

Download the script and place it in the same folder as the metadata files, and either in the same folder as the resource files or in the parent folder of the folders which contain the resource files. Python and pandas must be installed in order to successfully run the script.

About

Linguistic metadata from .csv to Lameta XML

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages