Skip to content

rhaver/Starling-cs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Accessing Starling Databases

This project provides documentation on the file structure of Starling databases and contains code for a tool to view and extract data from them.

What is Starling?

According to the Starling website:

STARLING is a software package designed by Sergei Starostin for various types of linguistic text and database processing, including handling of linguistic fonts in the DOS and WINDOWS operating systems, operations with linguistic databases and Internet presentation of linguistic data.

Starling seems to have been first developed in the 1980s and its database file structure still reflects this fact. Furthermore it was crafted in a time before Unicode and therefore uses its own character encodings. It also incorporates a custom markup format.

Despite its relatively widespread use in certain linguistic circles, it is a virtually undocumented piece of software with a user interface that is far from intuitive. In order to have access to data sets that are only available in Starling format, without having to depend on Starling itself, this project contains information on its database file structure, as well as a tool to read Starling databases.

Starling database structure

The Starling database file (DBF) structure is based on the dBase III format. It has been largely reverse engineered by prof.dr. Brouwer and the documentation provided here is an extension of his work.

A description of the Starling database file structure can be found in the document entitled Accessing Starling Databases.

StarlingDBF reader

A small tool to read and export the data contained within Starling databases is the StarlingDBF reader. It can be used to:

  • Open a Starling database (DBF) file
  • View its contents and its metadata (such as print settings) as Unicode text
  • Perform Unicode normalization
  • Get a count on the absolute frequency of all characters in the database
  • Correct the nesting and/or replace the notation of markup tags
  • Export as Unicode to a CSV file

StarlingDBF reader 1.0 screenshot

Source code

The source code is written in C# and included as a Microsoft Visual C# Express 2010 solution.

Windows executable

A windows executable can be found here.

Note that upon opening a Starling database file, any unknown character that it comes across is reported in the log.

About

Starling database description and tools

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages