DEPRECATED WARNING

We are not using correspondentia anymore

Correspondentia

Python library to map correspondence tables in different formats to data structures.

A quick example:

from correspondentia import match_fields

numbers_to_names = {
    1: [{"value": "one", "type": "exact"}],
    2: [{"value": "two", "weight": 0.5, "type": "disaggregation"},
        {"value": "deux", "weight": 0.5, "type": "disaggregation"}],
}

my_data = [{
    'count': 1,
    'name': 'foo'
}, {
    'count': 2,
    'name': 'bar'
}]

list(match_fields(my_data, numbers_to_names, "count"))
> [{'count': 'one', 'name': 'foo'},
   {'count': 'two', 'name': 'bar', 'correspondentia_allocation': 0.5},
   {'count': 'deux', 'name': 'bar', 'correspondentia_allocation': 0.5}]

match_fields return a generator.

Input data

Input data should be an iterable of objects supporting the dictionary interface.

Input tables

correspondentia currently can import the following formats:

CSVs following the simple schema

We plan to also eventually support the following:

RDF (Turtle) correspondence tables following the BONSAI spec
CSVs with BONSAI ontology predicates

You can also write custom importers, or define correspondence tables manually. In either case, the correspondence table data should include at least the following fields (additional fields are also allowed):

{
    "label in origin schema (usually str, but can be int or float)": {
        "value": "label in destination schema (usually str, but can be int or float)",
        "type": one of ["exact", "disaggregation"],
        "weight": float, # optional
    }
}

Simple CSV schema for input tables

A CSV with two required and one optional columns.

First column: Label in origin schema
Second column: Label in destination schema
Third column (optional): Weight used for disaggregation.

If matching is 1-N or N-1, just use multiple rows with redundant labels.

CSVs should follow the Open Knowledge CSV spec. Do not use column headers.

Installation

Installation via normal pathways; currently has no dependencies.

Contributing

Follow standard fork/pull-request procedure.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
ci		ci
correspondentia		correspondentia
tests		tests
.appveyor.yml		.appveyor.yml
.gitignore		.gitignore
.travis.yml		.travis.yml
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
pytest.ini		pytest.ini
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DEPRECATED WARNING

Correspondentia

Input data

Input tables

Simple CSV schema for input tables

Installation

Contributing

About

Releases

Packages

Contributors 2

Languages

License

BONSAMURAIS/correspondentia

Folders and files

Latest commit

History

Repository files navigation

DEPRECATED WARNING

Correspondentia

Input data

Input tables

Simple CSV schema for input tables

Installation

Contributing

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages