Skip to content

Alyawarra (kinship) data dictionary is missing names for entities/relations #2

Open
@cmeb45

Description

@cmeb45

Hello!

Thank you for putting together and releasing these exercises. I am currently going through Exercise 6 and I noticed that the data dictionary lookups for the Alyawarra (kinship) dataset (data/kinship/bin/idx2ent.npy and data/kinship/bin/idx2rel.npy) appear to only contain indices and not the names of the entities/relations. It would seem that this information is necessary for the interpretability of the t-SNE visualization and k-NN of the entity/relation embeddings.

I did find a dictionary for the original Alyawarra dataset from 1971 at Kinsources. However, the codes for the relation types there range from 1 to 29, while the indices in data/kinship/bin/idx2rel.npy range from 0 to 25, so I am not sure of the mapping between these two sets of values.

Is there anything that I am missing? Any insights would be very helpful.

Many thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions