Description
Hello!
Thank you for putting together and releasing these exercises. I am currently going through Exercise 6 and I noticed that the data dictionary lookups for the Alyawarra (kinship) dataset (data/kinship/bin/idx2ent.npy
and data/kinship/bin/idx2rel.npy
) appear to only contain indices and not the names of the entities/relations. It would seem that this information is necessary for the interpretability of the t-SNE visualization and k-NN of the entity/relation embeddings.
I did find a dictionary for the original Alyawarra dataset from 1971 at Kinsources. However, the codes for the relation types there range from 1 to 29, while the indices in data/kinship/bin/idx2rel.npy
range from 0 to 25, so I am not sure of the mapping between these two sets of values.
Is there anything that I am missing? Any insights would be very helpful.
Many thanks!