SEMA-JOIN: Joining Semantically-Related Tables Using Big Table Corpora

Yeye-He/Semantic-Join

Semantic-Join

Overview

This is the benchmark data set used in our experiments described here.

There are 50 test cases of joinable web tables, collected from Google Tables. Each test case has two key columns taken from two separate tables. The two columns are not equi-join-able, but they have semantic relationships that can be used to produce joins.

Data set description

There are two files per test case:

  • CaseN_input.txt: the test case itself, containing two key columns from two separate tables, separated by an empty line.
  • CaseN_ground.txt: the manually labelled ground-truth join result, with join-able keys in the same row.
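
The file layout above can be read with a few lines of Python. This is only a minimal sketch, not part of the benchmark: the function names (`parse_input`, `parse_ground`, `load_case`) are hypothetical, and it assumes one key value per line in the input file, a tab between the two keys of each ground-truth row, and UTF-8 encoding. None of these details is stated in this README, so adjust `sep` and `encoding` to match the actual files.

```python
from pathlib import Path


def parse_input(text: str) -> tuple[list[str], list[str]]:
    """Split the body of a CaseN_input.txt file into its two key columns.

    Assumption: one key value per line, columns separated by blank line(s).
    """
    columns: list[list[str]] = [[]]
    for line in text.splitlines():
        value = line.strip()
        if value:
            columns[-1].append(value)
        elif columns[-1]:
            # A blank line after at least one value starts the next column.
            columns.append([])
    columns = [column for column in columns if column]
    if len(columns) != 2:
        raise ValueError(f"expected 2 key columns, found {len(columns)}")
    return columns[0], columns[1]


def parse_ground(text: str, sep: str = "\t") -> list[tuple[str, str]]:
    """Read ground-truth rows as (left_key, right_key) pairs.

    Assumption: the two join-able keys of a row are separated by `sep`
    (a tab by default); the README does not name the delimiter.
    """
    pairs: list[tuple[str, str]] = []
    for line in text.splitlines():
        if not line.strip():
            continue
        left, _, right = line.partition(sep)
        pairs.append((left.strip(), right.strip()))
    return pairs


def load_case(directory: str, n: int, encoding: str = "utf-8"):
    """Load test case `n` from `directory` (the encoding is an assumption)."""
    base = Path(directory)
    left, right = parse_input(
        (base / f"Case{n}_input.txt").read_text(encoding=encoding)
    )
    ground = parse_ground(
        (base / f"Case{n}_ground.txt").read_text(encoding=encoding)
    )
    return left, right, ground
```

For example, `left, right, ground = load_case(".", 1)` would read `Case1_input.txt` and `Case1_ground.txt` from the current directory, provided the files follow the assumed layout.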
