Skip to content

allenai/S2abEL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

S2abEL: A Dataset for Entity Linking from Scientific Tables

This repository provides access to the dataset S2abEL and is the official implementation of S2abEL: A Dataset for Entity Linking from Scientific Tables.

Installation

To create a conda environment named s2abel and install requirements run:

git clone https://github.com/allenai/S2abEL.git
cd S2abEL
conda create -y --name s2abel python==3.9.5
conda activate s2abel
chmod +x install.sh
./install.sh

Note that you might need to change torch version in install.sh depending on your CUDA version.

Datasets

We release the following S2abel datasets: ExtractedTables & Papers, CellTypeClassification, AttributedSourceMatching, and EntityLinking here: https://github.com/allenai/S2abEL/blob/main/data/release_data.tar.gz.

Detailed data schema is here.

Training and Evaluation

We offer notebooks for training and evaluating each of the sub-tasks mentioned in the paper, namely: cell type classification, attributed source mathing & candidate generation, and entity disambiguation.

License

S2abEL is released under the Apache 2.0 license.

Citation

@misc{lou2023s2abel,
      title={S2abEL: A Dataset for Entity Linking from Scientific Tables}, 
      author={Yuze Lou and Bailey Kuehl and Erin Bransom and Sergey Feldman and Aakanksha Naik and Doug Downey},
      year={2023},
      eprint={2305.00366},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published