Skip to content

broken-dream/FGCS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 

Repository files navigation

FGCS

repo for paper 《FGCS: A Fine-grained Scientific Information Extraction Dataset in Computer Science》

Data format

Each line represents a instance and is organized in the following format:

{
  "tokens": ["We", "start", "from", "analyzing", "the", "procedures", ...],
  "ners": [
    [16, 16, "Metric"], 
    [13, 13, 16, 16, "Metric"],
    ...
  ],
  "relations": [
    [0, 1, "Hyponym-of"],
    ...
  ],
  "discontinuous": true
}

Entries in ners are organized as [token_start, token_end, type] for general entities and [first_span_start, first_span_end, second_span_start, second_span_end, type] for discontinuous entities.

Entries in relations are organized as [head_entity_index, tail_entity_index, type].

discontinuous indicated whether there is a discontinuous entity in this sentence.

About

repo for paper 《FGCS: A Fine-grained Scientific Information Extraction Dataset in Computer Science》

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published