Skip to content

Joint Bootstrapping Machines for High Confidence Relation Extraction : NAACL-HLT 2018 Long Paper.

Notifications You must be signed in to change notification settings

pgcool/Joint-Bootstrapping-Machines

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

"Joint Bootstrapping Machines for High Confidence Relation Extraction" in conference proceedings of NAACL-HLT 2018.
	
	
Dependecies:

- Python 2.7, Numpy, NLTK, Gensim, jellyfish, whoosh, etc.

	
Directory Structure:

Joint-Bootstrapping-Machines

  resources
    freebase-easy-14-04-14
	  freebase_facts.txt 
	  - Download from url http://freebase-easy.cs.uni-freiburg.de/dump/
	  
  data
    input
      sentences.txt 
      - Download corpus from url: https://drive.google.com/file/d/0B0CbnDgKi0PyM1FEQXJRTlZtSTg/view)
    output
      BREE
        REL_ACQUIRED_ORG_ORG
          relationships_baseline.txt 
          - The output file containing a list of the relationships extracted from BREE system)
      BRET
        REL_ACQUIRED_ORG_ORG
          relationships_config5.txt
          - The output file containing a list of the relationships extracted from BRET system)
      BREJ
        REL_ACQUIRED_ORG_ORG
          relationships_config9.txt
          - The output file containing a list of the relationships extracted from BREJ system)
		  
    code
      automatic_evaluation
        index_dir
        - Directory of corpus-index
		
        index_whoosh.py
        - To create corpus-index in directory index_dir.
		
        Sentence.py
        - To extract entities infomation, clean and filter.
		
        easy_freebase_clean.py
        - To collect relationships facts from Freebase and prepare databases.
		
        large_scale_evaluation_freebase.py
        - To automatically evaluate relation extraction systems on large-scale 
          (https://akbcwekex2012.files.wordpress.com/2012/05/8_paper.pdf).
			

Usage (Evaluation):
 
    $ python large_scale_evaluation_freebase.py threshold system_output rel_type database root_dir corpus-index
	
	
Example:

    $ cd Joint-Bootstrapping-Machines/code/automatic_evaluation
 
    $ python large_scale_evaluation_freebase.py 0.5 ../../data/output/BREE/REL_ACQUIRED_ORG_ORG/relationships_baseline.txt acquired ../../resources/freebase-easy-14-04-14/freebase_facts.txt ../../data ../../data/input/sentences.txt ./index_dir
	
	
	
	
	
	
	
	

About

Joint Bootstrapping Machines for High Confidence Relation Extraction : NAACL-HLT 2018 Long Paper.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages