Description

This repository contains the experimental code used to pre-train the KBIR and KeyBART models, as described in Learning Rich Representation of Keyphrases from Text (https://arxiv.org/pdf/2112.08547.pdf), to appear in Findings of NAACL 2022.

Some of the code builds on HuggingFace Transformers (https://github.com/huggingface/transformers) and takes inspiration from SpanBERT (https://github.com/facebookresearch/SpanBERT).

Running the pre-training

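Use the two bash scripts at the top level of the repository to run pre-training: run_pretrain_kp_infill_replacement_oagkx.sh and run_pretrain_kp_infill_replacement_bart_kg_oagkx.sh. Judging by the file names, the first is for KBIR and the second (with bart in its name) is for KeyBART.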
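
If you prefer to launch a run from Python rather than from the shell, a minimal sketch could look like the one below. The mapping from model name to script is an assumption inferred from the script file names only, and the helper names (PRETRAIN_SCRIPTS, build_command, run_pretraining) are hypothetical and not part of this repository. The scripts may contain paths and settings that need reviewing before a real run.

```python
import subprocess
from pathlib import Path
from typing import List

# ASSUMPTION: which script belongs to which model is inferred from the
# file names (the one containing "bart" is taken to be KeyBART).
PRETRAIN_SCRIPTS = {
    "KBIR": "run_pretrain_kp_infill_replacement_oagkx.sh",
    "KeyBART": "run_pretrain_kp_infill_replacement_bart_kg_oagkx.sh",
}


def build_command(model_name: str, repo_root: str = ".") -> List[str]:
    """Return the argv list that would launch pre-training for model_name."""
    if model_name not in PRETRAIN_SCRIPTS:
        raise ValueError(f"unknown model: {model_name!r}")
    script = Path(repo_root) / PRETRAIN_SCRIPTS[model_name]
    return ["bash", str(script)]


def run_pretraining(model_name: str, repo_root: str = ".") -> None:
    """Run the script and raise CalledProcessError if it exits non-zero."""
    subprocess.run(build_command(model_name, repo_root), check=True)
```

Calling build_command("KBIR") only builds the command, so you can inspect it before run_pretraining("KBIR") actually starts the script.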

Accessing Pre-trained models

Models are uploaded to HuggingFace along with Model Cards describing usage.

KBIR: https://huggingface.co/bloomberg/KBIR

KeyBART: https://huggingface.co/bloomberg/KeyBART
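
As a rough illustration of loading the published checkpoints with the HuggingFace Transformers Auto classes, here is a minimal sketch. The choice of AutoModel for KBIR and AutoModelForSeq2SeqLM for KeyBART is an assumption based on KBIR being an encoder and KeyBART being a BART-style generator; the Model Cards remain the authoritative usage reference. The names MODEL_IDS, model_url and load_pretrained are hypothetical helpers introduced only for this example.

```python
# Hub identifiers taken from the model URLs listed above.
MODEL_IDS = {
    "KBIR": "bloomberg/KBIR",
    "KeyBART": "bloomberg/KeyBART",
}


def model_url(name: str) -> str:
    """Return the HuggingFace page for one of the two models."""
    return "https://huggingface.co/" + MODEL_IDS[name]


def load_pretrained(name: str):
    """Download (or load from cache) the tokenizer and model for `name`."""
    # Imported lazily so the helpers above work without transformers installed.
    from transformers import AutoModel, AutoModelForSeq2SeqLM, AutoTokenizer

    model_id = MODEL_IDS[name]
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # ASSUMPTION: KeyBART is loaded as a seq2seq model, KBIR as a plain encoder.
    model_cls = AutoModelForSeq2SeqLM if name == "KeyBART" else AutoModel
    model = model_cls.from_pretrained(model_id)
    return tokenizer, model
```

A call such as tokenizer, model = load_pretrained("KeyBART") needs network access on first use and the transformers package installed.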

Citation

  @article{kulkarni2021kbirkeybart,
    title={Learning Rich Representation of Keyphrases from Text},
    author={Mayank Kulkarni and Debanjan Mahata and Ravneet Arora and Rajarshi Bhowmik},
    journal={arXiv preprint arXiv:2112.08547},
    year={2021}
  }

License

KBIR and KeyBART are released under the Apache 2.0 license. The license also applies to the pre-trained models.

Contact

For any questions, reach out to mkulkarni24@bloomberg.net.
