CyBERT: Contextualized Embeddings for the Cybersecurity Domain

This repository provides pre-trained weights for CyBERT, a language representation model for the cybersecurity domain. The model can be used for a variety of cybersecurity downstream tasks, such as named entity recognition and multi-class classification.

Refer to the published paper for more information, and please use the citation below when testing or re-purposing the provided materials:

Citation

@inproceedings{ranade2021cybert,
  title={CyBERT: Contextualized Embeddings for the Cybersecurity Domain},
  author={Ranade, Priyanka and Piplai, Aritran and Joshi, Anupam and Finin, Tim and others},
  booktitle={IEEE International Conference on Big Data},
  year={2021}
}

Downloading Models

Model releases are available in the Releases directory. Links to the current models are also listed below:

CyBERT-Base-MLM v1.1 - fine-tuned from BERT-Base-Cased, with an extended cybersecurity vocabulary.

Fine-tuning CyBERT

To fine-tune CyBERT on cybersecurity tasks with the provided models, load the model and extended tokenizer using the Hugging Face from_pretrained method:

from transformers import BertTokenizer, BertForMaskedLM

tokenizer = BertTokenizer.from_pretrained("path to CyBERT directory")
model = BertForMaskedLM.from_pretrained("path to CyBERT directory")
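
For a downstream task such as named entity recognition, the same checkpoint can in principle be loaded into a task-specific head instead of the masked-LM head. The following is a minimal sketch, not the authors' training setup: the helper names (build_label_maps, load_cybert_for_ner) and the example tag set are hypothetical, and the model directory is still a placeholder you must supply. When loaded this way, the encoder weights come from the checkpoint while the classification layer is newly initialized and must be trained on labeled data.

```python
from typing import Dict, List, Tuple


def build_label_maps(labels: List[str]) -> Tuple[Dict[int, str], Dict[str, int]]:
    """Return (id2label, label2id) for an ordered list of tag names."""
    id2label = dict(enumerate(labels))
    label2id = {label: idx for idx, label in id2label.items()}
    return id2label, label2id


def load_cybert_for_ner(model_dir: str, labels: List[str]):
    """Load the CyBERT tokenizer and encoder with a token-classification head.

    model_dir is the local CyBERT directory (placeholder, as in the snippet above).
    """
    # Imported lazily so build_label_maps can be used without transformers installed.
    from transformers import BertForTokenClassification, BertTokenizer

    id2label, label2id = build_label_maps(labels)
    tokenizer = BertTokenizer.from_pretrained(model_dir)
    model = BertForTokenClassification.from_pretrained(
        model_dir,
        num_labels=len(labels),
        id2label=id2label,
        label2id=label2id,
    )
    return tokenizer, model


if __name__ == "__main__":
    # Hypothetical tag set, for illustration only.
    example_labels = ["O", "B-MALWARE", "I-MALWARE"]
    tokenizer, model = load_cybert_for_ner("path to CyBERT directory", example_labels)
```

Passing id2label and label2id stores the tag names in the model configuration, so predictions can be mapped back to readable labels after fine-tuning.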

Contact information

For help or issues, please contact Priyanka Ranade.
