LogRoBERTa: An Innovative Model for Detecting Hidden Anomalies in Long and Complex Log Sequences

This paper introduces LogRoBERTa, a model for log anomaly detection. LogRoBERTa uses a pre-trained RoBERTa model to capture contextual information from software logs and to represent complex log structures, then applies an attention-based Bi-LSTM to detect anomalies.
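The attention step that sits on top of the Bi-LSTM can be illustrated with a small sketch. It shows only how per-position hidden states might be pooled into a single sequence vector, using plain Python so it runs without any deep learning library. The dot-product scoring, the `query` vector, and the function names are assumptions made for illustration; the actual layers are defined in LogRoBERTa_HDFS.py and LogRoBERTa_BGL.py and may differ.

```python
import math
from typing import List, Sequence, Tuple


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(
    hidden_states: Sequence[Sequence[float]],
    query: Sequence[float],
) -> Tuple[List[float], List[float]]:
    """Pool T hidden states (each of size D) into one context vector.

    Assumption: each position is scored by a dot product with `query`,
    which would be a learned parameter in a trained model.
    """
    # One relevance score per log position.
    scores = [sum(h * q for h, q in zip(state, query)) for state in hidden_states]
    weights = softmax(scores)
    dim = len(hidden_states[0])
    # Weighted sum of the hidden states, dimension by dimension.
    context = [
        sum(w * state[d] for w, state in zip(weights, hidden_states))
        for d in range(dim)
    ]
    return context, weights


# Hypothetical hidden states for a three-event log sequence.
states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
context, weights = attention_pool(states, query=[1.0, 1.0])
```

In this toy input the third position has the largest score, so it receives the largest weight and contributes most to `context`. A classifier layer would then take `context` as its input.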

Datasets

LogRoBERTa and the other benchmark models are evaluated on the HDFS and BGL datasets.
Because the datasets are large, the original files are not included in the GitHub repository. You can download the corresponding dataset files from the provided link.

Experiment

The code for the LogRoBERTa model is in LogRoBERTa_HDFS.py and LogRoBERTa_BGL.py. LogRoBERTa_HDFS.py targets the HDFS dataset, and LogRoBERTa_BGL.py targets the BGL dataset.
The configurations and results of the comparative experiments described in the paper's Section V. EXPERIMENTS -> C. RQ3: Comparative Enhancement Effects of Each Module in LogRoBERTa are located in the module folder.

TABLE IV: Settings of the Three Modules.

Model	Module 1	Module 2	Module 3
LogRoBERTa	RoBERTa	Bi-LSTM	Attention
Comparison 1	BERT	Bi-LSTM	Attention
Comparison 2	-	Bi-LSTM	Attention
Comparison 3	RoBERTa	LSTM	Attention
Comparison 4	RoBERTa	Bi-LSTM	-
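The settings in Table IV can also be written down as data, which makes it easy to see which single module each comparison changes. This is a minimal sketch: the `ModuleConfig` class, its field names, and the `changed_modules` helper are hypothetical and do not come from the repository, and reading "-" as "module omitted" is an assumption.

```python
from dataclasses import asdict, dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class ModuleConfig:
    encoder: Optional[str]  # Module 1: pre-trained encoder, None if omitted
    recurrent: str          # Module 2: recurrent layer type
    attention: bool         # Module 3: whether the attention layer is used


# Rows of Table IV, in the same order.
ABLATIONS: Dict[str, ModuleConfig] = {
    "LogRoBERTa": ModuleConfig("RoBERTa", "Bi-LSTM", True),
    "Comparison 1": ModuleConfig("BERT", "Bi-LSTM", True),
    "Comparison 2": ModuleConfig(None, "Bi-LSTM", True),
    "Comparison 3": ModuleConfig("RoBERTa", "LSTM", True),
    "Comparison 4": ModuleConfig("RoBERTa", "Bi-LSTM", False),
}


def changed_modules(name: str) -> List[str]:
    """Return the fields in which a variant differs from full LogRoBERTa."""
    base = asdict(ABLATIONS["LogRoBERTa"])
    variant = asdict(ABLATIONS[name])
    return [key for key in base if base[key] != variant[key]]


for model_name in ABLATIONS:
    print(model_name, changed_modules(model_name))
```

Each comparison differs from the full model in exactly one field, so a difference in results can be attributed to that module.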
