WulffHunter/log_generator

Apache Log Translator

This repository contains tools for generating synthetic Apache logs and for parsing the real (empirical) reference logs. See the paper "On Automatic Parsing of Log Records" for details.

Apache Fake Log Generator

For the Apache Fake Log Generator tool, please see the ./generator/ folder. A complete description of how to use the tool is given in the ./generator/README.md file.

Real Log Converter

For the tool used to parse the real logs VA, VB, and VC and convert them into a format ingestible by a machine learning model, please see the ./real_log_cleaner folder. The origins of the logs are as follows: VA, VB, and VC. Additional details are given in the ./real_log_cleaner/README.md file.
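For orientation, here is a minimal sketch of splitting one Apache access log line into named fields. It is not the code in ./real_log_cleaner. It assumes the standard Apache Combined Log Format, and the names `COMBINED_LOG_PATTERN` and `parse_line` are hypothetical. The sample line is the one used in the Apache HTTP Server documentation.

```python
import re

# Assumed layout (Apache Combined Log Format):
#   host ident user [time] "request" status size "referer" "user-agent"
# The trailing referer/user-agent pair is optional, so Common Log Format
# lines match as well.
COMBINED_LOG_PATTERN = re.compile(
    r'(?P<host>\S+) (?P<ident>\S+) (?P<user>\S+) '
    r'\[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" '
    r'(?P<status>\d{3}) (?P<size>\d+|-)'
    r'(?: "(?P<referer>[^"]*)" "(?P<agent>[^"]*)")?'
)


def parse_line(line):
    """Return a dict of field name -> string, or None if the line does not match."""
    match = COMBINED_LOG_PATTERN.match(line)
    if match is None:
        return None
    return match.groupdict()


sample = (
    '127.0.0.1 - frank [10/Oct/2000:13:55:36 -0700] '
    '"GET /apache_pb.gif HTTP/1.0" 200 2326 '
    '"http://www.example.com/start.html" '
    '"Mozilla/4.08 [en] (Win98; I ;Nav)"'
)
fields = parse_line(sample)
print(fields["host"], fields["status"], fields["size"])
```

This sketch does not handle escaped quotes inside the request or user-agent fields, which do occur in real logs.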

Sample Logs Used

To view all of the sample log files used (the three real log files and the five generated log files mentioned in the paper), please visit the data repository.

Publication

The details of the tool and the data are given in a preprint. The final version of the paper was published in the proceedings of the International Conference on Software Engineering (ICSE’21); you can see the recording of the presentation here. Please cite the tool and data as:

@INPROCEEDINGS{rand2021log,
  author={Rand, Jared and Miranskyy, Andriy},
  booktitle={2021 IEEE/ACM 43rd International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)}, 
  title={{On Automatic Parsing of Log Records}}, 
  year={2021},
  pages={41-45},
  doi={10.1109/ICSE-NIER52604.2021.00017}
}

License

This project is licensed under the MIT License.

Contact Us

If you have found a bug or have an idea for a new feature, please open an issue or pull request.

Acknowledgments

This work was supported and funded by Ryerson University and the Natural Sciences and Engineering Research Council of Canada.