Bro Analysis Tools (BAT): Processing and analysis of Bro network data with Pandas, scikit-learn, and Spark
Switch branches/tags
Nothing to show
Clone or download
Latest commit e5f1d4f Oct 11, 2018

Bro Analysis Tools (BAT) travis Coverage Status supported-versions license

The BAT Python package supports the processing and analysis of Bro data with Pandas, scikit-learn, and Spark

BroCon 2017 Presentation

Data Analysis, Machine Learning, Bro, and You! (Video)

Why BAT?

Bro already has a flexible, powerful scripting language why should I use BAT?

Offloading: Running complex tasks like statistics, state machines, machine learning, etc.. should be offloaded from Bro so that Bro can focus on the efficient processing of high volume network traffic.

Data Analysis: We have a large set of support classes that help bridge from raw Bro data to packages like Pandas, scikit-learn, and Spark. We also have example notebooks that show step-by-step how to get from here to there.

Example: Pull in Bro Logs as Python Dictionaries

from bat import bro_log_reader
    # Run the bro reader on a given log file
    reader = bro_log_reader.BroLogReader('dhcp.log')
    for row in reader.readrows():

Output: Each row is a nice Python Dictionary with timestamps and types properly converted.

{'assigned_ip': '',
'id.orig_h': '',
'id.orig_p': 68,
'id.resp_h': '',
'id.resp_p': 67,
'lease_time': datetime.timedelta(49710, 23000),
'mac': '00:20:18:eb:ca:54',
'trans_id': 495764278,
'ts': datetime.datetime(2012, 7, 20, 3, 14, 12, 219654),
'uid': 'CJsdG95nCNF1RXuN5'}

Example: Bro log to Pandas DataFrame (in one line of code)

from bat.log_to_dataframe import LogToDataFrame
    # Create a Pandas dataframe from a Bro log
    bro_df = LogToDataFrame('/path/to/dns.log')

    # Print out the head of the dataframe

Output: All the Bro log data is in a Pandas DataFrame with proper types and timestamp as the index

                                                    query      id.orig_h  id.orig_p id.resp_h
2013-09-15 17:44:27.631940          1030
2013-09-15 17:44:27.696869       1030
2013-09-15 17:44:28.060639       1030
2013-09-15 17:44:28.141795       1030
2013-09-15 17:44:28.422704       1030

More Examples

  • Easy ingestion of any Bro Log into Python (dynamic tailing and log rotations are handled)
  • Bro Logs to Pandas Dataframes and Scikit-Learn
  • Dynamically monitor files.log and make VirusTotal Queries
  • Dynamically monitor http.log and show 'uncommon' User Agents
  • Running Yara Signatures on Extracted Files
  • Checking x509 Certificates
  • Anomaly Detection
  • See BAT Examples for more details.

Analysis Notebooks

BAT enables the processing, analysis, and machine learning of realtime data coming from Bro.


$ pip install bat



  • The DummyEncoder is based on Tom Augspurger's great PyData Chicago 2016 Talk

SuperCowPowers Logo