GitHub - 48ca/obfs4-fp: An analysis of fingerprintability of packet traces from obfs4

This is code for analyzing the fingerprintability of obfs4proxy packet traces while using the obfs4 pluggable transport. This code is experimental and was written for my capstone project at the University of Virginia.

What is contained:

fpgen: Fingerprint generator
classification: Fingerprint classification
obfs4: obfs4proxy fork (submodule) that contains the modified obfs4 PT

For more information about each module, look at their respective READMEs.

Setup

Download Tor Browser and place it into fpgen/tor-browser_en-US. Install Tor (separate from the TBB) somewhere such that it can be found in the PATH.
Set up Python 3 and install the packages in requirements.txt.

$ ./setup-venv.sh

Pull all submodules if not pulled already.

$ ./pull-submodules.sh

Set up a Go runtime environment that is recent enough to build obfs4proxy, then build obfs4proxy.

$ cd obfs4; ./build.sh

Edit fpgen/env.sh to include your information. This includes your network interface ($IF) and the bridge IP ($BRIDGE). Set $CAPDIR and $LOGFILE to suit your needs. source this file after you are done making changes. These environment variables are used by several scripts in this repository.
Start a bridge and configure it to run obfs4proxy. Edit the torrc in this repository so that the local Tor instance connects to the bridge via obfs4proxy.
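Since several scripts depend on the variables set in fpgen/env.sh, a quick check before a run can save a failed capture. The following is a minimal sketch, not part of this repository: the variable names come from the setup step above, while the function name `missing_env_vars` is hypothetical. It also assumes env.sh exports the variables, since a Python process only sees exported ones.

```python
import os

# Variable names are the ones listed in the setup step above.
# The check itself is illustrative and not part of the repository.
REQUIRED_VARS = ("IF", "BRIDGE", "CAPDIR", "LOGFILE")


def missing_env_vars(environ=None):
    """Return the required variables that are unset or empty."""
    env = os.environ if environ is None else environ
    return [name for name in REQUIRED_VARS if not env.get(name)]


if __name__ == "__main__":
    missing = missing_env_vars()
    if missing:
        print("Missing variables (did you source fpgen/env.sh?):", ", ".join(missing))
    else:
        print("Environment looks complete.")
```

Passing a plain dictionary instead of relying on `os.environ` makes the check easy to try without touching your shell.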

Usage

Before doing anything, make sure your environment is up-to-date and that you have sourced fpgen/env.sh.

Getting packet traces.

(Working directory: traces)

Start Tor.

$ ./tor.sh

If you want to observe status logs during a run, tail the LOGFILE you set in env.sh.

$ tail -f $LOGFILE

If you want to observe obfs4proxy logs during a run, tail its log file.

$ tail -f tor-data/pt_state/obfs4proxy.log

Start fpgen.

(venv) $ python3 fpgen.py

While fpgen is running, you can pause the fetching with pause() or stop it entirely with stop(). You can add other commands on the fly. You can configure Tor to open a control port or socket if you so desire.

Upon completion, all traces will be available in the CAPDIR you set earlier. Traces that failed to download will be suffixed with '.bad'.
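To see how many traces failed after a run, you could separate the '.bad' files from the rest. This is a rough sketch rather than a script shipped with the repository: the function name `split_traces` is hypothetical, and it assumes that every regular file directly inside CAPDIR is a trace and that failed ones end in '.bad'.

```python
import os
from pathlib import Path


def split_traces(capdir):
    """Return (good, bad) lists of file names found directly in capdir."""
    good, bad = [], []
    for path in sorted(Path(capdir).iterdir()):
        if not path.is_file():
            continue  # skip subdirectories
        # Failed downloads are assumed to carry a '.bad' suffix.
        (bad if path.name.endswith(".bad") else good).append(path.name)
    return good, bad


if __name__ == "__main__":
    capdir = os.environ.get("CAPDIR")
    if capdir and Path(capdir).is_dir():
        good, bad = split_traces(capdir)
        print(f"{len(good)} usable traces, {len(bad)} failed")
```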

Analysis and feature extraction.

(Working directory: fpgen)

To analyze a specific dump, you can run my analyze.py script.

(venv) $ python3 analyze.py path/to/trace.pcap

To generate the CSV that represents the final fingerprints, run csv-gen.sh, and place the output into the fingerprints directory. The classification scripts expect the fingerprints to be placed there.

(venv) $ ./csv-gen.sh | tee fingerprints/$FP_NAME.csv
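Before moving on to classification, it may be worth confirming that the generated CSV is well formed. The column layout produced by csv-gen.sh is not described here, so the sketch below checks only that every non-empty row has the same number of fields. The function name `summarize_csv` is hypothetical.

```python
import csv


def summarize_csv(path):
    """Return (row_count, sorted list of distinct field counts per row)."""
    with open(path, newline="") as handle:
        # Blank lines are skipped; no assumption is made about a header row.
        widths = [len(row) for row in csv.reader(handle) if row]
    return len(widths), sorted(set(widths))
```

A result such as `(n, [k])` with a single field count suggests a consistent file. More than one distinct count points to truncated or malformed rows.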

Classification.

(Working directory: classification/)

To train and test a Random Forest model on the generated fingerprints:

(venv) $ python3 classify.py $FP_NAME

To do the same but with a multilayer perceptron (MLP) learner, use

(venv) $ python3 classify-nn.py $FP_NAME

I provide a utility script to generate 10 models and save the best one in 'auto-good-models'.

(venv) $ ./good-classify.sh $FP_NAME
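The idea of training several models and keeping the best can be sketched as follows. How good-classify.sh actually selects the best model is not described here, so this assumes "best" means the highest score returned by a training run. Both `best_of` and `fake_train` are hypothetical names, and `fake_train` is a stand-in that does no real training.

```python
def best_of(n_runs, train_and_score):
    """Call train_and_score(run_index) n_runs times.

    train_and_score must return (model, score). Returns (best_score, best_model),
    keeping the earliest run in case of a tie.
    """
    best = None
    for run in range(n_runs):
        model, score = train_and_score(run)
        if best is None or score > best[0]:
            best = (score, model)
    return best


def fake_train(run):
    # Stand-in for real training: returns a label and a made-up score.
    return f"model-{run}", ((run * 7) % 10) / 10


if __name__ == "__main__":
    score, model = best_of(10, fake_train)
    print(f"kept {model} with score {score}")
```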

To test the accuracy of the models in 'auto-good-models':

(venv) $ ./test.py $FP_NAME
