DPT-OpenCV

The code in the repository was used in an investigation into Deep Parameter Optimisation/Tuning on OpenCV. The work was published in two papers:

Bruce, B. R. et al. Deep Parameter Optimisation for Face Detection Using the Viola-Jones Algorithm in OpenCV. SSBSE 2016 (https://doi.org/10.1007%2F978-3-319-47106-8_18)

Bruce, B. R. Deep Parameter Optimisation for Face Detection Using the Viola-Jones Algorithm in OpenCV: A Correction. UCL Dept. of Computer Science Research Note RN/17/01 (http://www.cs.ucl.ac.uk/fileadmin/UCL-CS/research/Research_Notes/RN_17_07.pdf)

Basic Setup

Run ./setup.bsh to download and install OpenCV, the testcases and setup the project (This downloads over a GB of data. It may take some time to complete)

Please note that cmake, make, boost, build-essential, bc and a Java Development Kit are all required for setup.bsh and the experiments to run successfully ( sudo apt-get install cmake make libboost-all-dev build-essentia ldefault-jdk ). Everything documented here has only been tested on an Ubuntu 14.04.4 and Ubuntu 16.04.1 OS. It is not known if this would be replicable on anyother OS, unix-based or otherwise.

To compile and run the application:

CMake .

make

./classify_images [directory containing images]

classify_images should output which images have and which images do not contain faces (1 == face, 0 == not face)

Profiling the software

The directory "profiling" provides two scripts "setup_profiling.bsh" and "remove_profiling.bsh" which, when executed from the "profiling" directory shall setup the application for profiling and revert this action respectively

"profiling/callgrind.out.11739" was produced using the "valgrind" tool on a subset of the positive and negative testcases (after profiling was setup using "setup_profiling.bsh" :

valgrind --tool=callgrind ./classify_images [directory containing images]

"profiling/callgraph.png" was generated using "qcachegrind" for Mac OS X (use "kcachegrind" for similar results on Linux)

The "modifiable_files" directory contains the files which shall be modified.

Exposing the parameters

In the "modifiable_files directory there is an "expose_integers.bsh" script which when run can extract the integer constants from a program. The usage of the script is as follows:

./expose_integers.bsh [input file] [the output file] [define_file]

This script utalises the "replace_integer.pl" script

We used this create the "cascadedetect_exposed.cpp" and "cascadedetect_exposed.hpp" files along with the "replaces.hpp" file which contains all of the extracted integer constants. We add this as an include at the top of the file.

These files are copied into the appropriate directory during setup. As with the profiling directory, the scripts and data are mostly for reproducability purposes.

Filtering the extracted constants

For optimisation we want to remove all the constants that are of low value or of no use at all. Since, for this example, we have over 500 constants we wrote a script which goes through each constant and runs some basic tests to indicate whether this is worthwhile.

The "run_sensativity_filteration.bsh" script works by iterating through each constant. For each one, it first of all increments it by one. If it does not compile or produces an outcome which crashes on the "sensitivity_set" then this is considered too sensative to really be of value. If it passes this test then the integer constant has 50 added. If the program compiles, runs, and completes running in the normal amount of time (within the 95% confidence interval of the original application) for the "training_st" then we determine this constant to not be sensative enough for our requirements (it may not be necessisary at all!). The goal is to find constants which can be modified slightly without crashing but change the running of the software when modified by a small amount. A sensativity "sweet-spot" as it were.

The output of the "run_sensativity_filteration.bsh" script can be interpreted as follows (it is in a CSV format):

[Constant],[Not_too_sensative?],[time_training_set],[sensative_enough?]

We modify only those constants where [Not_too_sensative?] and [sensative_enough] are true. cat run_sensativity_filteration_output.csv | awk -F "," '($2=="true" && $4=="true"){print $1}' >replaces_selection.dat

This replaces_selection.dat is then used in the final parameter tuning step

Running Deep Parameter Tuning

The MOEA framework (2.9) is used to run the NSGA-II algorithm. This should have been downloaded and setup with the "setup.bsh" script was run. For this investigation a setup is provided: "DeepParameterTuning.java". This is very much hard-coded to OpenCV. When executed it will run for 10 generations with a population size of 100. The initial generation is seeded with the Deep Parameters in their original state and variants within the local-neighbourhood. To compile execute the following:

javac -cp ".:MOEAFramework-2.9/lib/*" DeepParameterTuning.java

To Run:

java -cp ".:MOEAFramework-2.9/lib/*" DeepParameterTuning

Results

The output running this on an Ubuntu 14.04.4 m4.large Amazon EC2 Instance (2x2.4GHz Intel Xeon E5-2676 v3 processor, 8GiB of memory, SSD Storage) can be found in the "100ind_10gen_run" directory

The "100ind_10gen_run" directory contains the following files:

DeepParameterTuning_100ind_10gen.dat -> the raw output of executing DeepParameterTuning.java

no_mod_100_run_training_set.csv -> the output of running classify_images 100 times using "fitness_function.bsh"

training_set_original.csv -> the average of "no_mod_100_run_training_set.csv"

pareto_optimal_training_set.csv -> The Pareto Optimal solutions, derived from "DeepParameterTuning_100ind_10gen.dat"

time_correctness_pareto_front_training_set.pdf -> The Pareto Optimal solutions from "pareto_optimal_training_set.csv" plotted. Includes "training_set_original.csv" to show the original appplication (not Pareto Optimal, in blue as opposed to the Pareto Frontier in RED)

test_set_original.csv -> the original program running the test set

pareto_optimal_test_set.csv -> The Pareto Optimal solutions from "pareto_optimal_trianing_set.csv" run on the testset. Also the original ("test_set_original.csv"). One solution from "pareto_optimal_training_set.csv" crashed when running testset, another was dominated. This is noted in the file.

time_correctness_pareto_front_test_set.csv -> The Pareto Optimal solutions from "pareto_optimal_test_set.csv" plotted

Patches

Patches have been provided in the "patches" directory. These allow opencv to be patched with the solutions found in "100ind_10gen_run/pareto_optimal_test_set.csv" . Please consult "patches/README.txt" for more info on this

Name		Name	Last commit message	Last commit date
Latest commit History 78 Commits
100ind_100gen_run_evolutionary		100ind_100gen_run_evolutionary
100ind_100gen_run_systematic		100ind_100gen_run_systematic
100ind_10gen_run_evolutionary		100ind_10gen_run_evolutionary
100ind_10gen_run_systematic		100ind_10gen_run_systematic
modifiable_files		modifiable_files
patches		patches
profiling		profiling
10k_eval_test_compare.pdf		10k_eval_test_compare.pdf
10k_eval_training_compare.pdf		10k_eval_training_compare.pdf
1k_eval_test_compare.pdf		1k_eval_test_compare.pdf
1k_eval_training_compare.pdf		1k_eval_training_compare.pdf
CMakeLists.txt		CMakeLists.txt
DeepParameterTuning.java		DeepParameterTuning.java
DeepParameterTuning_100ind_100gen_evolutionary.java		DeepParameterTuning_100ind_100gen_evolutionary.java
DeepParameterTuning_100ind_100gen_systematic.java		DeepParameterTuning_100ind_100gen_systematic.java
DeepParameterTuning_100ind_10gen_evolutionary.java		DeepParameterTuning_100ind_10gen_evolutionary.java
DeepParameterTuning_100ind_10gen_sytematic.java		DeepParameterTuning_100ind_10gen_sytematic.java
LICENSE		LICENSE
README.md		README.md
classify_images.cpp		classify_images.cpp
fitness_function.bsh		fitness_function.bsh
get_95_confidence_lower.R		get_95_confidence_lower.R
get_95_confidence_upper.R		get_95_confidence_upper.R
haarcascade_frontalface_alt.xml		haarcascade_frontalface_alt.xml
load.bsh		load.bsh
load_and_evaluation.bsh		load_and_evaluation.bsh
load_and_evaluation_test_set.bsh		load_and_evaluation_test_set.bsh
negative_10000.dat		negative_10000.dat
positive_10000.dat		positive_10000.dat
replaces_selection.dat		replaces_selection.dat
run_sensativity_filteration.bsh		run_sensativity_filteration.bsh
run_sensativity_filteration_output.csv		run_sensativity_filteration_output.csv
setup.bsh		setup.bsh
testset_evaluation_script.bsh		testset_evaluation_script.bsh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DPT-OpenCV

Basic Setup

Profiling the software

Exposing the parameters

Filtering the extracted constants

Running Deep Parameter Tuning

Results

Patches

About

Releases

Packages

Languages

License

BobbyRBruce/DPT-OpenCV

Folders and files

Latest commit

History

Repository files navigation

DPT-OpenCV

Basic Setup

Profiling the software

Exposing the parameters

Filtering the extracted constants

Running Deep Parameter Tuning

Results

Patches

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages