Master Thesis

Topic: Pre-Trained Denoising Autoencoders Long Short-Term Memory Networks as probabilistic Models for Estimation of Distribution Genetic Programming

Institution: Johannes Gutenberg University Mainz, Chair of Business Administration and Computer Science (FB 03)

Abstract

English

Denoising Autoencoder Genetic Programming (DAE-GP) is an Estimation of Distribution Algorithm in the domain of Genetic Programming that uses Denoising Autoencoders Long Short-Term Memory Networks (DAE-LSTM) as probabilistic models for sampling new populations of solutions. This thesis investigates the possible benefits and downsides of using pre-training for the DAE-LSTM networks of DAE-GP for four real world symbolic regression problems. The experiments conducted did show that pre-training can drastically reduce the number of epochs that are necessary for the DAE-LSTM training at each generation of the DAE-GP search. Another interesting finding was that pre-training also increases the levenshtein edit distance between individual solutions inside the population which is a metric for the diversity of a population. Unfortunately, pre-training did not yield any improvements in the final fitness or size of solutions, despite significantly increasing the total run-time for DAE-GP.

Deutsche Fassung

Denoising Autoencoder Genetic Programming (DAE-GP) ist ein Estimation of Distribution Algorithmus (EDA) aus dem Forschungfeld der genetischen Programmierung (GP). In DAE-GP werden Denoising Autoencoders Long Short-Term Memory Netzwerke (DAE-LSTM) als probabilistische Modelle verwendet um neue Populationen von Lösungen für eine evolutionäre Suche zu erzeugen. Diese Masterarbeit untersucht die Vor- und Nachteile des Einsatzes einer Pre-Training Strategie für die DAE-LSTM Netzwerke von DAE-GP an vier Datensätzen für symbolische Regression. Die durchgeführten Experimente haben gezeigt, dass Pre-Training die Anzahl von Trainingsepochen für die DAE-LSTM Netzwerke in jeder Generation statistisch signifikant reduzieren konnte. Außerdem zeigt sich, dass Pre-Training die Levenshtein Editierdistanz, ein Maß für die Populationsdiversität, signifikant erhöhen konnte. Leider konnte Pre-Training die Qualität der jeweils besten gefundenen Lösungen, weder im Bezug auf ihre Fitness noch auf ihre Größe, verbessern. Auch führte Pre-Training in den durchgeführten Experimenten zu einer starken Erhöhung der Laufzeit des DAE-GP Algorithmus.

Keywords

Genetic Programming, Estimation of Distribution Algorithms, Denoising Autoencoder Genetic Programming, Pre-Training, Long Short-Term Memory Networks, Symbolic Regression

Documents available:

Full Thesis: paper.pdf (RMarkdown Source File: paper.Rmd)

PDF-Slide Show: slides.pdf (RMarkdown Source File: slides.Rmd)

Name		Name	Last commit message	Last commit date
Latest commit History 289 Commits
.vscode		.vscode
csl		csl
data		data
header_files		header_files
latexify		latexify
mermaid		mermaid
notes		notes
orga		orga
ref		ref
statutory_declaration		statutory_declaration
tables		tables
tex		tex
.gitignore		.gitignore
README.md		README.md
hoehn_kolloquium_foliensatz_final.pdf		hoehn_kolloquium_foliensatz_final.pdf
master_thesis.Rproj		master_thesis.Rproj
paper.Rmd		paper.Rmd
paper.lof		paper.lof
paper.lot		paper.lot
paper.pdf		paper.pdf
paper.tex		paper.tex
paper.toc		paper.toc
slides.Rmd		slides.Rmd
slides.log		slides.log

roman91DE/master_thesis

Folders and files

Latest commit

History

Repository files navigation

Master Thesis

Abstract

English

Deutsche Fassung

Keywords

Documents available:

About

Topics

Resources

Stars

Watchers

Forks

Languages