Thesis Title: Complex Networks in Manufacturing - Suitability and Interpretation
Author: Yamila Mariel Omar
Expected Defense Date: 8th July 2021
Institution: University of Luxembourg
This repository contains all the code necessary to reproduce my PhD Thesis. Find below a description of the files.
data
: The raw data used for my thesis corresponds to thetrain_date.csv
file in the Kaggle competition titled "Bosch Production Line Performance". Semi-processed data is available in this folder instead.figures
: Some of the figures in my Thesis are available in this folder.flow_networks
: all the code corresponding to the Chapter of "Flow Networks" is available in this directory.
from_timestamp_to_paths.py
: code used to process the train_date.csv data from Kaggle (see here) to obtain manufacturing paths and manufacturing edges.path_data_cleaning.py
: code necessary to remove "noisy" data. It produces two files: clean manufacturing paths and clean edges.
graphfile.py
: class used to open and write data from/to text files.graph.py
: class used to do complex network analysis.betweenness_centrality.py
: functions necessary to calculate the Betweenness centrality. Read docstrings for details.clustering_coefficient
: functions necessary to calculate the Clustering Coefficient. See docstrings for details.depth_first_search.py
: functions needed for the Depth First Search algorithm. Some functions to determine strongly connected components are also available here.pagerank.py
: functions needed to calculate the PageRank algorithm.
traditional_topological_metrics.py
will produce the results in Chapter 7, titled "Traditional Topological Metrics".main_pagerank.py
will produce the results in Chapter 8, titled "PageRank".
main_for_entropy_code_profiling.py
: To profile the code in the Entropy Chapter (Chapter 9) only the data from product family F2 was used. This snipped is the one profiled (otherwise, it would require HPC use).
plot_network.py
: code used to generate the DOT file to plot the manufacturing network in the Thesis.