Unreally Notebooks

This repo contains all jupyter notebooks used for planing and developing the Unreally pipeline. (Each folder represents a step in the pipeline)

The following contains brief discribtions of each notebook. If you commit one yourself, please add some infos to this file as well.

Step 1 - Vectors

This folder deals with the first step of the ML pipeline: Vectorization of text & the necessesary cleaning of data. Also note that it contains a .zip with the datasets used

Top5k Bow-TF

explores the SemEval2016 A Gold dataset
contains a custom function for cleaning text
aims to understand how one could create a dictionary with the top 5k words of a dataset

5k-MultiDF Vocab

extends the function of the previous notebook to compute one 5k vocab of multiple data frames
more precise functions
displays the top 5k words as a data frame

Step 2 - MLP

The second step of our pipeline: The Multi-Layer Perceptron.

the_unreally_notebook

The current state-of-the-art notebook. This model reached about 72.5% accuracy.

data_balancing

Notebook for balancing the mapped data to imporove training

NN_and_eval

(Outdated) First tests to evaluate the neural net

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
Step1-Vectors		Step1-Vectors
Step2-MLP		Step2-MLP
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Unreally Notebooks

Step 1 - Vectors

Top5k Bow-TF

5k-MultiDF Vocab

Step 2 - MLP

the_unreally_notebook

data_balancing

NN_and_eval

About

Releases

Packages

Contributors 2

Languages

unreally-ai/notebooks

Folders and files

Latest commit

History

Repository files navigation

Unreally Notebooks

Step 1 - Vectors

Top5k Bow-TF

5k-MultiDF Vocab

Step 2 - MLP

the_unreally_notebook

data_balancing

NN_and_eval

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages