What character from The Office would say that quote?
DunderAI is a web application that uses deep learning with LSTM and Transformer models to predict which character from the popular TV show "The Office" is most likely to have said a given quote. With this application, users can test their knowledge of the show's characters and their mannerisms.
DunderAI uses a deep learning model trained on quotes from the top 211 characters in The Office.
The two available models for inference are the Long Short-Term Memory (LSTM) and a Tranformer models. LSTM networks are a form of recurrent neural networks (RNN) that are particularily good at processing sequences of data. Transformers, (DunderAI builds off of BERT) are very good at processing sequence data as well, but with a different mechanism, 'self-attention'. When a user inputs a quote, the model uses the input text to predict which character is most likely to have said the line; model predictions are all based upon what language and speech patterns the network could pick up on.
- Easy to use web interface
- Input new or adjusted quote to receive character prediction
- Ability to handle direct quotes accurately from any episode of "The Office" even deleted scenes
conda env create -f environment.yml
conda activate DunderAI
cd DunderAI && python manage.py
- Visit http://127.0.0.1:8000/ in your web browser.
- Input a quote from "The Office" into the text field.
- Click the "Submit" button to receive a prediction of which character said the quote. (Assuming you have a fastai model in data/models)
DunderAIFull.mp4
DunderAI was created by myself using Python, Fastai, Django, and HTML/CSS. The dataset used to train the LSTM and BERT models was obtained from [Kaggle]. This all started as a healthy competition with a friend to see who could train the better model- an excuse to learn more about NLP.
DunderAI is licensed under the MIT License. Please see the LICENSE file for more information.
DunderAI is a fan project and is not affiliated with the TV show "The Office" or its creators in any way.
Footnotes
-
Top 21 characters being the 21 characters with the most voice lines. ↩