loup-garou

Speech analysis and intent classification in the game of Werewolf

Data

The data were transcribed manually from the TV show Pandakill.

Environments and tools

Python 3.6.3
ipython 6.1.0
jupyter 1.0.0
HanLP 0.1.41 (Python interface pyHanLP)
jieba 0.39
Keras 2.0.6 (with TensorFlow 1.0.0 backend)
Pandas 0.20.3
Scikit-Learn 0.19.1

File introduction

original_text: the original transcribed subtitle files, named by episode.

demo-game-english.txt: an excerpt of the transcribed subtitles translated into English, for English readers.

data_speech_all.csv: all the annotated and organized data for the speech part.

s1e101_behavior.json, s1e102_behavior.json, s1e103_behavior.json: examples of the annotated and organized data for the behavior part.

userdict.txt: user dictionary containing game-specific terms.

prepare_easymode.py, prepare_hardmode.py: I hesitated over how to build the speech part and the behavior part. At first I tried to model the game with a general architecture so that prepare_hardmode.py could construct both parts automatically. That proved too difficult: the game is dynamic and unpredictable, with so many exceptions and variations that a general architecture becomes too expensive to build, even for the simplest rules. In the end I simply extracted the speech paragraphs into CSV files with prepare_easymode.py and recorded the behavior data manually in JSON files.
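As a rough illustration of the easy-mode idea, the sketch below pulls speech paragraphs out of a transcript and serializes them as CSV. The transcript layout (one `player: speech` line per turn), the column names, and the helper names `extract_speeches` and `rows_to_csv` are assumptions made for this example; they are not taken from prepare_easymode.py.

```python
import csv
import io
import re

# Hypothetical transcript layout: "<player id>: <speech text>" per speech turn.
# Lines that do not match (e.g. narration) are skipped.
SPEECH_LINE = re.compile(r"^(?P<player>\d+)\s*[:：]\s*(?P<speech>.+)$")


def extract_speeches(transcript: str, episode: str) -> list:
    """Return one row (dict) per speech paragraph found in the transcript."""
    rows = []
    for line in transcript.splitlines():
        match = SPEECH_LINE.match(line.strip())
        if match:
            rows.append({
                "episode": episode,
                "player": int(match.group("player")),
                "speech": match.group("speech").strip(),
            })
    return rows


def rows_to_csv(rows: list) -> str:
    """Serialize the rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=["episode", "player", "speech"])
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


sample = """Night falls.
3: I am the seer, and player 5 is a wolf.
5: Player 3 is lying.
"""
speech_rows = extract_speeches(sample, episode="s1e101")
csv_text = rows_to_csv(speech_rows)
```

Keeping extraction and serialization separate means the same rows could be annotated before being written out.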

intent_classification_data_explore.ipynb: shows basic statistics of the speech data. It also includes a small unsupervised clustering baseline, evaluated by its entropy to gauge how well it works.
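One common way to judge a clustering against annotated labels is the size-weighted average entropy of the label distribution inside each cluster, where lower values mean purer clusters. The sketch below computes that quantity. Whether the notebook uses exactly this definition is an assumption, and the intent label names are made up for the example.

```python
import math
from collections import Counter, defaultdict


def cluster_entropy(cluster_ids, labels):
    """Size-weighted mean entropy (in bits) of the labels within each cluster."""
    members = defaultdict(list)
    for cluster, label in zip(cluster_ids, labels):
        members[cluster].append(label)

    total = len(labels)
    weighted = 0.0
    for cluster_labels in members.values():
        size = len(cluster_labels)
        counts = Counter(cluster_labels)
        entropy = -sum((n / size) * math.log2(n / size) for n in counts.values())
        weighted += (size / total) * entropy
    return weighted


# Hypothetical intent labels, for illustration only.
labels = ["accuse", "accuse", "defend", "defend"]
pure = cluster_entropy([0, 0, 1, 1], labels)   # each cluster holds a single intent
mixed = cluster_entropy([0, 1, 0, 1], labels)  # each cluster is a 50/50 mix
```

A clustering that separates the intents perfectly scores 0 bits, while the fully mixed assignment of two balanced intents scores 1 bit.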

intent_classification_learning.ipynb: shows the machine learning methods applied to the intent classification task on the speech part, together with their performance and hyperparameter tuning.

intent_classification_RNN.py, intent_classification_w2v.py: apply LSTM recurrent networks to the classification task, one with pretrained Chinese word embeddings (cf. https://github.com/Embedding/Chinese-Word-Vectors) and the other without.

intent_classification_LSTM.ipynb: shows the hyperparameter tuning of the neural network with GridSearch.
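For readers unfamiliar with grid search, the following is a minimal, library-free sketch of the idea: evaluate every combination of candidate hyperparameter values and keep the best. The search space and the function `toy_validation_score` are hypothetical stand-ins; in the notebook the score would come from training and validating the network, and its actual grid is not reproduced here.

```python
from itertools import product


def grid_search(param_grid, score_fn):
    """Try every combination in param_grid; return (best_params, best_score)."""
    names = sorted(param_grid)
    best_params, best_score = None, float("-inf")
    for values in product(*(param_grid[name] for name in names)):
        params = dict(zip(names, values))
        score = score_fn(**params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


def toy_validation_score(units, dropout):
    """Stand-in for 'train the network and return validation accuracy'.

    Purely synthetic: it peaks at units=64, dropout=0.2.
    """
    return 1.0 - abs(units - 64) / 128 - abs(dropout - 0.2)


# Hypothetical search space, for illustration only.
param_grid = {"units": [32, 64, 128], "dropout": [0.2, 0.5]}
best_params, best_score = grid_search(param_grid, toy_validation_score)
```

The cost grows with the product of the list lengths, so grids are usually kept small when each evaluation means training a network.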

speech_summary_final.ipynb: uses the results of the intent classification task, together with syntactic parsing by HanLP to pin down sentence meaning, to summarize each player's speech turn. It then uses this information to build a configuration of the game situation. Combined with the behavior records, it computes a penalty score for every possible wolf team at each speech turn. The team with the lowest penalty score is the predicted wolf team.
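The selection step described above can be sketched as an exhaustive search over candidate teams. The penalty rule used here (one point for each claim by a presumed-good player that contradicts the candidate team) and the `(speaker, target, says_wolf)` claim format are assumptions for illustration; the notebook's actual scoring, which also draws on the behavior records, is not reproduced.

```python
from itertools import combinations


def penalty(team, claims):
    """Count claims by presumed-good players that contradict the hypothesis.

    claims: iterable of (speaker, target, says_wolf) tuples.
    Hypothetical rule: players outside the candidate team are assumed sincere,
    so each of their claims that disagrees with the team costs one point.
    """
    score = 0
    for speaker, target, says_wolf in claims:
        if speaker in team:
            continue  # wolves may lie, so their claims carry no penalty here
        if says_wolf != (target in team):
            score += 1
    return score


def predict_wolf_team(players, team_size, claims):
    """Return the candidate team with the lowest penalty (first one on ties)."""
    candidates = (frozenset(c) for c in combinations(players, team_size))
    return min(candidates, key=lambda team: penalty(team, claims))


players = [1, 2, 3, 4, 5]
claims = [
    (1, 4, True),   # player 1 says player 4 is a wolf
    (2, 4, True),
    (3, 5, True),
    (4, 1, True),
    (5, 2, False),  # player 5 says player 2 is good
    (1, 5, True),
]
predicted = predict_wolf_team(players, team_size=2, claims=claims)
```

In this toy game the team made up of players 4 and 5 is the only hypothesis consistent with every claim from the remaining players, so it receives a penalty of 0 and is selected.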

qui-sont-les-loups-garous.pdf: the slides for my defense.

ExeCuteRunrunrun/loup-garou
