Style Evaluation for Author Embedding

This repository contains a framework to perform author embedding evaluation on a writing style axis.

Requirements

spacy >= 3.0.1
nltk >= 3.5

Description

Feature extraction

It first extracts for a raw text corpus stylistic features. Data directory should be organized as follow (see dataset directory for example) :

.\data_dir\author1
          \author1\text1.txt
          \author1\text2.txt
          ...
.\data_dir\author2
          \author2\text1.txt
          \author2\text2.txt
          ...
...

To perform extraction run :

from extractor import build_authorship
data_dir = "dataset\\English"
authorship = build_authorship(data_dir)

from extractor import create_stylometrics
stylo_df = create_stylometrics(authorship)

Authorship is a dataframe linking each author to its textual production, while stylo_df contains all stylometric features by text.

Embedding evaluation

To perform embedding evaluation, run the following code with your custom embeddings :

import numpy as np
embeddings = np.random.randn(3,3,512)

from regressor import style_embedding_evaluation, multi_style_evaluation

# To evaluate a single embedding method
res_df = style_embedding_evaluation(embeddings[0], stylo_df, n_fold=2, output="agg")

# To evaluate several embedding methods
df_results = multi_style_evaluation(embeddings, names["model1", "model2", "model3"], features=stylo_df, n_fold=2)

You can then produce a spyder chart as follow :

from regressor import style_spyder_charts
style_spyder_charts(df_results)

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
__pycache__		__pycache__
dataset		dataset
image		image
README.md		README.md
dale-chall.pkl		dale-chall.pkl
extractor.py		extractor.py
fr_extractor.py		fr_extractor.py
french.pickle		french.pickle
regressor.py		regressor.py
syllabation.py		syllabation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Style Evaluation for Author Embedding

Requirements

Description

Feature extraction

Embedding evaluation

About

Releases

Packages

Languages

EnzoFleur/style_embedding_evaluation

Folders and files

Latest commit

History

Repository files navigation

Style Evaluation for Author Embedding

Requirements

Description

Feature extraction

Embedding evaluation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages