Skip to content

cligs/pyzeta

master
Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 
 
 

pyzeta

DOI

The pyzeta scripts are a Python implementation of Craig's Zeta score for contrastive text analysis.

What is the purpose?

This script implements a relatively simple measure for distinctive features in two groups of texts. It allows you to find out which words are characteristic of one group of texts when compared to another group of texts.

The underlying measure of distinctiveness or keyness has been proposed by John Burrows under the name of Zetahence the name of this Python package.

Getting help

  • The sampledata folder contains some examples of what input pyzeta needs and what output it produces
  • The howto.md file contains a brief tutorial of sorts for running analyses with pyzeta.

Requirements and installation

  • Requirements: Python 3 with pandas, numpy, sklearn, pygal, treetaggerwrapper (and TreeTagger)