This repository explores word-frequency distributions and type-token relationships in author-based corpora composed of popular novels by renowned writers. We investigate whether these corpora exhibit Zipfian behavior in their frequency distributions, analyze rank-frequency relationships through line fitting, and assess lexical richness using Heaps' law.
For details, refer to the related blog post.
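
As a rough illustration of the two analyses named above, here is a minimal sketch using only the Python standard library. It is not the repository's actual code: the function names (`tokenize`, `fit_line`, `zipf_fit`, `heaps_fit`), the simple regex tokenizer, and the choice of an ordinary least-squares fit in log-log space are all assumptions made for this example. Zipf's law is taken here as frequency ≈ C / rank^s, and Heaps' law as V(n) ≈ K · n^β, where V(n) is the number of distinct word types seen after n tokens.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (assumed tokenizer)."""
    return re.findall(r"[a-z']+", text.lower())


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept.

    Requires at least two distinct x values.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def zipf_fit(tokens):
    """Fit log(freq) = -s * log(rank) + c; return (s, c).

    An exponent s close to 1 is the classic Zipfian signature.
    """
    freqs = sorted(Counter(tokens).values(), reverse=True)
    log_ranks = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    log_freqs = [math.log(freq) for freq in freqs]
    slope, intercept = fit_line(log_ranks, log_freqs)
    return -slope, intercept


def heaps_curve(tokens):
    """Return (n, V(n)) pairs: vocabulary size after each of the first n tokens."""
    seen = set()
    curve = []
    for n, token in enumerate(tokens, start=1):
        seen.add(token)
        curve.append((n, len(seen)))
    return curve


def heaps_fit(tokens):
    """Fit log V = beta * log n + log K; return (K, beta)."""
    curve = heaps_curve(tokens)
    log_n = [math.log(n) for n, _ in curve]
    log_v = [math.log(v) for _, v in curve]
    beta, log_k = fit_line(log_n, log_v)
    return math.exp(log_k), beta
```

To try it on one novel, read the text into a string, pass it through `tokenize`, and call `zipf_fit` and `heaps_fit` on the resulting token list. Fitting every rank with equal weight tends to let the long tail of rare words dominate the Zipf estimate, so restricting the fit to a range of ranks or plotting the log-log curve alongside the fitted line is a reasonable next step.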