# Chapter 6 – Life: Moore's Multiform *Byron*

The final chapter of *Romanticism and the Contingent Self* is devoted to Thomas Moore's [*Letters and Journals of Lord Byron, with Notices of his Life*](https://www.lordbyron.org/contents.php?doc=ThMoore.1830.Contents) (1830-31), which was recognised in its day as a classic of literary biography. As in previous chapters, the aim is twofold. First, I seek to integrate a genre of 'contingent selfhood' into a general morphology of Romantic literature. Second, I seek to establish appropriate aesthetic standards for judging 'formless' works of contingent selfhood. This time, I focus on a single case study. Moore's *Byron* is a vast text that incorporates both 'monstrous' and 'multiple' portrayals of the self. Only a small number of literary biographies are available for comparison, so the analysis uses a smaller corpus.

In [1]:
from romanticself import BiographyCorpus, Biography
from nltk import word_tokenize
import pandas as pd
import numpy as np
import seaborn as sns
from matplotlib import pyplot as plt
from seaborn import objects as so
corpus = BiographyCorpus("data/biography-corpus/", tokenizer = word_tokenize)
moore = corpus.moore

sns.set_theme(style="darkgrid", palette="gray")

# Section 6.2 Plot: Ordered and Disordered lives

What is the shape of the corpus? The texts differ in a number of ways. Some are 'lives and letters' biographies that mix primary sources with the biographer's own text. Others are more straightforward novelistic biographies. They vary greatly in overall length, and also in sentence length. Sentence length matters because it could affect the sentiment analysis. However, the variation in sentence length appears to be fairly small: only Elizabeth Gaskell (about six words shorter) and Thomas Medwin (about five words longer) depart from Moore's mean sentence length by more than four words.

In [2]:
def count_tokens(bio: Biography, allowed_authors=None) -> int:
    """Count the tokens that bio.iter_tokens yields for the given allowed_authors."""
    # Summing over the iterator avoids building a throwaway list of tokens.
    return sum(1 for _ in bio.iter_tokens(allowed_authors=allowed_authors))

def biography_stats(bio: Biography) -> dict:
    """Summarise one biography: word counts by contributor and mean sentence length."""
    # Subject and third-party counts are only computed for XML sources; other formats get 0.
    is_xml = bio.extension == ".xml"
    return {
        "Author": bio.author,
        "Date": bio.date,
        "Title": bio.title,
        "Author words": count_tokens(bio, bio.author_id),
        "Subject words": count_tokens(bio, {bio.subject}) if is_xml else 0,
        "Other words": count_tokens(bio, bio.all_authors - {bio.author_id, bio.subject}) if is_xml else 0,
        "All words": count_tokens(bio),
        "Mean sentence length": np.mean([len(word_tokenize(sent.text)) for sent in bio.sentences]),
    }

figure_6_1 = pd.DataFrame.from_records([biography_stats(bio) for bio in corpus]).sort_values("All words", ascending=False, ignore_index=True)
figure_6_1.to_csv("figures/figure_6_1.csv")
display(figure_6_1)

| | Author | Date | Title | Author words | Subject words | Other words | All words | Mean sentence length |
|---|---|---|---|---|---|---|---|---|
| 0 | John Gibson Lockhart | 1837 | Memoirs of the Life of Sir Walter Scott, Bart. | 466072 | 533282 | 0 | 999354 | 35.088445 |
| 1 | Thomas Moore | 1830 | Letters and Journals of Lord Byron | 260671 | 368442 | 7496 | 636609 | 33.342586 |
| 2 | Elizabeth Gaskell | 1857 | The Life of Charlotte Brontë | 209379 | 0 | 0 | 209379 | 27.288414 |
| 3 | Thomas Medwin | 1847 | The Life of Percy Bysshe Shelley | 140419 | 2600 | 1291 | 144310 | 38.167152 |
| 4 | John Galt | 1830 | The Life of Lord Byron | 103888 | 782 | 10072 | 114742 | 36.989684 |
| 5 | Robert Southey | 1813 | The Life of Nelson | 112030 | 0 | 0 | 112030 | 33.164594 |
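
The claim that sentence length varies little across the corpus can be checked directly against the table above. The following is a minimal sketch that copies the mean sentence lengths from the table and reports each biographer's distance from Moore. The names `mean_sentence_length`, `moore_mean` and `THRESHOLD` are my own, and the cut-off of four words is an illustrative assumption rather than a value used elsewhere in the analysis.

```python
# Mean sentence lengths (tokens per sentence), copied from the table above.
mean_sentence_length = {
    "John Gibson Lockhart": 35.088445,
    "Thomas Moore": 33.342586,
    "Elizabeth Gaskell": 27.288414,
    "Thomas Medwin": 38.167152,
    "John Galt": 36.989684,
    "Robert Southey": 33.164594,
}

moore_mean = mean_sentence_length["Thomas Moore"]

# Signed difference from Moore's mean: negative means shorter sentences.
difference_from_moore = {
    author: round(length - moore_mean, 2)
    for author, length in mean_sentence_length.items()
}

# Illustrative cut-off (an assumption, not part of the original analysis).
THRESHOLD = 4.0
outliers = {
    author: diff
    for author, diff in difference_from_moore.items()
    if abs(diff) > THRESHOLD
}

# Print the biographers from shortest to longest sentences relative to Moore.
for author, diff in sorted(difference_from_moore.items(), key=lambda item: item[1]):
    print(f"{author:22s} {diff:+.2f}")
```

With this cut-off, `outliers` contains only Gaskell and Medwin. Galt falls just inside it, so a stricter threshold would change the picture.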


Who wrote the roughly 10,000 'other' words in Galt's biography?

In [4]:
corpus.galt.author_breakdown

Counter({'JoGalt1839': 103888,
         'LdBroug1': 1818,
         'Unknown': 854,
         'LdByron': 782,
         'PeShell1822': 474,
         'AlMavro1865': 351,
         'Stend1842': 6575})
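
As a sanity check, the breakdown above can be reconciled with the 'Other words' figure for Galt in the summary table. This is a minimal sketch that copies the counts from the output rather than querying the corpus. The names `BIOGRAPHER`, `SUBJECT` and `others` are my own, and treating `LdByron` as the subject identifier is an assumption based on its count matching the 'Subject words' column.

```python
from collections import Counter

# Token counts per contributor, copied from the output above.
author_breakdown = Counter({
    "JoGalt1839": 103888,
    "LdBroug1": 1818,
    "Unknown": 854,
    "LdByron": 782,
    "PeShell1822": 474,
    "AlMavro1865": 351,
    "Stend1842": 6575,
})

BIOGRAPHER = "JoGalt1839"
SUBJECT = "LdByron"  # assumed subject identifier; see the note above

# Everything that is neither the biographer's nor the subject's text.
others = Counter({
    author_id: count
    for author_id, count in author_breakdown.items()
    if author_id not in {BIOGRAPHER, SUBJECT}
})
total_other = sum(others.values())

# List third-party contributors from largest to smallest, with their share.
for author_id, count in others.most_common():
    print(f"{author_id:12s} {count:6d} {count / total_other:6.1%}")
print("Total other words:", total_other)
```

The total comes to 10072, which matches the table, and the single identifier `Stend1842` accounts for roughly 65% of it.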

In [5]:
corpus.to_csv("data/biography-corpus/all-sentences.csv")

AttributeError: 
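
The error message above is truncated, so its cause is not known. One possibility is that the corpus object does not expose a `to_csv` method. A minimal sketch of an alternative follows. It relies only on attributes that appear in the earlier cells (iterating over the corpus, `bio.author`, `bio.title`, `bio.sentences` and `sent.text`). The function name `write_sentences_csv`, the column names and the stand-in objects are hypothetical, and are there only so that the sketch runs without the `romanticself` package.

```python
import csv
import os
import tempfile
from types import SimpleNamespace


def write_sentences_csv(corpus, path):
    """Write one CSV row per sentence and return the number of rows written."""
    rows = 0
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["author", "title", "sentence_index", "text"])
        for bio in corpus:
            for index, sent in enumerate(bio.sentences):
                writer.writerow([bio.author, bio.title, index, sent.text])
                rows += 1
    return rows


# Stand-in objects (hypothetical) that mimic the attributes used above.
demo_corpus = [
    SimpleNamespace(
        author="Author A",
        title="Life of X",
        sentences=[
            SimpleNamespace(text="First sentence."),
            SimpleNamespace(text="Second sentence."),
        ],
    ),
    SimpleNamespace(
        author="Author B",
        title="Life of Y",
        sentences=[SimpleNamespace(text="Only sentence.")],
    ),
]

# Write to a temporary directory so the demo leaves the data folder untouched.
demo_path = os.path.join(tempfile.mkdtemp(), "all-sentences.csv")
rows_written = write_sentences_csv(demo_corpus, demo_path)
```

In the notebook itself, the call would presumably be `write_sentences_csv(corpus, "data/biography-corpus/all-sentences.csv")`, assuming the real objects behave like the stand-ins.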