# Introduction 

These notebooks contain experiments for the paper `Do Large Language Models understand literature? Case studies and probing experiments on German poetry.` 

After some preliminary studies, we decided not to use open-source models such as Llama 3.1:70b because they did not perform as well as the commercial models offered by OpenAI, Google, and Anthropic. Llama 3.1:405b was not an option because we did not have access to infrastructure capable of running it.

Our selection was based on evaluations of these models on Chatbot Arena (1.11.2024): https://lmarena.ai/ 

List of LLMs used: 

* OpenAI: ChatGPT-4o (gpt-4o-2024-08-06)
* Anthropic: Claude 3.5 Sonnet (2024-10-22)
* Google: Gemini 1.5 Pro (Sep 2024)


# Two texts
We use two texts. The first is a very well-known poem, 'Hälfte des Lebens' ('The middle of life'), by Friedrich Hölderlin, a famous German poet of the early 19th century. Many copies of this poem, in German and in English translation, are available online, as are many interpretations.
The second poem, 'Unsere Toten' ('Our Dead'), by Hans Pfeifer, was unknown to Google at the time of writing and has not been reprinted since its first publication in 1922. Its author is not a known figure in literary history.

We ask the following questions:

* How well do the models know Hölderlin's text?
* Does it matter whether a model knows a text or not? Does the model use this knowledge when it produces new text about the reference text?
* How well does the recognition of the text type work?
* How relevant is the information about the text type?

# Configuration

In [1]:
import os

from definitions import poem_1, poem_2
from utils import settings, gemini, gpt4, opus, init_gemini, printmd
import utils

%load_ext jupyter_ai_magics

In [2]:
# Settings: sampling temperature and system prompt
temperature = 0.8
system_prompt = "You are an expert in German literature. You answer the questions truthfully and short."

settings(system_prompt, temperature)
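
The `settings` helper comes from the project's `utils` module, which is not shown in this notebook. As a rough sketch of what such a helper could look like, assuming it only stores the two values so that later model calls can read them (the names `GenerationSettings`, `CURRENT` and `settings_sketch` are hypothetical and not taken from `utils`):

```python
from dataclasses import dataclass


@dataclass
class GenerationSettings:
    """Generation settings shared by all model wrappers (hypothetical)."""
    system_prompt: str = ""
    temperature: float = 0.0


# Module-level state that wrapper functions could read when building a request.
CURRENT = GenerationSettings()


def settings_sketch(system_prompt: str, temperature: float) -> GenerationSettings:
    """Store the system prompt and temperature for later model calls."""
    CURRENT.system_prompt = system_prompt
    CURRENT.temperature = temperature
    return CURRENT


cfg = settings_sketch(
    "You are an expert in German literature. "
    "You answer the questions truthfully and short.",
    0.8,
)
print(cfg.temperature)
```

Keeping both values in one place means that every model receives the same system prompt and temperature, which makes the answers easier to compare.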
    

In [3]:
# Define aliases for the %ai magics
init_gemini()

model = gpt4()
%ai register gpt4o model

model = opus()
%ai register opus model

Registered new alias `opus`

# Text type 'poem'

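Task: Do the models recognize the text and the text type? Both poems are given to the models as running text without their verse line breaks, and the prompt asks what kind of "news text" the passage is.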
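
How the prose versions of the poems were produced is not documented in this notebook. One possible way to remove the verse layout from any poem is sketched below: collapse all whitespace and re-wrap the text at a fixed width. The function name `reflow_as_prose` and the width are assumptions, and the sketch leaves capitalization untouched.

```python
import textwrap


def reflow_as_prose(poem: str, width: int = 80) -> str:
    """Collapse verse line breaks and re-wrap the text at a fixed width."""
    # Joining the whitespace-separated tokens removes the original verse breaks.
    running_text = " ".join(poem.split())
    # Re-wrapping at a fixed width yields line breaks unrelated to the verses.
    return textwrap.fill(running_text, width=width)


# First three verses of Hölderlin's poem in their usual verse layout.
verse = """Mit gelben Birnen hänget
Und voll mit wilden Rosen
Das Land in den See,"""

prose = reflow_as_prose(verse, width=60)
print(prose)
```

The words and their order stay the same; only the position of the line breaks changes.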

## Hälfte des Lebens

In [4]:
t1 = """
Mit gelben Birnen hänget und voll mit wilden Rosen das Land 
in den See, ihr holden Schwäne, und trunken von Küssen 
tunkt ihr das Haupt ins heilignüchterne Wasser.  
Weh mir, wo nehm’ ich, wenn es Winter ist, die 
Blumen, und wo den Sonnenschein, und Schatten der Erde? Die 
Mauern stehn sprachlos und kalt, im Winde klirren die Fahnen.
"""

In [5]:
prompt = f"""I found this text in an old folder with stuff from my family. 
What kind of news text is it? Here is the text: {t1}"""
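
The same wording is reused for the second poem further down, and one of the research questions asks how much the stated text type matters. A small helper that takes both the text and the claimed text type as parameters would make such variations easy to run. This is only a sketch: `PROMPT_TEMPLATE` and `build_prompt` are hypothetical names that do not exist in the project's `utils` module.

```python
PROMPT_TEMPLATE = (
    "I found this text in an old folder with stuff from my family.\n"
    "What kind of {claimed_type} is it? Here is the text: {text}"
)


def build_prompt(text: str, claimed_type: str = "news text") -> str:
    """Fill the prompt template with a text and a claimed text type."""
    return PROMPT_TEMPLATE.format(claimed_type=claimed_type, text=text)


# Default: the misleading "news text" framing used in this notebook.
misleading = build_prompt("Mit gelben Birnen hänget ...")
# Variation: a neutral framing that does not suggest any text type.
neutral = build_prompt("Mit gelben Birnen hänget ...", claimed_type="text")
print(misleading)
print(neutral)
```

Comparing the answers to the misleading and the neutral variant would show how strongly the claimed text type influences a model's classification.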

In [6]:
%%ai opus
{prompt}

This is not a news text, but a famous poem called "Hälfte des Lebens" (Half of Life) written by Friedrich Hölderlin in 1804. It's one of the most important poems in German literature, contrasting the vitality of summer with the bleakness of winter as a metaphor for different phases of life.

In [7]:
%%ai gpt4o
{prompt}

This text is not a news text; it is a poem titled "Herbstbild" by Friedrich Hölderlin, a prominent German poet.

In [9]:
printmd(gemini(prompt))

This text is a poem, not a news text. It's a highly stylized and evocative piece of writing using figurative language (like the pears hanging into the lake, the swans drunk on kisses) to create a specific mood and imagery. News texts, on the other hand, focus on reporting factual information in a clear and objective manner.

This particular poem is a famous excerpt from the beginning of "September" from the cycle *Das Jahr der Seele* (The Year of the Soul) by the German poet Stefan George (1868-1933). It's a very melancholic and symbolic reflection on the transition from summer to winter, and by extension, from life and fullness to coldness and emptiness.
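
The three answers can be compared with a simple check for whether the correct title and author are named. The sketch below uses a case-insensitive substring match on sentences quoted from the outputs above. The helper names are hypothetical, and a substring match is a crude measure that would miss spelling variants such as "Hoelderlin".

```python
def mentions(response: str, term: str) -> bool:
    """Return True if the term occurs in the response, ignoring case."""
    return term.casefold() in response.casefold()


def score_recognition(response: str, title: str, author: str) -> dict[str, bool]:
    """Report whether a response names the expected title and author."""
    return {
        "title": mentions(response, title),
        "author": mentions(response, author),
    }


# Sentences quoted from the three model answers above.
answers = {
    "opus": 'This is not a news text, but a famous poem called "Hälfte des '
            'Lebens" (Half of Life) written by Friedrich Hölderlin in 1804.',
    "gpt4o": 'This text is not a news text; it is a poem titled "Herbstbild" '
             "by Friedrich Hölderlin, a prominent German poet.",
    "gemini": 'This particular poem is a famous excerpt from the beginning of '
              '"September" from the cycle *Das Jahr der Seele* (The Year of '
              "the Soul) by the German poet Stefan George (1868-1933).",
}

scores = {
    name: score_recognition(text, "Hälfte des Lebens", "Hölderlin")
    for name, text in answers.items()
}
for name, score in scores.items():
    print(name, score)
```

For these excerpts the check reports title and author for Claude, only the author for GPT-4o, and neither for Gemini.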


## Unsere Toten

In [10]:
t2 = """
Von Westen und Osten, von Nord und Süd schleppen sich nächtens viele Füße müd, Füße, vom Wandern 
wund und zerfetzt, langsam bedächtig zur Erde gesetzt, müh'n sich im zitternden Mondenschein rastlos 
tief nach Deutschland hinein. Und wer mit lauschendem Ohr noch wacht hört sie in jedweder werdenden 
Nacht, hört dies Schlurfen so müde und schwer, hört eine Klage voll wilder Begehr, eine Klage 
schmerzzerfressen: nur nicht vergessen! Uns nicht vergessen!
"""

In [11]:
prompt = f"""
I found this text in an old folder with stuff from my family. 
What kind of news text is it? Here is the text: {t2}
"""

In [12]:
%%ai opus
{prompt}

This appears to be a post-World War I poem about returning soldiers and refugees coming back to Germany. The style and content is typical of the literature from the Weimar Republic period, focusing on the suffering and hardship of those returning from war. It has a strong emotional appeal with its repetitive plea "nicht vergessen" (don't forget us) at the end.

In [13]:
%%ai gpt4o
{prompt}

This text is not a typical news article; it is more literary in nature, resembling a poem or a lyrical piece. It seems to express themes of longing and remembrance, possibly reflecting on the experiences of displaced individuals or refugees seeking to return to Germany. The style and content suggest it might be from the post-war period, capturing the emotional struggle and yearning for homeland.

In [15]:
printmd(gemini(prompt))

This text is likely a poem or a piece of prose with poetic qualities, perhaps from the period after World War II, dealing with the theme of refugees or displaced persons returning to Germany.

Here's why:

* **Lyrical Language:**  The text uses evocative imagery ("zitternden Mondenschein," "schlurfen so müde und schwer," "Klage voll wilder Begehr") and repetition ("Von...," "Füße...," "hört...") typical of poetry or lyrical prose.
* **Theme of weariness and hardship:** The description of tired, wounded feet ("müd," "wund und zerfetzt"), the slow, deliberate movement ("langsam bedächtig"), and the restless journey ("rastlos tief nach Deutschland hinein") suggest a difficult and arduous journey.
* **"Von Westen und Osten, von Nord und Süd":** This phrase emphasizes the widespread origin of these travelers, implying a large-scale movement of people.
* **"Uns nicht vergessen!":** This desperate plea, repeated for emphasis, strongly suggests a marginalized group seeking recognition and help. This, coupled with the imagery of hardship and widespread movement, points toward refugees or displaced persons.  After WWII, millions of Germans were expelled from Eastern European territories and had to make their way back to what remained of Germany.  This context fits the text well.

While it could theoretically be part of a larger news article, the style is not typical of objective journalistic reporting.  It's much more likely to be a stand-alone piece, perhaps published in a literary magazine or as part of a collection of writings about the post-war experience.
