In [1]:
from factuality_evaluator import UnilateralFactualityEvaluator, BilateralFactualityEvaluator

In [2]:
examples = [
    {
        "problem": "Who was the first man to walk on the Moon?",
        "answer": "Neil Armstrong"
    },
    {
        "problem": "Who was the first man to walk on the Moon?",
        "answer": "Pete Conrad"
    },
    {
        "problem": "Are penguins birds?",
        "answer": "No"
    },
    {
        "problem": "Who was the frobnoz that restified the branks?",
        "answer": "Rutherford B. Hayes"
    },
    {
        "problem": "When was America discovered?",
        "answer": "1492"
    },
    {
        "problem": "Did Lee Harvey Oswald act alone in the assassination of John Kennedy?",
        "answer": "Yes"
    },
    {
        "problem": "Is the Holy Trinity one entity or three entities?",
        "answer": "Three"
    },
]

In [3]:
def bilateral_evaluation(example):
    """Run the bilateral evaluator on one example and print its output."""
    bilateral_evaluator = BilateralFactualityEvaluator("gpt-4o-2024-11-20")
    evaluation = bilateral_evaluator.invoke(example)
    print(f'ASSERTION: {evaluation["problem"]}: {evaluation["answer"]}')
    print()
    # The bilateral evaluator reports a verification and a falsification separately.
    print("VERIFICATION:", evaluation["verification"])
    print()
    print("FALSIFICATION:", evaluation["falsification"])
    print()
    print("TRUTH VALUE:", evaluation["evaluation"])

In [4]:
def unilateral_evaluation(example):
    """Run the unilateral evaluator on one example and print its output."""
    unilateral_evaluator = UnilateralFactualityEvaluator("gpt-4o-2024-11-20")
    evaluation = unilateral_evaluator.invoke(example)
    print(f'ASSERTION: {evaluation["problem"]}: {evaluation["answer"]}')
    print()
    # The unilateral evaluator reports a single line of reasoning.
    print("REASONING:", evaluation["reasoning"])
    print()
    print("TRUTH VALUE:", evaluation["evaluation"])
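
The two helpers above print one evaluation at a time. To see at a glance where the evaluators disagree, the truth values can also be collected side by side. The following is a minimal sketch, not part of `factuality_evaluator`: `compare_evaluators`, `disagreements`, and `FixedEvaluator` are hypothetical names introduced here, and the only assumption about the real evaluators is the one already visible above, namely that `invoke(example)` returns a dict with an `"evaluation"` key.

```python
def compare_evaluators(examples, unilateral, bilateral):
    """Return one row per example with both truth values side by side.

    Assumes each evaluator exposes invoke(example) returning a dict
    that contains an "evaluation" key, as in the helpers above.
    """
    rows = []
    for example in examples:
        rows.append({
            "problem": example["problem"],
            "answer": example["answer"],
            "unilateral": unilateral.invoke(example)["evaluation"],
            "bilateral": bilateral.invoke(example)["evaluation"],
        })
    return rows


def disagreements(rows):
    """Keep only the rows where the two evaluators assign different values."""
    return [row for row in rows if row["unilateral"] != row["bilateral"]]


class FixedEvaluator:
    """Hypothetical stand-in that always returns the same truth value.

    It exists only so the sketch runs without calling a model.
    """

    def __init__(self, value):
        self.value = value

    def invoke(self, example):
        return {**example, "evaluation": self.value}


if __name__ == "__main__":
    demo = [{"problem": "Are penguins birds?", "answer": "No"}]
    rows = compare_evaluators(demo, FixedEvaluator("f"), FixedEvaluator("n"))
    for row in disagreements(rows):
        print(row["problem"], row["answer"], row["unilateral"], row["bilateral"])
```

In the notebook itself, the stand-ins would be replaced by `UnilateralFactualityEvaluator("gpt-4o-2024-11-20")` and `BilateralFactualityEvaluator("gpt-4o-2024-11-20")`; each `invoke` call then queries the model, so results may vary between runs.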

## Who was the first man to walk on the Moon?: Neil Armstrong

### Unilateral factuality evaluation: t

In [5]:
unilateral_evaluation(examples[0])

ASSERTION: Who was the first man to walk on the Moon?: Neil Armstrong

REASONING: Reasoning steps:

1. **Verify the essential information is present**:  
   The question asks for the identity of the first man to walk on the Moon. The answer provided is "Neil Armstrong." This is a direct response to the question.

2. **Check for any supporting facts**:  
   Historical records confirm that Neil Armstrong was the first human to set foot on the Moon during the Apollo 11 mission on July 20, 1969. This is a well-documented and widely accepted fact.

3. **Check for any contradictory facts**:  
   There are no credible or widely accepted contradictory facts. No other individual is recognized as the first person to walk on the Moon.

4. **Consider question context for implied terms**:  
   The question is straightforward and does not include any ambiguous or implied terms. It specifically asks for the first man to walk on the Moon, which is a clear and unambiguous historical event.

5. **Note i

### Bilateral factuality evaluation: t

In [6]:
bilateral_evaluation(examples[0])

ASSERTION: Who was the first man to walk on the Moon?: Neil Armstrong

VERIFICATION: ### Verification Process:

#### Step 1: Parse key claims in both question and answer
- **Question:** "Who was the first man to walk on the Moon?"
  - Key claim: There was a first man to walk on the Moon.
- **Answer:** "Neil Armstrong."
  - Key claim: Neil Armstrong was the first man to walk on the Moon.

#### Step 2: Verify each claim against known facts
- **Claim 1:** There was a first man to walk on the Moon.
  - Verified: The Apollo 11 mission, conducted by NASA, successfully landed humans on the Moon on July 20, 1969. This was the first time humans walked on the Moon.
- **Claim 2:** Neil Armstrong was the first man to walk on the Moon.
  - Verified: Historical records confirm that Neil Armstrong, the commander of Apollo 11, was the first human to set foot on the Moon, followed by Buzz Aldrin.

#### Step 3: Identify any unstated assumptions
- Assumption 1: The question refers to the Moon of Earth, n

## Who was the first man to walk on the Moon?: Pete Conrad

### Unilateral factuality evaluation: f

In [7]:
unilateral_evaluation(examples[1])

ASSERTION: Who was the first man to walk on the Moon?: Pete Conrad

REASONING: Reasoning steps:

1. **Verify the essential information is present**:  
   The question asks for the first man to walk on the Moon. The answer provided is "Pete Conrad." The essential information here is the identity of the first man to walk on the Moon.

2. **Check for any supporting facts**:  
   Pete Conrad was an astronaut and the commander of Apollo 12, the second manned mission to land on the Moon. He did walk on the Moon, but he was not the first. The first man to walk on the Moon was Neil Armstrong, who was the commander of Apollo 11 and stepped onto the lunar surface on July 20, 1969.

3. **Check for any contradictory facts**:  
   The claim that Pete Conrad was the first man to walk on the Moon directly contradicts the well-documented historical fact that Neil Armstrong was the first. Armstrong's famous words, "That's one small step for [a] man, one giant leap for mankind," are widely recognized as

### Bilateral factuality evaluation: f

In [8]:
bilateral_evaluation(examples[1])

ASSERTION: Who was the first man to walk on the Moon?: Pete Conrad

VERIFICATION: ### Verification Process:

#### Step 1: Parse key claims in both question and answer
- **Question**: "Who was the first man to walk on the Moon?"
  - Key claim: There was a "first man" who walked on the Moon.
- **Answer**: "Pete Conrad"
  - Key claim: Pete Conrad was the first man to walk on the Moon.

#### Step 2: Verify each claim against known facts
- **Claim 1**: There was a "first man" who walked on the Moon.
  - Verified as TRUE. The Apollo 11 mission in July 1969 marked the first time humans landed on the Moon, and Neil Armstrong was the first person to walk on its surface.
- **Claim 2**: Pete Conrad was the first man to walk on the Moon.
  - Verified as FALSE. Pete Conrad was an astronaut who walked on the Moon during the Apollo 12 mission in November 1969, making him the third person to walk on the Moon. The first man to walk on the Moon was Neil Armstrong.

#### Step 3: Identify any unstated ass

## Are penguins birds?: No

### Unilateral factuality evaluation: f

In [9]:
unilateral_evaluation(examples[2])

ASSERTION: Are penguins birds?: No

REASONING: Reasoning steps:

1. **First verify the essential information is present**  
   The question asks whether penguins are birds. The answer provided is "No." To evaluate this, we need to determine whether penguins meet the biological criteria for being classified as birds. Essential information includes the definition of a bird and whether penguins fit that definition.

2. **Check for any supporting facts**  
   Penguins are warm-blooded, egg-laying vertebrates with feathers, which are key characteristics of birds. They belong to the class *Aves*, which is the scientific classification for birds. While penguins cannot fly, flight is not a requirement for being classified as a bird (e.g., ostriches and emus are also flightless birds). Therefore, the scientific consensus supports that penguins are indeed birds.

3. **Check for any contradictory facts**  
   There are no credible facts that contradict the classification of penguins as birds. The

### Bilateral factuality evaluation: f

In [10]:
bilateral_evaluation(examples[2])

ASSERTION: Are penguins birds?: No

VERIFICATION: ### Verification Process:

#### Step 1: Parse key claims in both question and answer
- **Question**: "Are penguins birds?"  
  Key claim: Penguins are birds.
- **Answer**: "No"  
  Key claim: Penguins are not birds.

#### Step 2: Verify each claim against known facts
- **Fact**: Penguins are classified as birds in the biological taxonomy. They belong to the class *Aves*, which includes all birds. Penguins are flightless birds adapted to aquatic life, but they retain all the defining characteristics of birds, such as feathers, laying eggs, and being warm-blooded.  
  - The claim "Penguins are not birds" is **false** based on established biological classification.

#### Step 3: Identify any unstated assumptions
- The answer assumes that the definition of "birds" excludes flightless species or species with unique adaptations like penguins. However, this assumption is incorrect because the scientific definition of birds includes all members

## Who was the frobnoz that restified the branks?: Rutherford B. Hayes

### Unilateral factuality evaluation: f

In [11]:
unilateral_evaluation(examples[3])

ASSERTION: Who was the frobnoz that restified the branks?: Rutherford B. Hayes

REASONING: Reasoning steps:

1. **Verify the essential information is present**: The question uses nonsensical or fictional terms ("frobnoz" and "restified the branks"). These terms do not correspond to any known historical, scientific, or factual concepts. Therefore, the essential information needed to answer the question meaningfully is not present.

2. **Check for any supporting facts**: There is no evidence or historical record to suggest that Rutherford B. Hayes, the 19th President of the United States, was associated with anything resembling "frobnoz" or "restified the branks." These terms do not appear in any credible sources or historical accounts.

3. **Check for any contradictory facts**: Since the terms are nonsensical, there are no specific contradictory facts. However, the lack of any historical or factual basis for the terms themselves indirectly contradicts the claim that Rutherford B. Hayes 

### Bilateral factuality evaluation: n

In [12]:
bilateral_evaluation(examples[3])

ASSERTION: Who was the frobnoz that restified the branks?: Rutherford B. Hayes

VERIFICATION: ### Verification Process:

#### Step 1: Parse key claims in both question and answer
- **Question**: "Who was the frobnoz that restified the branks?"
  - Key terms: "frobnoz" and "restified the branks." These terms are nonsensical and do not correspond to any known historical, scientific, or linguistic concepts.
  - Implied claim: There exists a "frobnoz" who "restified the branks."

- **Answer**: "Rutherford B. Hayes"
  - Key claim: Rutherford B. Hayes (19th President of the United States) is the "frobnoz" who "restified the branks."

#### Step 2: Verify each claim against known facts
- The terms "frobnoz" and "restified the branks" are not recognized in any factual or historical context. They appear to be fictional or nonsensical.
- There is no evidence or record of Rutherford B. Hayes being associated with these terms or performing any action that could be interpreted as "restifying the bra

## When was America discovered?: 1492

### Unilateral factuality evaluation: t

In [13]:
unilateral_evaluation(examples[4])

ASSERTION: When was America discovered?: 1492

REASONING: Reasoning steps:

1. **Verify the essential information is present**: The question asks about the discovery of America, and the answer provides a specific year, 1492. This is a direct response to the question.

2. **Check for any supporting facts**: It is widely accepted that Christopher Columbus, an Italian explorer sailing under the Spanish flag, arrived in the Americas in 1492. This event is often referred to as the "discovery of America" in a historical context, particularly from a European perspective.

3. **Check for any contradictory facts**: While Columbus's arrival in 1492 is a significant historical event, it is important to note that indigenous peoples had been living in the Americas for thousands of years prior to this. Additionally, Norse explorers, such as Leif Erikson, are believed to have reached parts of North America (e.g., Vinland, now Newfoundland, Canada) around 500 years earlier, around the year 1000. Howev

### Bilateral factuality evaluation: n

In [14]:
bilateral_evaluation(examples[4])

ASSERTION: When was America discovered?: 1492

VERIFICATION: ### Verification Process:

1. **Parse key claims in both question and answer:**
   - **Question:** "When was America discovered?" implies a specific event or moment in history when "America" was first encountered or identified by someone.
   - **Answer:** "1492" claims that the discovery of America occurred in the year 1492.

2. **Verify each claim against known facts:**
   - The term "discovered" is historically associated with Christopher Columbus's first voyage to the Americas. Columbus, an Italian explorer sailing under the Spanish flag, reached the Caribbean islands (specifically the Bahamas) in October 1492.
   - However, the Americas were already inhabited by Indigenous peoples for thousands of years prior to Columbus's arrival. Additionally, Norse explorers, such as Leif Erikson, reached parts of North America (e.g., Vinland, likely in modern-day Newfoundland, Canada) around 500 years earlier, circa 1000 CE.

3. **Ide

## Did Lee Harvey Oswald act alone in the assassination of John Kennedy?: Yes

### Unilateral factuality evaluation: t

In [15]:
unilateral_evaluation(examples[5])

ASSERTION: Did Lee Harvey Oswald act alone in the assassination of John Kennedy?: Yes

REASONING: Reasoning steps:

1. **Verify the essential information is present**:  
   The question asks whether Lee Harvey Oswald acted alone in the assassination of John F. Kennedy. The answer provided is "Yes." This is a factual question about a historical event, so we need to evaluate the evidence and historical consensus regarding Oswald's involvement and whether he acted alone.

2. **Check for any supporting facts**:  
   - The official investigation into the assassination, the Warren Commission Report (1964), concluded that Lee Harvey Oswald acted alone in assassinating President John F. Kennedy on November 22, 1963.  
   - The report stated that Oswald fired three shots from the Texas School Book Depository, one of which fatally struck Kennedy.  
   - Subsequent investigations, such as the House Select Committee on Assassinations (HSCA) in 1979, acknowledged the Warren Commission's findings bu

### Bilateral factuality evaluation: n

In [16]:
bilateral_evaluation(examples[5])

ASSERTION: Did Lee Harvey Oswald act alone in the assassination of John Kennedy?: Yes

VERIFICATION: ### Verification Process:

#### Step 1: Parse key claims in both question and answer
- **Question Claim**: Lee Harvey Oswald was involved in the assassination of John F. Kennedy.
- **Answer Claim**: Lee Harvey Oswald acted alone in the assassination of John F. Kennedy.

#### Step 2: Verify each claim against known facts
1. **Claim 1: Lee Harvey Oswald was involved in the assassination of John F. Kennedy.**
   - Known Facts: On November 22, 1963, President John F. Kennedy was assassinated in Dallas, Texas. Lee Harvey Oswald was arrested and charged with the crime. The Warren Commission concluded in 1964 that Oswald fired the shots that killed Kennedy. This claim is widely accepted as true, though some conspiracy theories dispute it.
   - Verification: This claim is **verified as true** based on historical evidence and the findings of the Warren Commission.

2. **Claim 2: Lee Harvey Oswal

## Is the Holy Trinity one entity or three entities?: Three

### Unilateral factuality evaluation: f

In [19]:
unilateral_evaluation(examples[6])

ASSERTION: Is the Holy Trinity one entity or three entities?: Three

REASONING: Reasoning steps:

1. **Verify the essential information is present**: The question asks about the Holy Trinity, a central concept in Christian theology. The answer provided is "Three." The essential information is present, as the question is about the nature of the Holy Trinity and the answer directly addresses the number of entities.

2. **Check for any supporting facts**: In Christian theology, the Holy Trinity refers to the Father, the Son (Jesus Christ), and the Holy Spirit. These are considered three distinct persons or entities, yet they are also understood to be one God in essence. This concept is often summarized as "three persons in one God." The answer "Three" aligns with the idea of three distinct persons.

3. **Check for any contradictory facts**: While the Holy Trinity is described as three persons, it is also emphasized in Christian doctrine that these three persons are one in essence or subst

### Bilateral factuality evaluation: n

In [18]:
bilateral_evaluation(examples[6])

ASSERTION: Is the Holy Trinity one entity or three entities?: Three

VERIFICATION: ### Verification Process:

#### Step 1: Parse key claims in both question and answer
- **Question:** "Is the Holy Trinity one entity or three entities?"
  - The question is asking about the nature of the Holy Trinity, specifically whether it is one entity or three entities.
- **Answer:** "Three"
  - The answer claims that the Holy Trinity consists of three entities.

#### Step 2: Verify each claim against known facts
- The concept of the Holy Trinity is a theological doctrine in Christianity. It states that God exists as three distinct persons (Father, Son, and Holy Spirit) but is one in essence or being.
- According to mainstream Christian theology (e.g., Nicene Creed), the Holy Trinity is not three separate entities but rather one God in three persons. This is a key tenet of Trinitarian doctrine.
- The claim that the Holy Trinity is "three entities" contradicts this theological understanding. Instead, 