# LLM Fine-Tuning

**WIP**


In [1]:
from tinytextgrad.prompt import Prompt, load_prompt


def apply_prompt(
    prompt: Prompt,
    inputs: list[str],
) -> list[str]:
    """
    Calls the LLM with the prompt and inputs.
    Returns a list of strings representing the generated text.
    """
    # Use `text` rather than `input` so the built-in is not shadowed.
    results = [prompt.call_llm(text) for text in inputs]
    for i, (text, result) in enumerate(zip(inputs, results), start=1):
        print(f"Input {i}:")
        print(text)
        print(f"Output {i}:")
        print(result)
    return results

In [None]:
data = [
    "The process of cellular respiration converts biochemical energy from nutrients into adenosine triphosphate (ATP), and releases waste products. Cellular respiration is a set of metabolic reactions and processes that take place in the cells of organisms to convert chemical energy from oxygen molecules or nutrients into ATP, and then release waste products.",
    "The first human heart transplant was performed by Dr. Christiaan Barnard on December 3, 1967, in Cape Town, South Africa. The patient, Louis Washkansky, lived for 18 days after the surgery.",
    "Rosalind Franklin was a British biophysicist and X-ray crystallographer whose work was critical in the understanding of the molecular structures of DNA, RNA, and viruses. She is best known for her photograph of DNA, known as Photo 51, which contributed significantly to the discovery of the DNA double helix by Watson and Crick.",
    "The Great Wall of China is a series of fortifications made of various materials, including stone, brick, tamped earth, wood, and other materials. It was built along the northern borders of China to protect against invasions and raids from various nomadic groups. The most well-known sections were built by the Ming Dynasty (1368-1644).",
    "The Theory of General Relativity, formulated by Albert Einstein, describes the gravitational force as a curvature of spacetime caused by mass and energy. One of its most famous predictions is the bending of light around massive objects, which has been confirmed through various experiments, including the observation of a solar eclipse in 1919.",
    "Marie Curie was a Polish-born physicist and chemist who conducted pioneering research on radioactivity. She was the first woman to win a Nobel Prize and the only person to win Nobel Prizes in two different scientific fields—Physics (1903) and Chemistry (1911). Her discoveries included the elements polonium and radium.",
    "The internet is a global network of interconnected computers that communicate via standardized protocols. It enables a wide range of services, including the World Wide Web, email, and file sharing. The internet has revolutionized communication, commerce, and access to information.",
]

GENERATE_QA_PAIRS_PROMPT = load_prompt(name="generate_qa_pairs")
results = []
for inp in data:
    print(f"Input: {inp}")
    result = apply_prompt(
        prompt=GENERATE_QA_PAIRS_PROMPT,
        inputs=[inp],
    )
    results.append(
        {
            "instruction": GENERATE_QA_PAIRS_PROMPT.template,
            "input": inp,
            # apply_prompt returns one result per input; store that single result, not the list.
            "output": result[0],
        }
    )

In [None]:
import json
from textwrap import dedent

from tinytextgrad import call_llm

prompt = dedent("""
json, dataset, generator

Extract question/answer pairs from the given text into a JSON array of objects:

```json
[
  { "q": <question>, "a": <answer> },
  ...
]
```
""").strip()

text = dedent("""
"1. What are some examples of services enabled by the internet?\nAnswer: Some examples of services enabled by the internet include the World Wide Web, email, file sharing, video conferencing, online gaming, social media platforms, and online banking.\n\n2. How has the internet revolutionized communication?\nAnswer: The internet has revolutionized communication by enabling instant messaging, video calls, email, social media interactions, and real-time collaboration tools, which have made global communication faster, easier, and more accessible.\n\n3. In what ways has the internet impacted commerce?\nAnswer: The internet has had a significant impact on commerce by enabling e-commerce platforms, online marketplaces, digital advertising, online payment systems, and global supply chains, which have transformed the way businesses operate and reach customers.\n\n4. Can you explain the concept of standardized protocols in relation to the internet?\nAnswer: Standardized protocols in the context of the internet refer to the set of rules and conventions that govern how data is transmitted, received, and interpreted between interconnected computers. Examples of standardized protocols include TCP/IP, HTTP, SMTP, and FTP.\n\n5. How does the internet facilitate access to information?\nAnswer: The internet facilitates access to information by providing a vast repository of resources such as websites, databases, online libraries, and search engines, which allow users to quickly find and retrieve information on a wide range of topics from anywhere in the world."
""").strip()

result = call_llm(
    prompt=prompt,
    prompt_input=text,
)
print(json.dumps(result, indent=2))
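
The cell above asks the LLM to turn numbered question/answer text into JSON. When the raw output follows the `N. question` / `Answer: answer` layout as regularly as the sample does, a deterministic parser may be enough. The following is a minimal sketch under that assumption; `parse_numbered_qa` and `QA_BLOCK` are hypothetical names, not part of `tinytextgrad`, and answers that contain blank lines would be cut short.

```python
import json
import re

# Matches blocks such as "1. <question>\nAnswer: <answer>", separated by blank lines.
QA_BLOCK = re.compile(
    r"^\s*\d+\.\s*(?P<q>.+?)\s*\n\s*Answer:\s*(?P<a>.+?)\s*(?=\n\s*\n|\Z)",
    re.MULTILINE | re.DOTALL,
)


def parse_numbered_qa(raw: str) -> list[dict[str, str]]:
    """Parse numbered question/answer text into [{"q": ..., "a": ...}, ...]."""
    # Drop surrounding whitespace and any literal double quotes wrapped around the text.
    cleaned = raw.strip().strip('"')
    return [
        {"q": m.group("q").strip(), "a": m.group("a").strip()}
        for m in QA_BLOCK.finditer(cleaned)
    ]


sample = (
    "1. What is ATP?\nAnswer: The cell's main energy currency.\n\n"
    "2. Where does cellular respiration occur?\nAnswer: Mostly in mitochondria."
)
pairs = parse_numbered_qa(sample)
print(json.dumps(pairs, indent=2))
```

An LLM call remains the more forgiving option when the model's output format drifts; the regex only covers the exact layout shown in `text`.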

In [None]:
import json

results[0]["output"] = json.dumps(
    [
        {
            "q": "What is the main purpose of cellular respiration?",
            "a": "The main purpose of cellular respiration is to convert biochemical energy from nutrients into adenosine triphosphate (ATP) and release waste products.",
        },
        {
            "q": "How does cellular respiration convert chemical energy into ATP?",
            "a": "Cellular respiration converts chemical energy from oxygen molecules or nutrients into ATP through a series of metabolic reactions and processes within cells.",
        },
        {
            "q": "What are the waste products released during cellular respiration?",
            "a": "The waste products released during cellular respiration include carbon dioxide and water.",
        },
        {
            "q": "Why is ATP important for cells?",
            "a": "ATP is essential for cells as it serves as the main energy currency, providing energy for various cellular processes and activities.",
        },
        {
            "q": "Can cellular respiration occur without oxygen?",
            "a": "Yes, cellular respiration can occur without oxygen through a process called anaerobic respiration, but it is less efficient compared to aerobic respiration which requires oxygen.",
        },
        {
            "q": "How does cellular respiration differ from photosynthesis?",
            "a": "Cellular respiration is the process by which cells convert energy from nutrients into ATP, while photosynthesis is the process by which plants and some other organisms convert light energy into chemical energy stored in glucose.",
        },
        {
            "q": "What organelle is primarily responsible for cellular respiration in eukaryotic cells?",
            "a": "Mitochondria are the organelles primarily responsible for cellular respiration in eukaryotic cells.",
        },
        {
            "q": "How is the energy stored in ATP used by cells?",
            "a": "The energy stored in ATP is used by cells to power various cellular processes such as muscle contraction, protein synthesis, and active transport across cell membranes.",
        },
    ]
)
results[1]["output"] = json.dumps(
    [
        {
            "q": "Who performed the first human heart transplant and when did it take place?",
            "a": "Dr. Christiaan Barnard performed the first human heart transplant on December 3, 1967, in Cape Town, South Africa.",
        },
        {
            "q": "What was the name of the patient who received the first human heart transplant?",
            "a": "The patient's name was Louis Washkansky.",
        },
        {
            "q": "How long did the patient, Louis Washkansky, live after the heart transplant surgery?",
            "a": "Louis Washkansky lived for 18 days after the surgery.",
        },
        {
            "q": "What was the significance of the first human heart transplant performed by Dr. Christiaan Barnard?",
            "a": "The first human heart transplant was a groundbreaking medical achievement that opened up new possibilities in the field of organ transplantation and cardiac surgery.",
        },
        {
            "q": "How did the medical community and the public react to the news of the first human heart transplant?",
            "a": "The news of the first human heart transplant generated significant interest and discussion in the medical community and among the general public, sparking debates on medical ethics, organ donation, and the future of surgical interventions.",
        },
    ]
)
results[2]["output"] = json.dumps(
    [
        {
            "q": "What is Rosalind Franklin best known for in the field of molecular biology?",
            "a": "Rosalind Franklin is best known for her photograph of DNA, known as Photo 51, which contributed significantly to the discovery of the DNA double helix by Watson and Crick.",
        },
        {
            "q": "How did Rosalind Franklin's work in X-ray crystallography impact the understanding of DNA, RNA, and viruses?",
            "a": "Rosalind Franklin's work in X-ray crystallography provided crucial insights into the molecular structures of DNA, RNA, and viruses, which laid the foundation for further discoveries in the field of molecular biology.",
        },
        {
            "q": "Who were the scientists credited with the discovery of the DNA double helix structure, and how did Rosalind Franklin's work play a role in it?",
            "a": "James Watson and Francis Crick are credited with the discovery of the DNA double helix structure. Rosalind Franklin's X-ray diffraction data, particularly Photo 51, provided key evidence that helped Watson and Crick formulate their model of the DNA structure.",
        },
        {
            "q": "What impact did Rosalind Franklin's contributions have on the field of biophysics and molecular biology?",
            "a": "Rosalind Franklin's contributions significantly advanced the understanding of molecular structures, particularly DNA, RNA, and viruses, in the fields of biophysics and molecular biology. Her work paved the way for groundbreaking discoveries in genetics and biochemistry.",
        },
        {
            "q": "How did Rosalind Franklin's expertise in X-ray crystallography differ from the approaches of other scientists studying DNA at the time?",
            "a": "Rosalind Franklin's expertise in X-ray crystallography provided a unique and detailed view of the molecular structure of DNA, which was different from the more theoretical and model-based approaches of other scientists studying DNA at the time.",
        },
    ]
)
results[3]["output"] = json.dumps(
    [
        {
            "q": "What materials were used in the construction of the Great Wall of China?",
            "a": "The Great Wall of China was built using various materials such as stone, brick, tamped earth, wood, and other materials.",
        },
        {
            "q": "Why was the Great Wall of China built?",
            "a": "The Great Wall of China was built along the northern borders of China to protect against invasions and raids from nomadic groups.",
        },
        {
            "q": "During which dynasty were the most well-known sections of the Great Wall of China built?",
            "a": "The most well-known sections of the Great Wall of China were built by the Ming Dynasty (1368-1644).",
        },
        {
            "q": "How did the Great Wall of China serve as a defense mechanism?",
            "a": "The Great Wall of China served as a defense mechanism by providing a barrier to protect against invasions and raids from nomadic groups.",
        },
        {
            "q": "What is the significance of the materials used in constructing the Great Wall of China?",
            "a": "The materials used in constructing the Great Wall of China, such as stone and brick, contributed to its durability and strength as a defensive structure.",
        },
        {
            "q": "How did the construction of the Great Wall of China change over different dynasties?",
            "a": "The construction of the Great Wall of China evolved over different dynasties, with each dynasty adding or modifying sections to strengthen the defenses against potential invaders.",
        },
        {
            "q": "What role did the Ming Dynasty play in the development of the Great Wall of China?",
            "a": "The Ming Dynasty played a significant role in building and fortifying the most well-known sections of the Great Wall of China, adding strategic features and enhancements to improve its defensive capabilities.",
        },
        {
            "q": "How did the Great Wall of China impact Chinese history and culture?",
            "a": "The Great Wall of China symbolizes China's historical efforts to protect its borders and has become a cultural icon representing strength, perseverance, and unity among the Chinese people.",
        },
    ]
)
results[4]["output"] = json.dumps(
    [
        {
            "q": "What is the Theory of General Relativity?",
            "a": "The Theory of General Relativity, formulated by Albert Einstein, describes the gravitational force as a curvature of spacetime caused by mass and energy.",
        },
        {
            "q": "How does the Theory of General Relativity explain the bending of light?",
            "a": "According to the Theory of General Relativity, the bending of light occurs due to the curvature of spacetime caused by massive objects.",
        },
        {
            "q": "What is one of the most famous predictions of the Theory of General Relativity?",
            "a": "One of the most famous predictions of the Theory of General Relativity is the bending of light around massive objects.",
        },
        {
            "q": "How was the prediction of light bending around massive objects confirmed?",
            "a": "The prediction of light bending around massive objects was confirmed through various experiments, including the observation of a solar eclipse in 1919.",
        },
        {
            "q": "What experimental evidence supports the Theory of General Relativity's prediction of light bending?",
            "a": "Experimental evidence supporting the Theory of General Relativity's prediction of light bending includes the observation of a solar eclipse in 1919.",
        },
        {
            "q": "How do mass and energy affect the curvature of spacetime according to General Relativity?",
            "a": "Mass and energy cause the curvature of spacetime according to General Relativity, leading to the gravitational force.",
        },
    ]
)
results[5]["output"] = json.dumps(
    [
        {
            "q": "What were Marie Curie's significant contributions to the field of science?",
            "a": "Marie Curie conducted pioneering research on radioactivity, discovered the elements polonium and radium, was the first woman to win a Nobel Prize, and is the only person to win Nobel Prizes in two different scientific fields: Physics and Chemistry.",
        },
        {
            "q": "How many Nobel Prizes did Marie Curie win, and in which fields?",
            "a": "Marie Curie won two Nobel Prizes. She won the Nobel Prize in Physics in 1903 and the Nobel Prize in Chemistry in 1911.",
        },
        {
            "q": "What makes Marie Curie unique in the history of the Nobel Prizes?",
            "a": "Marie Curie is the only person to have won Nobel Prizes in two different scientific fields—Physics and Chemistry.",
        },
        {
            "q": "What was the significance of Marie Curie's discovery of polonium and radium?",
            "a": "Marie Curie's discovery of the elements polonium and radium was significant as it contributed to the understanding of radioactivity and had a profound impact on the field of chemistry.",
        },
        {
            "q": "How did Marie Curie's work on radioactivity influence future scientific research?",
            "a": "Marie Curie's groundbreaking research on radioactivity laid the foundation for future advancements in nuclear physics and chemistry, shaping the course of scientific discoveries in these fields.",
        },
    ]
)
results[6]["output"] = json.dumps(
    [
        {
            "q": "What are some examples of services enabled by the internet?",
            "a": "Some examples of services enabled by the internet include the World Wide Web, email, file sharing, video conferencing, online gaming, social media platforms, and online banking.",
        },
        {
            "q": "How has the internet revolutionized communication?",
            "a": "The internet has revolutionized communication by enabling instant messaging, video calls, email, social media interactions, and real-time collaboration tools, which have made global communication faster, easier, and more accessible.",
        },
        {
            "q": "In what ways has the internet impacted commerce?",
            "a": "The internet has had a significant impact on commerce by enabling e-commerce platforms, online marketplaces, digital advertising, online payment systems, and global supply chains, which have transformed the way businesses operate and reach customers.",
        },
        {
            "q": "Can you explain the concept of standardized protocols in relation to the internet?",
            "a": "Standardized protocols in the context of the internet refer to the set of rules and conventions that govern how data is transmitted, received, and interpreted between interconnected computers. Examples of standardized protocols include TCP/IP, HTTP, SMTP, and FTP.",
        },
        {
            "q": "How does the internet facilitate access to information?",
            "a": "The internet facilitates access to information by providing a vast repository of resources such as websites, databases, online libraries, and search engines, which allow users to quickly find and retrieve information on a wide range of topics from anywhere in the world.",
        },
    ]
)
print(json.dumps(results, indent=2))

In [None]:
results.append(
    {
        "instruction": "Generate a balanced set of relevant questions and their corresponding answers about the given text. Ensure the questions cover a mix of factual, analytical, and application-based types to provide both surface-level and in-depth knowledge of the subject. If any required information is not available in the text, clearly state that it is not provided.",
        "input": "The ego is a psychological concept that represents the part of the human psyche responsible for mediating between the unconscious and the conscious mind. It plays a crucial role in personality development, decision-making, and reality testing. The ego helps individuals navigate their environment, manage impulses, and maintain a sense of self. In Freudian psychoanalysis, the ego is part of a tripartite model of the psyche, which also includes the id and the superego.",
        "output": '[{"q": "What is the ego in psychological terms?", "a": "The ego is a psychological concept that represents the part of the human psyche responsible for mediating between the unconscious and the conscious mind."}, {"q": "What are the main functions of the ego?", "a": "The main functions of the ego include playing a crucial role in personality development, decision-making, reality testing, helping individuals navigate their environment, managing impulses, and maintaining a sense of self."}, {"q": "How does the ego relate to Freudian psychoanalysis?", "a": "In Freudian psychoanalysis, the ego is part of a tripartite model of the psyche, which also includes the id and the superego."}, {"q": "How does the ego help in managing impulses?", "a": "The ego helps in managing impulses by mediating between the unconscious desires (often represented by the id) and the constraints of reality, allowing individuals to make decisions that balance their needs with social and environmental demands."}, {"q": "What is the relationship between the ego and reality testing?", "a": "The ego is responsible for reality testing, which involves assessing the external world and distinguishing between internal psychological experiences and external reality, helping individuals to adapt to their environment effectively."}, {"q": "How does the ego contribute to personality development?", "a": "The ego contributes to personality development by helping individuals form a stable sense of self, make decisions, and interact with their environment in ways that shape their unique characteristics and behaviors over time."}, {"q": "What might happen if the ego is not functioning properly?", "a": "If the ego is not functioning properly, an individual might struggle with decision-making, have difficulty managing impulses, experience a distorted sense of reality, or have problems maintaining a stable sense of self. However, specific consequences are not provided in the given text."}]',
    }
)
len(results)
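
Before the records are written to disk, it can help to check that each one has the expected shape: the keys `instruction`, `input`, and `output`, with `output` holding a JSON string that decodes to a list of `q`/`a` objects. The sketch below is one way to do that; `find_problems` and `REQUIRED_KEYS` are hypothetical helpers, and the two sample records are made up for illustration.

```python
import json

REQUIRED_KEYS = {"instruction", "input", "output"}


def find_problems(records: list[dict]) -> list[str]:
    """Return human-readable problems; an empty list means the records look consistent."""
    problems = []
    for i, record in enumerate(records):
        missing = REQUIRED_KEYS - record.keys()
        if missing:
            problems.append(f"record {i}: missing keys {sorted(missing)}")
            continue
        output = record["output"]
        if not isinstance(output, str):
            problems.append(f"record {i}: output is {type(output).__name__}, expected str")
            continue
        try:
            pairs = json.loads(output)
        except json.JSONDecodeError as exc:
            problems.append(f"record {i}: output is not valid JSON ({exc.msg})")
            continue
        # Every decoded item must be an object carrying both "q" and "a".
        if not isinstance(pairs, list) or not all(
            isinstance(p, dict) and {"q", "a"} <= p.keys() for p in pairs
        ):
            problems.append(f"record {i}: output is not a list of q/a objects")
    return problems


good = {
    "instruction": "Generate Q/A pairs.",
    "input": "Some text.",
    "output": json.dumps([{"q": "Q?", "a": "A."}]),
}
bad = {
    "instruction": "Generate Q/A pairs.",
    "input": "Some text.",
    "output": ["raw model text"],  # a list instead of a JSON string
}
problems = find_problems([good, bad])
print(problems)
```

Calling `find_problems(results)` in the notebook would flag any record whose `output` was left as raw model text rather than replaced with a JSON string.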

In [None]:
x = json.loads(results[-1]["output"])
print(json.dumps(x, indent=2))
with open("dataset.json", "w") as f:
    json.dump(results, f, indent=2)
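
Each record in `dataset.json` carries `instruction`, `input`, and `output` fields. The notebook does not say which fine-tuning framework or prompt template will consume the file, so the following is only a sketch of how one record might be rendered into a single training string. The `### Instruction:` / `### Input:` / `### Response:` headers and the `format_example` helper are assumptions, modeled on a commonly used instruction-tuning layout.

```python
import json


def format_example(record: dict[str, str]) -> str:
    """Render one record as a single training string (hypothetical template)."""
    return (
        "### Instruction:\n"
        f"{record['instruction']}\n\n"
        "### Input:\n"
        f"{record['input']}\n\n"
        "### Response:\n"
        f"{record['output']}"
    )


# Made-up record with the same keys as the dataset entries.
example = {
    "instruction": "Generate question/answer pairs about the given text.",
    "input": "Some text.",
    "output": json.dumps([{"q": "Q?", "a": "A."}]),
}
formatted = format_example(example)
print(formatted)
```

Whatever template is eventually chosen, the same one would typically be used at inference time so the model sees prompts in the format it was trained on.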