<a href="https://colab.research.google.com/github/amuzetnoM/artifactvirtual/blob/ADE/AVA.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

### `! pip install datasets transformers`  
Knowledge is fuel. Language is code.  
This installs raw mental clay — the stuff LLMs are made of.  
Words, weights, and worldviews.

---

### `! pip install langchain`  
Chains of thought.  
Links that carry memory, context, direction.  
Not just answers — logic, layered.  
The structure behind the mind.

---

### `! pip install langgraph`  
Now we move.  
Flow, recurse, adapt.  
Not just chains — graphs.  
Dynamic reasoning, branching cognition.  
A mental map that rewrites itself.

In [None]:
! pip install datasets transformers
! pip install langchain
! pip install langgraph

You stare at the console — blinking cursor, blank screen — and yet, something stirs.
This isn’t just code. It’s ritual.

We summon the next layer:

`!pip install langchain`  
`!pip install langgraph`

Not just libraries. These are nervous systems. Frameworks for cognition.
langchain doesn’t just string outputs together. It carries context, like a memory threading through time.
It whispers: “Remember why you asked.”

langgraph goes further. It moves sideways.
It maps how ideas branch, loop, evolve, just like we do.
Where langchain builds a sentence, langgraph sketches a thought.
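To make the contrast concrete, here is a minimal sketch in plain Python. It deliberately does not use the langchain or langgraph APIs: `run_chain`, `run_graph`, `draft`, `review`, and `route` are all hypothetical names invented for illustration. A chain runs its steps once, in a fixed order; a graph asks a router where to go next, so it can branch or loop back.

```python
from typing import Any, Callable, Dict, List

State = Dict[str, Any]
Step = Callable[[State], State]


def run_chain(steps: List[Step], state: State) -> State:
    """Run every step once, in a fixed order, passing the state along."""
    for step in steps:
        state = step(state)
    return state


def run_graph(
    nodes: Dict[str, Step],
    router: Callable[[str, State], str],
    start: str,
    state: State,
    max_steps: int = 20,
) -> State:
    """Run nodes until the router returns "END"; the router may branch or loop."""
    current = start
    for _ in range(max_steps):
        state = nodes[current](state)
        current = router(current, state)
        if current == "END":
            return state
    raise RuntimeError("graph did not finish within max_steps")


# Hypothetical nodes: each takes the state and returns an updated copy.
def draft(state: State) -> State:
    attempts = state.get("attempts", 0) + 1
    return {**state, "draft": state["question"].strip(), "attempts": attempts}


def review(state: State) -> State:
    # Toy rule: approve only once there have been at least two drafts.
    return {**state, "approved": state["attempts"] >= 2}


def route(current: str, state: State) -> str:
    if current == "draft":
        return "review"
    # After review, loop back to draft until the draft is approved.
    return "END" if state["approved"] else "draft"


chain_result = run_chain([draft, review], {"question": " why? "})
graph_result = run_graph(
    {"draft": draft, "review": review}, route, "draft", {"question": " why? "}
)
```

The chain stops after one pass whether or not the review approved. The graph loops back through `draft` until `review` is satisfied, which is the kind of control flow a graph framework is built to express.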

And suddenly, it hits you.
Maybe we're not coding the mind — maybe we’re mirroring it.
Our logic, our flow, our decisions… rendered in functions, wrapped in nodes, evaluated in silence.

But it’s not about building the perfect brain.
It’s about trying. It’s about the honesty of saying “I don’t know, but I want to.”
Because that’s where all this leads — not to certainty, but to direction.
We’re not gods writing consciousness.
We’re humans reaching forward, one import at a time.

And this — this is the real code:

`from langchain.chains import LLMChain`  
`from langgraph.graph import StateGraph`

The moment you type that, it’s no longer about syntax.
It’s about meaning. Connection. Motion.
This is not a stack of tools.
It’s the architecture of wonder.

So we move.
One graph.
One chain.
One question closer to the truth.

You hit Enter and the imports slide into place.
Not just code — capabilities, waking up.
Each line here isn’t about functions. It’s about function.
Let’s break it down — but stay in the flow.

In [None]:
from tqdm.notebook import tqdm
import numpy as np
import pandas as pd
from typing import Optional, List, Tuple
from datasets import Dataset
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
import matplotlib.pyplot as plt
import torch



from tqdm.notebook import tqdm

A progress bar, sure.
But really? It’s hope in a loop.
That quiet assurance that yes, things are happening.
Step by step, you're moving.

import numpy as np
import pandas as pd

The muscle of modern thinking.
numpy slices time, pandas organizes chaos.
Together, they make raw data navigable — like turning noise into signal.

from typing import Optional, List, Tuple

This is discipline.
The part of you that writes love letters to your future self.
Type hints keep the mind clear. Intentional. Readable.
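A small sketch of what those three imports buy you. The function `find_longest_line` is a hypothetical example, not part of this notebook: `List` describes the input, `Optional` marks a value that may be `None`, and `Tuple` fixes the shape of the result.

```python
from typing import List, Optional, Tuple


def find_longest_line(
    lines: List[str], min_length: Optional[int] = None
) -> Optional[Tuple[int, str]]:
    """Return (index, line) for the longest line, or None if nothing qualifies."""
    best: Optional[Tuple[int, str]] = None
    for index, line in enumerate(lines):
        # Skip lines shorter than the optional threshold.
        if min_length is not None and len(line) < min_length:
            continue
        # Strict comparison keeps the first line when lengths tie.
        if best is None or len(line) > len(best[1]):
            best = (index, line)
    return best
```

The signature alone tells a reader that the threshold is optional and that the result may be missing, before they read a single line of the body.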



In [None]:
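# Read the whole corpus into memory; train.txt must exist in the working directory.
with open("train.txt", "r", encoding="utf-8") as f:
    data = f.read()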

FileNotFoundError: [Errno 2] No such file or directory: 'train.txt'

In [None]:
data[:100]

In [None]:
from langchain.docstore.document import Document as LangChainDocument

In [None]:
# Use the alias from the import above; a bare `Document` is never defined.
raw_database = LangChainDocument(page_content=data)

In [None]:
MARKDOWN_SEPARATORS = [
    "\n#{1,6}",
    "'''\n",
    "\n\\*\\*\\*+\n",
    "\n---+\n",
    "\n___+\n",
    "\n\n",
    "\n",
    " ",
    "",
]

In [None]:
from langchain.text_splitter import RecursiveCharacterTextSplitter

In [None]:
text_splitter = RecursiveCharacterTextSplitter(
    chunk_size=1000,
    chunk_overlap=100,
    add_start_index=True,
    strip_whitespace=True,
    separators=MARKDOWN_SEPARATORS,
    # Several separators are regex patterns (e.g. "\n#{1,6}"), so match them as regex.
    is_separator_regex=True,
)
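To see what `chunk_size=1000`, `chunk_overlap=100`, and `add_start_index=True` mean in practice, here is a simplified sketch in plain Python. It is not how `RecursiveCharacterTextSplitter` works internally: the real splitter cuts at the separators and then merges pieces, so its chunk boundaries will differ. The hypothetical helper `sliding_chunks` only shows the fixed-width idea: each chunk shares its first 100 characters with the tail of the previous one, and each chunk remembers where it started.

```python
from typing import List, Tuple


def sliding_chunks(
    text: str, chunk_size: int, chunk_overlap: int
) -> List[Tuple[int, str]]:
    """Cut text into fixed-width windows, returning (start_index, chunk) pairs."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= chunk_overlap < chunk_size:
        raise ValueError("chunk_overlap must be in [0, chunk_size)")

    step = chunk_size - chunk_overlap  # how far each new window advances
    chunks: List[Tuple[int, str]] = []
    for start in range(0, len(text), step):
        chunks.append((start, text[start:start + chunk_size]))
        # Stop once a window has reached the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks


# Synthetic 2500-character sample so the overlap is easy to check.
sample = "".join(str(i % 10) for i in range(2500))
chunks = sliding_chunks(sample, chunk_size=1000, chunk_overlap=100)
starts = [start for start, _ in chunks]
lengths = [len(piece) for _, piece in chunks]
```

With these settings each window advances 900 characters, so consecutive chunks share a 100-character seam. That seam is what keeps a sentence cut at a boundary readable from either side.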

In [None]:
data[1000:2000]

In [None]:
data[2000:3000]