About

This is a game written from September 27-29, 2024, for the Text Adventure Hack.

The main branch is deployed on intra-game.vercel.app

Installation instructions

This is a Next.js. It can be deployed on Vercel, run locally... or probably run elsewhere without too much trouble.

To run this you'll need a Google Gemini API key (you should be able to get one for free, and run this game using a free-tier API key). (Why Gemini?) Put the key in .env.local as GEMINI_KEY="..."

The tech stack:

Next.js
- React
- TypeScript
Tailwind for styling
Preact Signals for state management
Actual game state is stored in browser localStorage
One wee little server endpoint
The Gemini LLM

Getting Started

Sign up for a Gemini API Key and put it in .env.local:

GEMINI_KEY="your key"

Run npm install to install and npm run dev to run locally.

Open http://localhost:3000 with your browser to see the result.

Code

There are only a few important parts of the code! Note that many contain spoilers.

types.ts defines (almost) all the types used throughout the game, outside of classes
model.ts manages game state in the browser and the user/game interaction
world.ts represents a moment in the world, and is a getter for most objects
classes.ts has the class for Entity and its children (e.g., Room, Person, Player). These manage most of the actual game play!
gameobjs.ts is the instantiation of all the actual game state, individual people and rooms, etc. (though all logic is in classes.ts)
application.tsx is the entire UI
dossier.md is the game background. I wasn't able to use much of this given the constrained implementation time, but I hope to eventually!

There's a little more UI in components/ and some libraries in lib/, but they are mostly generic and secondary. Besides application.tsx app/ is mostly boilerplate or glue.

FAQ

What are you trying to accomplish with this?

This began as a submission to the RetroAI Quest Hackathon, which asked for an AI-powered retro-feeling text game. I had already been thinking a lot about this subject, so my appreciation to the hackathon runners for inspiring me to actually do it!

There's lots of LLM-based games that let the LLM hallucinate the entire story. But these have a dreamlike quality to them... things come into existance only as they are imagined. They are ungrounded. A normal text adventure has a very strict structure, with a set of formal commands to navigate that structure.

In this game I'm trying to have a bit of both. There's an underlying game model and a grounding to the story, but with opportunities for the user and LLM to navigate that together in imaginative ways.

Why Gemini?

Entirely because Gemini offers a generous free tier, making it possible to share this application without much risk of large API charges.

Is Gemini good? Eh. It's no GPT or Claude. But it'll do.

Are Gemini safety controls a problem?

Why yes they are, I'm glad I you asked!

I am entirely ok with this being a PG-rated game. Unfortunately the safety controls often enforce something closer to a G rating. Even the most minor insults will trigger safety controls, such as "you are dumb." It's hard to get any characters to fight, or to get the narrator to describe anything but de-escalation... I'm mostly okay with this, but even attempting will often lead to a safety violation. Once I get to implementing the dance-off battle will Gemini thwart my attemps with its safety controls? Time will tell!

How does the LLM interaction work?

In general the LLM should always output responses enclosed in tags. It's not "real" HTML or XML or any kind of markup, it's just ways to wrap different kinds of output that can appear in a single response.

I deliberately did not use Tools or function calls for this. In part because Gemini isn't very good at them. But also the basic model of how a tool works isn't good for a game. The providers all generally expect:

Setup the situation as a prompt
Get the LLM to emit one or more tool calls (and handling the LLM's reluctance to actually emit multiple calls)
"Return" the "execution" of the tool call. This makes sense if the LLM is looking up information, but for a game it's mostly just updating game state and there's not interesting results.
Prod the LLM into finishing the response given the result of those executions.

I really want the LLM to simply state what should happen, and then make it happen, and not return to the LLM at all.

Also when getting an LLM to simulate some fictional entity it's very useful to present the task as something like dialog generation, and never make the LLM "pretend" to be another person. Instead the LLM plays the part of a script writer. Tags get the LLM into the mode of writing dialog or simulating the effect of actions instead of the unnecessary and difficult task of changing the LLM's primary (helpful) personality to be something else.

What's with the update stream?

If you look at the code you'll notice that the only real mutable structure is a stream of updates. There are "views" of the current state of the game given all the updates, but all edits are new updates added to the stream.

I find it's easier to work with and debug the system if the whole things is one long list of updates:

You get a chance to see all the changes; the changes become concrete saved objects. You don't just have to use lots of console.log statements and try to figure out how state updates.
You get undo that can undo everything automatically. This is important because it's common to try something, get the wrong response, change the prompt, and you want to retry that same thing.
There's an opportunity to understand history. For instance if you want to know what an NPC character experienced you can know where the NPC was in the past, including only the history that was visible to the NPC.

Is this based on something?

I've done other similar projects in the past, much of which I wrote about in Roleplaying driven by an LLM: observations & open questions, but this is an independent piece of code. I do feel like I'm developing a "game engine" of sorts, but for now I'm developing a game that happens to have an engine as part of it.

Is this generated with AI?

I use Copilot and GPT extensively, but no large chunks are created independently by AI. But much of the game dossier was created in close collaboration with GPT.

Security?

Because it all runs in the browser locally it's mostly fine... but it also means that there's a way to run arbitrary Gemini LLM calls with the deployed API key because the server isn't aware of what makes up a valid prompt or the state of any particular game. Pretty please don't abuse this. It's not even worth it, right? There's so many easier ways to run things on an LLM.

Name		Name	Last commit message	Last commit date
Latest commit History 105 Commits
__tests__		__tests__
app		app
components		components
docs		docs
lib		lib
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
babel.config.json		babel.config.json
next.config.mjs		next.config.mjs
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tailwind.config.ts		tailwind.config.ts
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Installation instructions

Getting Started

Code

FAQ

What are you trying to accomplish with this?

Why Gemini?

Are Gemini safety controls a problem?

How does the LLM interaction work?

What's with the update stream?

Is this based on something?

Is this generated with AI?

Security?

About

Releases

Packages

Languages

License

ianb/intra-game

Folders and files

Latest commit

History

Repository files navigation

About

Installation instructions

Getting Started

Code

FAQ

What are you trying to accomplish with this?

Why Gemini?

Are Gemini safety controls a problem?

How does the LLM interaction work?

What's with the update stream?

Is this based on something?

Is this generated with AI?

Security?

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages