Skip to content
Rinter edited this page Mar 31, 2021 · 121 revisions

The wiki documents my findings and observations attempting to condense World Info (WI) entries down as compact as possible, down to the last character. This is a fork of the Futureman format, but has deviated away from it more and more each day.

What do you need?

  • This format, like Futureman, is designed to work with EWIJSON.
    • New here? What the heck is EWIJSON?! It's a script that provide more control over how AID uses WIs. Normally, AID puts a triggered WI up at the very top (hidden to viewer), but EWI lets you place it anywhere. Because of distance weakening (the higher up something is, the weaker its importance to the AI's writing direction), combined with concepts that are inherently weak already (unusual fantasy races, weird behaviors, concepts the AI doesn't understand well), EWI is a useful tool to move these down lower to counter this problem.
  • Time. This format is tedious and time consuming. You'll need to have a token viewer handy, like this one. There are others out there. Ask around on the discord for other options.
  • Dragon model. This format relies on safe assumption and griffin is not good as dragon for doing this.

Strengths and Weaknesses

  • It's primary strength is reducing WIs to as few characters as possible. This allows more characters to be reserved for the actual story content instead, and/or being able to trigger more WIs without overwhelming the memory.
  • This format does not make any promises on being the most accurate or strongest of the formats. There is a balancing act between being accurate enough to frustrate the user and conserving memory by cutting corners and dependence on safe assumption.
  • This format normally does not concern itself with an overbearing amount of entity definition. It's more about pointing the AI in the right direction and letting it do its thing.
  • Neither a strength nor a weakness, but full disclaimer: I primarily play in 3rd person mode, and only using story input (as opposed to Do or Say input), and with a 1.1 randomness. My bread and butter genre is high fantasy medieval in a world with a wide variety of colorful races.

Character example

This character is defined with 3 WIs, and each one will be explained word for word below.

1/3: placed in the p=1 position, this is an ultra-brief WI just to establish the basics that would be more jarring to see incorrect.

< Cheeps♀AvianPink feathers>

2/3: the remainder of the character is defined here, placed at p=3 or 5 or 7 depending on their importance or difficulty (how hard it is for the AI to understand).

< Cheeps ≡ scatterbrainGreeneyesnicebeak slender BreastSmall& necklaceofbell She servant povertyin Wicklow>

3/3: their race, which can of course be recycled for any other characters using it.

<Avian ≡Beastkin tall featherbirdfolklongleg beak crest feathersinsteadhair Their arms ≡wingsyet cannotfly>

Explaining space removal and capitalization

This breakdown explains when spaces are removed, or when capitalization is done.

  • The space before a word is removed...
    1. ...if the word does not get fragmented. Common words are 1 word = 1 token, but when you mash them together, they made get fragmented up into character clusters. This is just something you'll need to check in a token viewer. If the word is already multi-token by nature, and it does not get any more fragmented than it already is by removing the space, this is fine too.
    2. ...if the word is not the subject noun (for example Cheeps is the subject noun). This is to help with exact token ID matching to help the AI reference better. In most instances, the word would have a space before it.
    3. ...if the word is the subject noun, but it's more for internal reference than it for story use. For instance, Avian is more for internal reference, used by Cheep's WI, than it is actually written in the story.
    4. ...if mashing two words together does not cause ambiguous confusion. Does the adjective apply to the word before it, or after it? Space or no space, either scenario is confusing to the AI and the adjective should be isolated from the wrong noun in some way (examples shown later).
    5. ...if the space is not following a comma. This tends to leak into the story.
  • A word is capitalized...
    1. ...if the lowercase word would get fragmented, but not the uppercase form.
    2. ...if either form are fine, but the uppercase word proves to work better in testing.
    3. ...if the uppercase word does not get mistaken as being a proper noun in some way, revealed via testing. For example, Graybeak was frequently mistaken as being a name.
    4. ...if you're repeating the subject noun through pronoun form, as if starting another sentence.

Example breakdown

First is WI 1 of 3, defined at the p=1 slot.

< Cheeps♀AvianPink feathers>
  • First, the encapsulation. Those familiar with futureman may already be aware of << >>>> encapsulation.
    • Futureman encapsulation is more resistant to the WI format leaking. Most of the time I do not encounter leaking. When I did, it was typically when I was breaking the fourth wall and prompting the AI to try and test out a WI for accuracy.
    • The odds of leaking increasing the more WIs you have currently triggered, overwhelming the content. This can be countered with the [f] filter argument.
    • More niche, I've seen the format leak when talking to characters whom intentionally have bad grammar for whatever reason.
    • Monky, author of futureman, states < > is the second best encapsulation according to his testing. It should be noted, however, there are a wide variety of tests and scenarios to account for, and what reflects in one set of variables may not reflect in another. In my experience, < > has suited me well. This format is all about shaving off every last little character, and little by little it adds up.
  • A leading space because it's the primary subject noun.
  • The name itself. If you have the luxury of picking any name you'd like, try to pick one that is token-friendly. Here are some thoughts on this:
    • If you use an irl name that's somewhat common, there's a good chance it may be a single token, and the corpus may already include gender bias to it. But it may also include some other undesirable biases, you never know. Single token names are very strong with keeping the AI's facts lined up.
    • Single token names-that-aren't-names seem to work fairly well, especially for animals. For example Lumin or Snow.
      • Names that are self descriptive of a trait (for example Snow having white fur) helps a bit, too.
    • When it comes to multi-token names, I aim for two tokens if possible. But what's more important is token length. The more open to interpretation a fragmented token is, the worse.
      • For example if a token splits up in such a way that one of the tokens is I or He or some other word, the AI could read this the wrong way.
      • Single-letter tokens may sometimes be misunderstood as being typos.
    • 3+ letter tokens (that aren't confusing words) are rather safe. In this example, Cheeps is Che + eps and there is practically no room for confusion.
    • Even if you do everything "perfectly", there's a chance it may still mangle a name due to built-in repeat penalty, depending on how frequently it has been written.
  • The gender symbol, attached directly to the name, no space. After using gender symbols for a few weeks, I've nothing but good things to say about them. But, some notes:
    • Personally, I've only really encountered the AI getting female characters' gender mixed up, especially when using fantasy races that are male biased, such as more bestial creatures. I've noticed a marked improvement in keeping these outlying cases better respected since switching to the symbol.
    • Non-binary symbols also work very well. If it's a herm you're after, ⚥ is all you need.
  • Next, the race is defined. No leading space because our race WI also lacks a leading space.
    • If the race was more of a central theme in the story, I may give it a leading space. But in general, generic entities or concepts are less sensitive to this than individual characters are.
  • Next the primary integument color. Yes, color! Everyone's favorite and most frustrating topic for AID to mess up. With EWI, color issues are drastically reduced, but here are some key notes:
    • The AI is very sensitive to distance weakening here. The pink feather here should be well respected, but the green eyes mentioned in the 2nd WI could be less accurate, but still drastically more accurate compared to normal WIs.
    • The more colors you throw into description, the more likely you'll encounter contradictions. Ask yourself what is actually important to define and what you're okay with letting the AI just take a guess on.
      • ...and also keep in mind, color testing is a trap. If you go out of your way to test colors, you may always walk away disappointed compared to how the AI will actually end up using the colors in real-world play. Most typically, the storytelling may only concern itself with eye color and primary integument color.
    • Normally, having a color before the noun is stronger than having it follow. When the AI sees a color, it looks at the words on either side of it, and in some cases, the word beyond that as well. Try to ensure these surrounding words are safe, meaning they should either be colored, or a word that does not relate to colors.
  • Finally, the integument itself, which is feathers for this character.

Next is WI 2 of 3, defined at the p=5 slot.

< Cheeps ≡ scatterbrainGreeneyeNicebeak slender BreastSmall& necklaceofbell She servant povertyin Wicklow>
  • Encapsulation and already explained, so moving onto the Triple Bar . We borrow this from the Catnip format and give it a leading space because the leading space version is only 1 token, and 2 tokens without.
    • This symbol means identical to
    • Whereas futureman/caveman is more about a devolved form of prose, this format is more about listing out attributes in a particular order, only using grammatical words to aid when tests are faltering, as you'll see in later examples.
  • The first attribute should be whatever is most important to you. The closer the attribute is to the subject noun, the more weight it will get. In this case, we wanted this character to be portrayed as comedic relief: scatterbrain
  • Like before, the only way to get the color, without a leading space, into a single token was to use its capitalized form.
  • eye is written instead of eyes. Shaving off every character we can, we always use shorter variants of words if we can get away with it, drawing on the strength of dragon's understand to rewrite this properly when actually used.
    • Sometimes you need eyes, plural, depending on what the token viewer shows you.
  • Nice serves a couple purposes.
    • Primarily, it was used to distance the color Green from beak, as it was otherwise commonly applying this color to it. Since you try to place related attributes together (we're talking about facial features in this cluster of words), there wasn't anywhere else to put beak.
    • It's intentionally ambiguous. The AI is free to apply it to the beak, the eyes, or think of it as a personality attribute. It's otherwise just a filler word.
  • beak to try and steer the AI away from thinking we're talking about an anime-style harpy.
  • While thin and slim would have been less characters than slender, this word demonstrated more likeable descriptions that I was going for, when testing. In fact, this word is arguably the strongest word in the WI as far as what a general description will write.
  • BreastSmall. No this doc is not meant to be NSFW orientated, but this example was left here to talk about the subject of bias. Female characters whom find themselves in even the slightest level adult situations are often depicted as having big breasts. You can thank the internet for that. This bias is so strong that traditional WIs seem to go ignored trying to fight it. With EWI's reduced distance penalty, it can counter this.
  • The & is used to separate Small from being applied to the necklace.
  • necklaceofbell is an odd one. You'd think bell necklace would work better, but flipping it around showed stronger results in testing.
    • You can sometimes get away with not needing to write wear or any helper words like that. If the AI sees something wearable, it knows what to do with it. Sometimes. Just something you need to test for.
  • She with a leading space because we're pretending like we're starting a new sentence here. You can only slap so many attributes onto a subject noun before you need to drop a pronoun to refresh the AI on who/what it is you're even talking about. If you start seeing stuff in the back of your WI going ignored, it may need one of these.
  • servant her job. We draw on the power of assumption here, as you don't need to actually write occupation or works because if the AI sees something job-like, it often knows what to do with it.
    • ...but not always. Some jobs are stronger than others. If you find it's going ignored, you may need to add those helper words.
      • ...or try another word for the job. Before I had maid but wasn't nearly as strong as servant.
  • povertyin Wicklow both specifies a social class and implies where she lives.
    • Like before, some words are stronger than others. Before, I had poor or ispoor, but testing showed the AI paid more attention to poverty.

Next is WI 3 of 3, also at the p=5 slot.

<Avian ≡Beastkin tall featherbirdfolklongleg beak crest feathersinsteadhair Their arms ≡wingsyet cannotfly>
  • The space before the race name was removed for nuanced reasons explained above. Consider adding the space back if you suspect the WI accuracy is not working as well as it should be.
  • Beastkin is a species category. This is a self-describing word that the Dragon model seems to understand. I do have a WI for it though: <BeastkinCategoryofspeciesanimal folk> at p=7 just for a bit of extra help, but it works fine without it.
  • tall featherbirdfolklongleg beak crest all pretty self explanatory...
  • feathersinsteadhair is of special note. The AI loves giving head hair to anything sentient. The power of instead is pretty good at steering it in another direction compared to trying to use negation trickery.
    • That said, it still gets it wrong sometimes, but this method shows a big improvement. Still looking into more, compact reliable methods...
  • Much like the She pronoun in the previous WI, Their acts as a new sentence starter.
  • arms ≡wingsyet cannotfly this is an example of trying to fight a very strong bias. When the AI sees birds/wings, it immediately thinks of flying. It took me hours of trial and error testing out different methods to come up with something that respected the concept of flightless wings.

Clone this wiki locally