Hidden Engrams: Long Term Memory for Transformer Model Inference

State-of-the-art transformer models like GPT-3 can generate realistic text, but the window of text the transformer is able to look at is still relatively small. Hidden Engrams aims to remedy this problem by introducing an approximation of long-term memory built from the transformer's hidden states. These hidden-state values can then be used to quickly sort all past "memories" by relevance to the current input. Once sorted, an optimized prompt can be built that includes only the most relevant information.
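The sorting step described above can be sketched roughly as follows. This is a minimal illustration and not the repository's implementation: it assumes each engram is a fixed-length vector of floats derived from hidden states, and it assumes cosine similarity as the relevance measure, which this README does not specify. The names `cosine_similarity` and `rank_memories` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_memories(query_engram, memories):
    """Return memory texts sorted from most to least relevant.

    memories is a list of (text, engram) pairs. The similarity metric is an
    assumption made for this sketch, not something taken from the project.
    """
    scored = [(cosine_similarity(query_engram, engram), text)
              for text, engram in memories]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [text for _, text in scored]
```

In practice the vectors would come from the configured transformer rather than being written by hand, and a real store would likely batch the comparison instead of looping in Python.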

Usage

First, ensure the transformer model you want to use is configured properly in transformer.py. Engrams are incompatible across different models, so memories encoded with one model cannot be reused with another. To encode your own datasets, modify encode.py as needed to load your data. example.py provides a simple example use case: chat bots. Previous messages are encoded and stored, then used to build future prompts.
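The chat bot flow described above (store encoded messages, then build a prompt from the most relevant ones) might look something like the sketch below. It does not reproduce example.py: the `ChatMemory` class, its methods, the cosine-similarity ranking, and the character budget standing in for a token limit are all assumptions made for illustration. Engrams are passed in as plain lists of floats so the sketch stays independent of any particular model.

```python
import math


def _cosine(a, b):
    """Cosine similarity; returns 0.0 when either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


class ChatMemory:
    """Hypothetical store of past messages and their engram vectors."""

    def __init__(self):
        self._entries = []  # (arrival index, text, engram)

    def add(self, text, engram):
        self._entries.append((len(self._entries), text, list(engram)))

    def build_prompt(self, query_text, query_engram, max_chars=200):
        # Rank stored messages by similarity to the current input (assumed metric).
        ranked = sorted(self._entries,
                        key=lambda e: _cosine(query_engram, e[2]),
                        reverse=True)
        budget = max_chars - len(query_text)
        chosen = []
        for index, text, _ in ranked:
            cost = len(text) + 1  # +1 for the newline separator
            if cost > budget:
                continue  # skip messages that no longer fit
            chosen.append((index, text))
            budget -= cost
        chosen.sort()  # restore chronological order for readability
        return "\n".join([text for _, text in chosen] + [query_text])
```

Selected messages are emitted in their original order so the prompt still reads like a conversation; whether the project orders them this way is not stated in this README.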

This is a very early proof-of-concept. More to come soon!

@misc{hiddenengrams,
  author = {Luke Fay aka AeroScripts},
  title = {Hidden Engrams: Long Term Memory for Transformer Model Inference},
  howpublished = {\url{https://github.com/AeroScripts/HiddenEngrams}},
  year = 2021,
  month = June
}
