A simplistic pytorch implementation of LongVit using my previous implementation of LongNet as a foundation.
ai
ml
artificial-intelligence
attention
attention-mechanism
attention-is-all-you-need
transformer-architecture
transformer-models
gpt3
gpt4
-
Updated
Nov 12, 2024 - Shell