Tiny Vision Language Action (VLA) Model for Robot Control.
- This model is based on the RT-2 model, but it is scaled down to a very small size for robot control.
- The architecture builds on small but robust VLMs (Vision Language Models) such as
MiniCPM-V2, TinyLLaVA, and PaliGemma.
paligemma_based: PaliGemma-based VLA model.
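
RT-2, which this model is based on, represents robot actions as discrete tokens by uniformly discretizing each continuous action dimension into 256 bins. The sketch below illustrates that idea in plain Python. It is not taken from this repository: the function names, the [-1, 1] action range, and the bin-centre decoding are assumptions made for illustration only.

```python
from typing import List, Sequence

NUM_BINS = 256  # RT-2 uses 256 uniform bins per action dimension


def discretize_action(
    action: Sequence[float],
    low: float = -1.0,   # assumed lower bound of the normalized action range
    high: float = 1.0,   # assumed upper bound of the normalized action range
    num_bins: int = NUM_BINS,
) -> List[int]:
    """Map each continuous action value to a bin index in [0, num_bins - 1]."""
    bins = []
    for value in action:
        clipped = min(max(value, low), high)       # keep the value inside the range
        fraction = (clipped - low) / (high - low)  # rescale to [0, 1]
        # The upper bound itself would land on index num_bins, so cap it.
        bins.append(min(int(fraction * num_bins), num_bins - 1))
    return bins


def undiscretize_action(
    bins: Sequence[int],
    low: float = -1.0,
    high: float = 1.0,
    num_bins: int = NUM_BINS,
) -> List[float]:
    """Map bin indices back to continuous values at the centre of each bin."""
    width = (high - low) / num_bins
    return [low + (index + 0.5) * width for index in bins]


if __name__ == "__main__":
    # Hypothetical 3-dimensional action, used only to show the round trip.
    tokens = discretize_action([-0.7, 0.3, 0.9])
    print(tokens)
    print(undiscretize_action(tokens))
```

In a VLA model of this kind, the resulting bin indices would be mapped to tokens in the language model's vocabulary so that actions can be predicted like text. The round trip loses at most half a bin width per dimension.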