How long does it take to achieve convergence when starting training from PrismaticVLM?
When loading the OpenVLA checkpoint and training with BridgeV2 for 5k iterations, some success rate is observed.
However, if loading the prism-dinosiglip-224px+7b model—first pretrained on the OpenX dataset with 16 A100 GPUs for 40k iterations, then trained on BridgeV2 for 25k iterations—the success rate remains zero.