Questions regarding State/Action inputs for GR00T and Pi0.5 #36
Hi! I recently tested it on my Jetson Thor and the performance is impressive!
I'd appreciate any guidance or insights you can share. Thanks again for this awesome repository!
Replies: 2 comments 1 reply
Thank you so much for testing it on Thor!
Yes, these inputs should eventually be supported. There are still quite a few features that are not fully aligned yet, such as GR00T N1.7 GPU support, state input support, FP8 / FP16 / BF16 path coverage, and some model-specific interfaces. They are all on the roadmap.
Right now, my main focus is to support more model types and architectures first, so that FlashRT can cover a broader range of kernels and execution patterns. As you can see from the current performance, the core value of the project comes from the reusable kernel/runtime library underneath.
Some smaller feature gaps and input interfaces are hard for me to cover quickly by myself, but they are definitely planned. The current priority is roughly:
I will also try to fill in these model input/interface gaps as much as possible. Contributions are very welcome if you are interested in helping with any of these adaptations, so feel free to open a PR or discussion anytime. Thanks again for the feedback and for trying FlashRT on Thor!
Updated state handling to the public VLA API in #50.