in readme, the inference code is below
python scripts/inference_human_prediction.py \ --config VITRA-VLA/VITRA-VLA-3B \ --image_path ./examples/0002.jpg \ --sample_times 4 \ --save_state_local \ --use_right \ --video_path ./example_human_inf.mp4 \ --mano_path ./weights/mano \ --instruction "Left hand: None. Right hand: Pick up the picture of Michael Jackson." \
but in fact dont have --config paras
Could you provide the latest inference code that works well?
in readme, the inference code is below
python scripts/inference_human_prediction.py \ --config VITRA-VLA/VITRA-VLA-3B \ --image_path ./examples/0002.jpg \ --sample_times 4 \ --save_state_local \ --use_right \ --video_path ./example_human_inf.mp4 \ --mano_path ./weights/mano \ --instruction "Left hand: None. Right hand: Pick up the picture of Michael Jackson." \but in fact dont have --config paras
Could you provide the latest inference code that works well?