-
Notifications
You must be signed in to change notification settings - Fork 173
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Evaluation result #28
Comments
For the results in Table 1(a) of the paper, models are trained on For the transfuser model trained on Can you tell what hyperparameters you used for training? What changes did you make in the config file? Also, I'd suggest training with different seeds and evaluating multiple times to get an estimate of the variance. |
Thank you for your reply, and for the config file (transfuser/transfuser/config.py), I only changed the root_dir and viz_root. I will download the clear_weather_data to train again, with clear_weather_data, can I use the same training setting as in the transfuser/transfuser/config.py to get the results in Table 1(a)? thank you so much. |
Yes, you should be able to reproduce the results in Table 1(a). Also, make sure that you train with multiple seeds and run multiple evaluations (for Table 1(a), we trained with 3 different seeds and evaluated each seed 3 times). You don't need to run 9 evaluations as we did, but at least 3 training seeds and 1 evaluation for each is desirable. Since, evaluation on Town05_long takes a lot of time, you can also try evaluating on Town05_short. I'll also check again to make sure that there is no other source of variation. |
Thank you so much, I will get back to you when I get the results. |
Hi, I have downloaded the clear_weather_data, when I change the root_dir and viz_root, it still report an error,
It seems that rg_lidar_diag_pl_1_4.npy has saved some old path of yours, should I delete all the rg_lidar_diag_pl_1_4.npy first, and then run the train.py, since I found that if it can't find the rg_lidar_diag_pl file, it will build a new one. Is that right? Thanks. |
yes, you need to regenerate the preload files. |
Dear Authors,
Thank you for your excellent work,
I trained the model with: CUDA_VISIBLE_DEVICES=0 python train.py --id transfuser --batch_size 56
And then I evaluate the model with: run_evaluation.sh
export ROUTES=leaderboard/data/evaluation_routes/routes_town05_long.xml
export TEAM_AGENT=leaderboard/team_code/transfuser_agent.py
export TEAM_CONFIG=transfuser/log/transfuser
export CHECKPOINT_ENDPOINT=results/transfuser_result.json
export SCENARIOS=leaderboard/data/scenarios/town05_all_scenarios.json
Finally, I got the results as
"labels": [
"Avg. driving score",
"Avg. route completion",
"Avg. infraction penalty",
"Collisions with pedestrians",
"Collisions with vehicles",
"Collisions with layout",
"Red lights infractions",
"Stop sign infractions",
"Off-road infractions",
"Route deviations",
"Route timeouts",
"Agent blocked"
],
"values": [
"19.388",
"51.060",
"0.539",
"0.000",
"0.472",
"0.000",
"0.221",
"0.008",
"0.008",
"0.000",
"0.000",
"0.999"
]
Is this Avg. driving score normal as I trained with the 14_weathers_minimal_data?
I noticed that the DS should be around 33% in the paper, would you please give me some suggestions about which part I made mistakes in.
Best
The text was updated successfully, but these errors were encountered: