Segmentation failed #200

Oliverwang11 · 2024-01-26T17:31:44Z

Hi I try to evaluate the transfuser based agent using./leaderboard/scripts/local_evaluation.sh /home/<usrname>/Desktop/transfuser/carla /home/<username>/Desktop/transfuser on Ubuntu 22.04.3.

But with some errors pop out

/home/<usrname>/Desktop/transfuser/leaderboard/leaderboard/leaderboard_evaluator_local.py:89: DeprecationWarning: distutils Version classes are deprecated. Use packaging.version instead.
if LooseVersion(dist.version) < LooseVersion('0.9.10'):
./leaderboard/scripts/local_evaluation.sh: line 32: 20110 Segmentation fault (core dumped) python3 ${LEADERBOARD_ROOT}/leaderboard/leaderboard_evaluator_local.py --scenarios=${SCENARIOS} --routes=${ROUTES} --repetitions=${REPETITIONS} --track=${CHALLENGE_TRACK_CODENAME} --checkpoint=${CHECKPOINT_ENDPOINT} --agent=${TEAM_AGENT} --agent-config=${TEAM_CONFIG} --debug=${DEBUG_CHALLENGE} --resume=${RESUME}

anyone has some idea?

Thanks

The text was updated successfully, but these errors were encountered:

Kait0 · 2024-01-26T18:52:52Z

Segmentation faults are hard to analyse. I would suggest you use a debugger or print statements to find the line of code that crashes, than we can help you better.

Oliverwang11 · 2024-01-26T21:28:43Z

Thanks I will try!

Oliverwang11 · 2024-01-26T22:14:50Z

It seems like the crash is in self.module_agent = importlib.import_module(module_name) when importing the module_name which is submission_agent.
BTW I run the evaluation in my own computer with a GTX3060 GPU-6GB and 16 GB RAM

Kait0 · 2024-01-28T14:42:17Z

hm that is strange line is just trying to import the agent py file.
Are you using the conda environment from this repository?
WORK_DIR is the work dir variable in the script correct (e.g. does module_name point to the correct file?)
Maybe some second order import problem. If the submission_agent file is executed can you check how far it gets?

Oliverwang11 · 2024-01-28T20:07:35Z

Hi thanks for your reply, the work dir seems fine.
I went deep into the submission_agent.py file it seems like the code crashed at from model import LidarCenterNet when trying to load the LidarCenterNet

Kait0 · 2024-01-28T21:22:16Z

Failing somewhere specific within LidarCenterNet?

You can try commenting this line. It sometimes makes problems since it depends on an external cuda lib. Its optional for an ablation so its fine to turn it off.

transfuser/team_code_transfuser/model.py

Line 11 in 22b3ccd

from point_pillar import PointPillarNet

Oliverwang11 · 2024-01-28T22:07:53Z

Aha after I comment this line from point_pillar import PointPillarNet the crash disappear, but there is another problem pop up
seems like the town map is loaded but the ego car haven't shown.

./leaderboard/scripts/local_evaluation.sh /home/oliverwang/Desktop/transfuser/carla /home/oliverwang/Desktop/transfuser
/home/oliverwang/Desktop/transfuser/leaderboard/leaderboard/leaderboard_evaluator_local.py:89: DeprecationWarning: distutils Version classes are deprecated. Use packaging.version instead.
if LooseVersion(dist.version) < LooseVersion('0.9.10'):
/home/oliverwang/Desktop/transfuser
-----submission_agen line 209
-----submission_agen line 16
-----submission_agen line 18
-----submission_agen line 20
-----submission_agen line 21
-----submission_agen line 209
-----submission_agen line 209
Registering the global statistics

Do you have any clue? Thanks!

Kait0 · 2024-01-28T22:28:14Z

"-----submission_agen line" I suppose these are your debug prints.
I don't see an error here. these are just warnings and prints.
It might be that you need to delete the results.json file, because the code thinks it already finished all routes.

SY-LG · 2024-01-31T09:14:32Z

I came accross this segmentation days before. Check if your cuda and pytorch stuff versions matches.

d at from model import LidarCenterNet when trying to load the LidarCenterNet

To check if you are facing the same situation as mine, you can trace even further, and eventually it would turn out that the segmentation fault take place somewhere irrelevant with transfuser but relevant with pytorch stuff

SY-LG · 2024-01-31T09:18:09Z

BTW, the version relationships are kind of ambiguous for pytorch and mmcv stuffs, sometimes even need to test it yourself.
You can try this setup, it works fine for me.

ubuntu 22.04
nvidia-smi：12.0
sudo apt install -cuda-toolkit：10.1
torch: 1.8.1+cu101
torch-scatter: 2.0.7
torchaudio: 0.8.1
torchvision: 0.9.1
mmcv-full: 1.71
timm: 0.3.2
mmdet: 2.26.0
py-trees: 0.8.3

Kait0 closed this as completed Feb 5, 2024

jingzhengli mentioned this issue May 21, 2024

I met an Error during evaluation. How can I solve it? #215

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Segmentation failed #200

Segmentation failed #200

Oliverwang11 commented Jan 26, 2024 •

edited

Loading

Kait0 commented Jan 26, 2024

Oliverwang11 commented Jan 26, 2024

Oliverwang11 commented Jan 26, 2024 •

edited

Loading

Kait0 commented Jan 28, 2024

Oliverwang11 commented Jan 28, 2024

Kait0 commented Jan 28, 2024

Oliverwang11 commented Jan 28, 2024 •

edited

Loading

Kait0 commented Jan 28, 2024

SY-LG commented Jan 31, 2024

SY-LG commented Jan 31, 2024

Segmentation failed #200

Segmentation failed #200

Comments

Oliverwang11 commented Jan 26, 2024 • edited Loading

Kait0 commented Jan 26, 2024

Oliverwang11 commented Jan 26, 2024

Oliverwang11 commented Jan 26, 2024 • edited Loading

Kait0 commented Jan 28, 2024

Oliverwang11 commented Jan 28, 2024

Kait0 commented Jan 28, 2024

Oliverwang11 commented Jan 28, 2024 • edited Loading

Kait0 commented Jan 28, 2024

SY-LG commented Jan 31, 2024

SY-LG commented Jan 31, 2024

Oliverwang11 commented Jan 26, 2024 •

edited

Loading

Oliverwang11 commented Jan 26, 2024 •

edited

Loading

Oliverwang11 commented Jan 28, 2024 •

edited

Loading