Why does the training code snippet contain a BEV feature, and why is there no language output?

Hi, I have two questions about the codebase:
1. Why does the training code snippet contain a BEV feature? (CarLLaVA only uses RGB images as input.)
https://github.com/OpenDriveLab/ETA/blob/aa510c19cc69b263aa22f7f7bc0014e0a6176a09/carformer/carformer/ponderer.py#L335 

2. Most VLMs sacrifice speed for explainable (language) output, which is meant to improve the reasoning behind, and the robustness of, the predicted waypoints. However, I didn't see any language output in the ETA codebase. Why is that?
