Does the sam-branch use the Vary initialization for dense OCR? #3

@Ucas-HaoranWei

Description

Hi,
I read your report, and the pipeline looks very similar to Vary. I have a question:
Does the sam-branch use the Vary initialization for dense OCR? In my experiments, the vision latents output by the original SAM are noisy for a text-latent-based LLM.
