instruction-visual-semantic-consistency

An instruction–visual semantic consistency framework that explicitly aligns instructions with visual observations by identifying and preserving landmark regions before visual compression. This project uploads files by improving NaVid-VLN-CE, introducing the instruction-visual-semantic-consistency framework, which further enhances its original level.

In the repository, the train file is the training code, landmark_head is the functional code, and the arch file is the modification of NaVid-VLN-CE, which includes the interface of landmark_head.py.

Notice to Readers We would like to remind readers that the code in this repository is directly associated with our manuscript submitted to The Visual Computer, titled:

"Enhancing Cross-Modal Semantic Alignment for Vision-and-Language Navigation in Continuous Environments"

This repository supports the research presented in the paper and includes implementations that reflect the methods and experiments described in the manuscript.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

instruction-visual-semantic-consistency

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

field789/instruction-visual-semantic-consistency

Folders and files

Latest commit

History

Repository files navigation

instruction-visual-semantic-consistency

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages