Extending Large Language Models to Multimodality for non-English Languages

This codebase provides the scripts that were used in the paper. The pipeline directory contains a folder for each main step of the pipeline, that are:

The data, models and reports folders are supposed to be used to keep data after processing, models after training and evaluation results.

You will find a README markdown file in each folder which explains what has been done and how to use the code.

However, please note that our experiments were done using Singularity for containerization and increased reproducibility, therefore, most scripts are launched using it. We recommend installing Singularity on the system you want to test the codebase on.

Finally, git-lfs was also used to download data, we also recommend installing it.

If you are interested in data and models, you can find them in this collection.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
data		data
models		models
pipeline		pipeline
reports		reports
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Extending Large Language Models to Multimodality for non-English Languages

About

Uh oh!

Releases

Packages

Languages

License

swapUniba/LVLMs-NonEnglish

Folders and files

Latest commit

History

Repository files navigation

Extending Large Language Models to Multimodality for non-English Languages

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages