Add support for Vietnamese sign language #73

nqkhanh2002 · 2024-06-22T06:03:15Z

Description:

This PR adds support for the Vietnamese sign language dataset. Download the required video files manually from this link and place them in the sign_language_datasets/datasets/vn_sign/manual/vn_sign directory.

AmitMY · 2024-06-22T07:32:53Z

Hi! This is not really a dataset; it is only the alphabet.
Would you not want to include a dataset with a full set of Vietnamese signs?

nqkhanh2002 · 2024-06-22T12:36:13Z

I will review the format of the datasets already available in the repo and recreate the dataset accordingly.

nqkhanh2002 · 2024-06-27T12:06:22Z

Hi @AmitMY ,
I have collected about 4000 sign language videos from https://qipedc.moet.gov.vn/dictionary. I have looked at existing datasets such as autsl. The next thing I need to create is the openpose and holistic pose data so that translate can support Vietnamese, right? Thank you for your quick support!

AmitMY · 2024-06-27T13:27:40Z

If your collection is automated, you can update the data loader you wrote so that it downloads these videos.
If you downloaded them some other way, you can use video_to_pose from https://github.com/sign-language-processing/pose to create MediaPipe poses, which you could then use directly in the spoken-to-signed-translation library.
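
As a rough sketch of the second option, the snippet below builds one video_to_pose command per downloaded video. The `--format mediapipe`, `-i`, and `-o` arguments follow the usage shown in the pose repository's README as far as I know, so check them against `video_to_pose --help` for your installed version. The function name `build_pose_commands` and the `videos`/`poses` directory names are placeholders made up for this example.

```python
from pathlib import Path


def build_pose_commands(video_dir, pose_dir):
    """Return one video_to_pose command (as an argument list) per .mp4 file."""
    commands = []
    for video in sorted(Path(video_dir).glob("*.mp4")):
        # Hypothetical naming scheme: same file stem, .pose extension.
        pose_path = Path(pose_dir) / (video.stem + ".pose")
        commands.append([
            "video_to_pose", "--format", "mediapipe",
            "-i", str(video), "-o", str(pose_path),
        ])
    return commands


# Example usage (requires the pose-format package and an existing "poses" directory):
#   import subprocess
#   for command in build_pose_commands("videos", "poses"):
#       subprocess.run(command, check=True)
```

Building the argument lists separately from running them makes it easy to inspect the commands before processing several thousand videos.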

nqkhanh2002 · 2024-06-27T14:24:03Z

Yes @AmitMY, I was able to create a .pose file and export it to a GIF following this notebook https://colab.research.google.com/drive/1UtBmfBIhUa2EdLMnWJr0hxAOZelQ50_9?usp=sharing (image attached), but I still need to add Vietnamese language support to the main pipeline, as you said before:

Add support for this dictionary in sign-language-processing/datasets (Make a PR)

I have seen that the definitions of other datasets (such as autsl) include additional data such as openpose and holistic poses and .poseheader files.
Honestly, after many issues I am still confused about what needs to be done. I would be very grateful for specific instructions. Thank you very much.

AmitMY · 2024-06-27T15:30:48Z

You have two paths:

Managed by sign.mt

If you were to make a dataloader, with all videos (or links to videos) and words in Vietnamese, in a PR here in this repository, I would download and load this data into sign.mt, where poses would be extracted, etc.

Managed by yourself

If you don't want to make a dataloader, or you want to run the translation service yourself, you need to create a lexicon in the https://github.com/sign-language-processing/spoken-to-signed-translation project (for example, https://github.com/sign-language-processing/spoken-to-signed-translation/tree/main/assets/dummy_lexicon)

There, your CSV file will define all paths and words. Your directory will include all pose files you extract yourself.
Then, you could run the commands in that repository and it will generate sentences for you.
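
A minimal sketch of building such a lexicon, assuming the index.csv columns match those of the dummy lexicon. The column names below are written from memory and should be checked against assets/dummy_lexicon/index.csv before use. `vi` is the ISO 639-1 code for Vietnamese; the signed-language code `vsl`, the example word, and the file name in the usage comment are placeholders, and `write_lexicon_index` is a hypothetical helper, not part of the repository.

```python
import csv
from pathlib import Path

# Assumed column layout; verify against assets/dummy_lexicon/index.csv.
FIELDS = ["path", "spoken_language", "signed_language",
          "start", "end", "words", "glosses", "priority"]


def write_lexicon_index(lexicon_dir, entries, spoken_language, signed_language):
    """Write index.csv for (word, pose_file) pairs and return its path.

    Each pose_file is expected to be a path relative to lexicon_dir.
    """
    lexicon_dir = Path(lexicon_dir)
    lexicon_dir.mkdir(parents=True, exist_ok=True)
    index_path = lexicon_dir / "index.csv"
    with index_path.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=FIELDS)
        writer.writeheader()
        for word, pose_file in entries:
            writer.writerow({
                "path": pose_file,
                "spoken_language": spoken_language,
                "signed_language": signed_language,
                "start": 0,     # assumption: 0/0 means "use the whole pose file"
                "end": 0,
                "words": word,
                "glosses": "",  # left empty in this sketch
                "priority": 0,
            })
    return index_path


# Example usage with placeholder values:
#   write_lexicon_index("my_lexicon", [("xin chào", "vsl/xin_chao.pose")], "vi", "vsl")
```

The pose files themselves still need to be copied into the lexicon directory at the relative paths listed in the CSV.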

nqkhanh2002 · 2024-06-29T03:46:24Z

Hi @AmitMY
I have extracted all the necessary poses from the videos and am now editing the file download_lexicon.py. But I see that the existing code for sign_suisse uses something like _POSE_HEADERS from tfds. How can I get the same for my data? Thank you very much.

AmitMY · 2024-06-29T09:08:04Z

If you are going with "Managed by yourself", you don't need to modify download_lexicon.
You just need to create a lexicon (CSV file + folder) the same way the dummy lexicon is set up.
You can use download_lexicon as an example.

nqkhanh2002 · 2024-06-29T09:21:56Z

Thank you @AmitMY
I have looked at it, and I see that the functions that automatically create the index.csv file are already in download_lexicon.py, so I am trying to modify it. My problem is that the step using text_to_gloss fails with the error "Language vi is not supported".
en I found on IANA_TAGS

AmitMY · 2024-06-29T14:47:28Z

Possibly because Vietnamese is not supported by https://github.com/adbar/simplemma.
spaCy does support Vietnamese, so use the spaCy lemmatizer.
I am closing this PR.
If you have issues with that repository, open them there.

Add support for Vietnamese sign language

737e8be

nqkhanh2002 mentioned this pull request Jun 22, 2024

Enhancing Pipeline with Vietnamese Language Support sign/translate#159

Closed

AmitMY closed this Jun 29, 2024
