Add support for Vietnamese sign language #73
Hi! So this is not really a dataset; it is only the alphabet.
I will try to review the format of the datasets available in the repo and recreate the dataset.
Hi @AmitMY,
If your collection is automatic, you can update the data loader you wrote to download these videos.
Yes @AmitMY, I was able to create a .pose file and render it as a GIF following this notebook: https://colab.research.google.com/drive/1UtBmfBIhUa2EdLMnWJr0hxAOZelQ50_9?usp=sharing (image attached). However, I still need Vietnamese to be supported in the main pipeline, as you mentioned before.
I have seen that the definitions of other datasets include additional data such as OpenPose and Holistic poses and .poseheader files (as in the autsl dataset).
You have two paths:

Managed by sign.mt
If you were to make a dataloader, with all videos (or links to videos) and words in Vietnamese, in a PR here in this repository, I would download and load this data into sign.mt, where poses would be extracted, etc.

Managed by yourself
If you don't want to make a dataloader, or you want to run the translation service yourself, you need to create a lexicon in the https://github.com/sign-language-processing/spoken-to-signed-translation project (for example, https://github.com/sign-language-processing/spoken-to-signed-translation/tree/main/assets/dummy_lexicon). There, your CSV file will define all paths and words. Your directory will include all pose files you extract yourself.
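For the "Managed by yourself" path, a lexicon index could be generated along these lines. This is only a minimal sketch: the column names are an assumption modeled on the dummy lexicon and should be checked against its index CSV, the `build_index` helper is hypothetical, and the convention that each pose file is named after the word it signs is also an assumption.

```python
import csv
from pathlib import Path

# Assumed column layout; verify against the dummy lexicon's CSV before use.
COLUMNS = ["path", "spoken_language", "signed_language",
           "start", "end", "words", "glosses", "priority"]


def build_index(lexicon_dir, spoken_language, signed_language):
    """Write <lexicon_dir>/index.csv with one row per .pose file found.

    Hypothetical helper: assumes each file is named after the word it signs,
    e.g. "xin_chao.pose" for the word "xin_chao".
    """
    lexicon_dir = Path(lexicon_dir)
    rows = []
    for pose_file in sorted(lexicon_dir.rglob("*.pose")):
        word = pose_file.stem
        rows.append({
            # Path is stored relative to the lexicon directory.
            "path": pose_file.relative_to(lexicon_dir).as_posix(),
            "spoken_language": spoken_language,
            "signed_language": signed_language,
            "start": 0,  # placeholder timing values (assumption)
            "end": 0,
            "words": word,
            "glosses": word.upper(),  # placeholder gloss (assumption)
            "priority": 0,
        })

    with open(lexicon_dir / "index.csv", "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=COLUMNS)
        writer.writeheader()
        writer.writerows(rows)
    return rows
```

The language codes are passed in by the caller rather than hard-coded, since the right code for the signed language depends on what the translation project expects.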
Hi @AmitMY
If you are going by "Managed by yourself", you don't need to modify
Thank you @AmitMY
Possibly because it is not supported by https://github.com/adbar/simplemma
Description:
This PR adds support for the Vietnamese sign language dataset. Download the required video files manually from this link and place them in the sign_language_datasets/datasets/vn_sign/manual/vn_sign directory.
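Before loading the dataset, it may help to confirm that the manual download landed in the right place. The sketch below is not part of the PR: the `list_manual_videos` helper and the set of video extensions are assumptions made for illustration, and only the directory path comes from the description above.

```python
from pathlib import Path

# Directory named in the PR description (relative to the repository root).
MANUAL_DIR = Path("sign_language_datasets/datasets/vn_sign/manual/vn_sign")

# Assumed set of video extensions; adjust to match the downloaded files.
VIDEO_EXTENSIONS = {".mp4", ".mov", ".avi", ".webm"}


def list_manual_videos(manual_dir=MANUAL_DIR):
    """Return the sorted video files under manual_dir, or raise if it is missing."""
    manual_dir = Path(manual_dir)
    if not manual_dir.is_dir():
        raise FileNotFoundError(
            f"Manual download directory not found: {manual_dir}. "
            "Download the videos and place them there first."
        )
    return sorted(
        p for p in manual_dir.rglob("*")
        if p.is_file() and p.suffix.lower() in VIDEO_EXTENSIONS
    )
```

An empty result would suggest the files were placed in a different directory or use an extension not listed in `VIDEO_EXTENSIONS`.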