Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dose it support Chinese song? #6

Open
yangkang2318 opened this issue Jun 29, 2022 · 10 comments
Open

Dose it support Chinese song? #6

yangkang2318 opened this issue Jun 29, 2022 · 10 comments

Comments

@yangkang2318
Copy link

Hello,this project is amazing!But dose it support Chinese song?

@mathigatti
Copy link
Owner

Hey! No, but it's a good idea, I just added that feature :) Please try it and let me know how that goes

@yangkang2318
Copy link
Author

yangkang2318 commented Jun 30, 2022

There is a bug in line 33,and occurs TypeError: 'int' object is not callable.Because when run in python,the "voice_index" is identified as "int" object instead of "voice_index" method.
image

@yangkang2318
Copy link
Author

yangkang2318 commented Jun 30, 2022

And you should modify line 26 in "__main__.py",the last args should be "args["language"]"
image

@yangkang2318
Copy link
Author

yangkang2318 commented Jun 30, 2022

It should be chinese instead of mandarin
image
Beacuse in your code is "chinese"
image

so there should be "chinese".
image

@mathigatti
Copy link
Owner

Oh sorry and thanks for reviewing it, I just pushed some possible fixes I will test it thoroughly tomorrow

@yangkang2318
Copy link
Author

yangkang2318 commented Jun 30, 2022

When using the Chinese song ,the performance is worse.Maybe the file"lyrics_tokenizer.py" caused the results.
Because chinese words doesn't need to tokenize into subwords

@mathigatti
Copy link
Owner

mathigatti commented Jul 1, 2022

ah right, I don't know much about japanese and mandarin. If you can send me a text example and how it should be splitted into notes that would be great.

For example in english, if I have this sentence "hello this is my song" and I have 6 notes it's splitted into something like this: ["he", "llo", "this", "is", "my", "song"], then I assign each text piece to each note.

@mathigatti
Copy link
Owner

mathigatti commented Jul 5, 2022

Do you have some text to try? I added a basic tokenization, given some text like this: 曲项向天歌 it assigns each character to a different note

"曲项向天歌" is converted into this [u'\u66f2', u'\u9879', u'\u5411', u'\u5929', u'\u6b4c']

@yangkang2318
Copy link
Author

I fix this problem,because the musescore decode the midi into .xml is wrong.
Thanks for your help and project!

@mathigatti
Copy link
Owner

Awesome! Do you want to share how you fixed it?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants