Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Seg fault with char_level=True #6

Open
stevenhillis opened this issue Jun 15, 2021 · 1 comment
Open

Seg fault with char_level=True #6

stevenhillis opened this issue Jun 15, 2021 · 1 comment

Comments

@stevenhillis
Copy link

I have a list of hypotheses and a list of references. Both have one element, a string. The hypothesis string has a length of 115233. The reference string has a length of 141575. I need to compute both WER and CER. When I compute WER with fastwer.score(hyps, refs), everything is normal. But, when I compute CER with fastwer.score(hyps, refs, char_level=True), my program pauses for an unexpected period of a few seconds, then prints "Segmentation fault", and then exits.

@shedar
Copy link

shedar commented Dec 20, 2021

I had a similar issue. After looking at the source code, I guess it's out of memory error. Here it tries to allocate a vector of M*N size of int32 items. In your case, 115234 * 141575 *4 is over 60 gigabytes of memory just for this structure.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants