Skip to content

word frequency counter#9

Merged
ninpnin merged 1 commit intodevfrom
wf-dict
Jan 15, 2025
Merged

word frequency counter#9
ninpnin merged 1 commit intodevfrom
wf-dict

Conversation

@BobBorges
Copy link
Copy Markdown
Contributor

we recently deployed a swedish dehyphenator tool that can use a word frequency list to aid end-line hyphen correction. This script makes such an ordered wordlist and stores it in a json file

@BobBorges BobBorges requested a review from ninpnin September 19, 2024 14:49
@ninpnin
Copy link
Copy Markdown
Contributor

ninpnin commented Jan 15, 2025

Is this (still) needed? I would imagine there are CLI tools for this, we can grep out "<" and ">" to get rid of the XML tags.

@BobBorges
Copy link
Copy Markdown
Contributor Author

Is this (still) needed? I would imagine there are CLI tools for this, we can grep out "<" and ">" to get rid of the XML tags.

I mean, it's there already and we know it works exactly as we want without trying to parse xml with grep (even I don't do that :D ).

@BobBorges
Copy link
Copy Markdown
Contributor Author

I think we should probably keep a script for this for consistency.

@ninpnin ninpnin merged commit 90120a3 into dev Jan 15, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants