I want website URLs to be tokenized as a single token (effectively a single noun), so I would expect www.google.com to tokenize as "www.google.com". Instead, the tokenizer splits it apart, as shown below. I am happy to fork this repo and would like to contribute a fix.
Thanks for the repo, by the way. It's useful.
Tokenizer::WhitespaceTokenizer.new.tokenize "www.google.com"
=> ["www", ".", "g","o","o","g","l","e",".","c","o","m"]
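For illustration, here is a minimal sketch of the behavior I have in mind. It does not use or modify the gem: `tokenize_keeping_urls` and `URL_PATTERN` are hypothetical names of my own, and the URL pattern is a deliberately loose assumption, not a full URL grammar.

```ruby
# Hypothetical helper, NOT part of the tokenizer gem.
# Loose URL pattern (an assumption): optional http(s) scheme,
# one or more dot-terminated host labels, a final label of 2+ letters,
# and an optional path.
URL_PATTERN = %r{\A(?:https?://)?(?:[a-z0-9-]+\.)+[a-z]{2,}(?:/\S*)?\z}i

def tokenize_keeping_urls(text)
  # Split on whitespace first, then decide per chunk.
  text.split.flat_map do |chunk|
    if chunk.match?(URL_PATTERN)
      [chunk] # keep the whole URL as a single token
    else
      # Otherwise emit runs of letters/digits and single punctuation marks.
      chunk.scan(/[[:alnum:]]+|[^[:alnum:][:space:]]/)
    end
  end
end

p tokenize_keeping_urls("Visit www.google.com today!")
```

Known limitations of this sketch: a URL followed directly by punctuation (for example "www.google.com." at the end of a sentence) does not match the pattern and is still split, and dotted names such as "file.txt" are treated as URLs. A real fix inside the gem would need to handle both cases.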