The split_word_tokens function lacks documentation, leaving users in the dark about its intended behavior. It handles strings containing only words correctly, but when the input mixes words and symbols it produces inaccurate tokens.

Adding a documentation comment that states the expected behavior would make the intent explicit: if the current handling of mixed input is not intended, the library could be replaced or fixed; if it is intended, the tests should be corrected to match.
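
For illustration only, here is a minimal sketch of what such a documentation comment might look like. The docstring wording, the example inputs, and the placeholder body are assumptions about the intended contract, not the library's actual implementation:

```python
def split_word_tokens(text):
    """Split text into word tokens.

    Assumed contract (to be confirmed): only whitespace-separated
    alphabetic words are tokenized reliably. Inputs that mix words and
    symbols (e.g. "hello, world!") may yield tokens that still contain
    punctuation.

    >>> split_word_tokens("hello world")
    ['hello', 'world']
    """
    # Placeholder body for this sketch; the real implementation lives
    # in the library under review.
    return text.split()
```

Even if the body stays untouched, a docstring along these lines would let reviewers decide whether the mixed-input results are a bug in the function or in the tests that exercise it.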