Using SOMPY for text clustering? #65

Closed
Landre opened this issue Sep 22, 2017 · 1 comment

Comments


Landre commented Sep 22, 2017

Hi, I'm new to SOMs and stumbled across your SOMPY library. Your examples work well so far, but I wonder whether I can also use this library to cluster texts, and what the feature vectors for training should look like to achieve that. Do you have a hint or, by any chance, an example? My goal is to use the texts of Grimm's fairy tales and cluster them by the fairy tale they belong to. Given a new input text, I would like the SOM to cluster it so that I can see how it relates to the fairy tales. Is this possible?

Greetings

Owner

sevamoo commented Oct 17, 2017

If you manage to convert your texts into a data matrix, where the rows are your texts and the columns are the chosen features, SOM will do its part. Your question is outside the scope of this library. Take a look at the gensim library's examples for vectorizing text.
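To illustrate the kind of data matrix meant here, below is a minimal sketch that builds a TF-IDF matrix (one row per text, one column per vocabulary term) using only the Python standard library. It is an assumption about one reasonable feature choice, not something prescribed in this thread or by SOMPY. The helper names `tokenize` and `tfidf_matrix` and the toy texts are hypothetical, and a library such as gensim would normally replace this hand-rolled version.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_matrix(texts):
    """Return (vocabulary, matrix); matrix[i][j] is the TF-IDF weight
    of vocabulary[j] in texts[i].

    Uses relative term frequency and a smoothed IDF,
    log((1 + n_docs) / (1 + df)) + 1. This weighting is an assumption
    made for illustration, not a requirement of any SOM library.
    """
    tokenized = [tokenize(t) for t in texts]
    vocabulary = sorted({tok for doc in tokenized for tok in doc})
    n_docs = len(texts)

    # Document frequency: in how many texts does each term occur?
    df = Counter(tok for doc in tokenized for tok in set(doc))

    matrix = []
    for doc in tokenized:
        counts = Counter(doc)
        total = len(doc) or 1  # avoid division by zero for empty texts
        row = []
        for term in vocabulary:
            tf = counts[term] / total
            idf = math.log((1 + n_docs) / (1 + df[term])) + 1.0
            row.append(tf * idf)
        matrix.append(row)
    return vocabulary, matrix


# Hypothetical toy snippets standing in for real fairy-tale passages.
texts = [
    "the wolf met the girl in the forest",
    "the princess kissed the frog",
    "the wolf blew the house down",
]
vocabulary, matrix = tfidf_matrix(texts)
print(len(matrix), "rows x", len(vocabulary), "columns")
```

Each row of `matrix` is the feature vector for one text. Converted to a 2-D numeric array, it has the rows-are-texts, columns-are-features shape described above. A new input text would have to be vectorized against the same vocabulary before it can be compared with the trained map.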

sevamoo closed this as completed Oct 17, 2017