Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question] Why do Document-Based Scatterplots need category? #44

Open
fredguth opened this issue Mar 27, 2019 · 1 comment
Open

[Question] Why do Document-Based Scatterplots need category? #44

fredguth opened this issue Mar 27, 2019 · 1 comment

Comments

@fredguth
Copy link

Sorry to ask via issue tracker, tried to find the answer in the referred arxived article and did not know of any other better channel.

I am trying to figure out how the Document-Based Scatterplot works.

I get that it uses Tf-Idf on unigrams of the text and takes the 2 first unigrams of the vector (the most different terms?) as axis. But what function is applied to each document to find its x-y position? Its "nearess" to each term?

Besides, I don't understand why we need to provide Category in this case. I understood it uses category to colorize the points, but anything else? Because if it's just that, it seems a hard constraint to Document-Based Scatterplot for something one may not need. But I guess I am missing something.

@fredguth
Copy link
Author

Related to the previous question, how can I find out which term was used as axis?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant