Let's try to replicate the 20 Newsgroups classification tutorial from scikit-learn using Anserini:
https://scikit-learn.org/stable/tutorial/text_analytics/working_with_text_data.html
That is, use Anserini to extract the tf-idf vectors that feed the classifiers in scikit-learn.
If anyone is interested in taking this on, I can guide you step by step. The first step is to write a Collection class to index 20 Newsgroups in its raw original format.
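To make the target concrete, here is a rough sketch of the tf-idf weighting that the extracted vectors would need to reproduce. It does not call Anserini at all: the `counts` dictionary and the `tfidf_vectors` helper are hypothetical stand-ins for per-document term frequencies read back from an index. The idf formula is the smoothed variant scikit-learn uses by default, followed by L2 normalization.

```python
import math
from collections import Counter


def tfidf_vectors(doc_term_freqs):
    """Turn per-document term counts into L2-normalized tf-idf vectors.

    doc_term_freqs maps a document id to a {term: count} dict.
    """
    n_docs = len(doc_term_freqs)

    # Document frequency: number of documents that contain each term.
    df = Counter()
    for tf in doc_term_freqs.values():
        df.update(tf.keys())

    # Smoothed idf, matching scikit-learn's default: ln((1 + n) / (1 + df)) + 1.
    idf = {t: math.log((1 + n_docs) / (1 + d)) + 1.0 for t, d in df.items()}

    vectors = {}
    for doc_id, tf in doc_term_freqs.items():
        weights = {t: c * idf[t] for t, c in tf.items()}
        norm = math.sqrt(sum(w * w for w in weights.values()))
        # Documents with no terms get an empty vector instead of dividing by zero.
        vectors[doc_id] = {t: w / norm for t, w in weights.items()} if norm else {}
    return vectors


# Hypothetical term counts, standing in for what an index would return.
counts = {
    "doc1": {"graphics": 2, "card": 1},
    "doc2": {"hockey": 3, "card": 1},
}
vectors = tfidf_vectors(counts)
```

The resulting `{term: weight}` dictionaries could then be passed to scikit-learn's `DictVectorizer` to build the sparse matrix that the classifiers in the tutorial expect.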