-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix: NoveltyEvaluator; new EntropyEvaluator #205
Conversation
Merge branch '2.0.0' of https://github.com/guoguibing/librec into 2.0.0 # Conflicts: # core/src/main/java/net/librec/data/convertor/TextDataConvertor.java # core/src/main/java/net/librec/eval/ranking/NoveltyEvaluator.java # core/src/main/resources/driver.classes.props
NoveltyEvaluator complete new...
Sorry for change list: There are only 2 new classes and the config file. Have had problems to sync my fork. (I am new to github) |
@dvelten BTW, we can include your documentation of NMFItemItem into ours(https://www.librec.net/dokuwiki/doku.php). You can email it to me at sunyatong65536@gmail.com after your optimization. Thank you for your contribution ! |
The Implementation of NoveltyEvaluator is not that mentioned in the paper. But it is also a well known and often used Evaluator. But it is more a 'Diversity'-'Entropy' - Evaluator only.
I have now created two Implementations. The new corrected NoveltyEvaluator and an EntropyEvaluator containing an modified version of the old NoveltyEvaluator.
Both Implementation use Entropy or Information to do calculations. The new EntropyEvaluator does not take into account if the item purchased is popular or unpopular in respect of probability purchased.
Instead of this the probability space is different.
NoveltyEvaluator: Probability of item purchased
EntropyEvaluator: Probability of item to be in the recommender result list
Both Implementations could be thought to make the same for loops to sum the Entropy of one result list.
But in the EntropyEvaluator you can do an optimization and rewrite the for loop over users into an for loop over items with the pre-calculated item counts (item probabilities).
Best wishes
Daniel