Add Multinomial and Bernoulli Naive Bayes algorithms #183

sgrigory · 2021-11-29T22:41:41Z

Currently linfa-bayes crate contains Gaussian Naive Bayes algorithm. It should not be very difficult to add other kinds of Naive Bayes present in sklearn:

Multinomial Naive Bayes
Bernoulli Naive Bayes

For a new algorithm one needs to reimplement methods joint_log_likelihood and update_feature_log_prob and the hyperparameters - the rest of the code stays more or less the same.

I have created a draft implementation of Multinomial Naive Bayes in this branch, based on the current code of Gaussian Naive Bayes and the sklearn implementation of MultinomialNB. At the moment a large part of code is copy-pasted from Gaussian Naive Bayes, but it is possible to refactor both to deduplicate the shared code.

Would you consider this a useful feature to have? If yes, I can finalise the draft and open a PR .

@VasanthakumarV @bytesnake, tagging you since you authored and reviewed the original Gaussian Naive Bayes implementation in #51

The text was updated successfully, but these errors were encountered:

bytesnake · 2021-11-30T09:16:12Z

Hi @sgrigory,

It should not be very difficult to add other kinds of Naive Bayes present in sklearn

no not really, the question is rather how we want to add them. The type system should allow us to be generic over the distribution, there are some distribution libraries in Rust but few with MAP estimation.

For a new algorithm one needs to reimplement methods joint_log_likelihood and update_feature_log_prob and the hyperparameters - the rest of the code stays more or less the same.

sounds like a good candidate for a trait

I have created a draft implementation of Multinomial Naive Bayes in this branch, based on the current code of Gaussian Naive Bayes [..]

👍

Would you consider this a useful feature to have? If yes, I can finalise the draft and open a PR .

yes, we would accept such a PR. To be really useful, we have to figure out how-to

handle mixed-type datasets
handle distributions with different parametrizations

It may therefore be refactored in the future, but nevertheless we will accept such a PR gladly :)

yuancc06 · 2022-08-02T10:48:04Z

Hi. I have tried multinomial naive bayes and it works very well in predicting the correct result. However, in some cases I need to get the joint likelihood for further calculations, but I cannot get those numbers because the corresponding function is in pub(crate). I wonder if the developers have plans to make the likelihood/probability function public. Thank you.

YuhanLiin · 2022-08-07T01:28:20Z

That would require making the NaiveBayes trait public. I'd accept a PR which does this, but with the other method hidden from the docs so that people don't rely on the traits for things other than joint likelihood.

sgrigory mentioned this issue Jan 11, 2022

Add Multinomial Naive Bayes to linfa-bayes #190

Merged

wildart mentioned this issue Jun 9, 2022

Bernoulli Naive Bayes #226

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Multinomial and Bernoulli Naive Bayes algorithms #183

Add Multinomial and Bernoulli Naive Bayes algorithms #183

sgrigory commented Nov 29, 2021 •

edited by YuhanLiin

Loading

bytesnake commented Nov 30, 2021

yuancc06 commented Aug 2, 2022

YuhanLiin commented Aug 7, 2022

Add Multinomial and Bernoulli Naive Bayes algorithms #183

Add Multinomial and Bernoulli Naive Bayes algorithms #183

Comments

sgrigory commented Nov 29, 2021 • edited by YuhanLiin Loading

bytesnake commented Nov 30, 2021

yuancc06 commented Aug 2, 2022

YuhanLiin commented Aug 7, 2022

sgrigory commented Nov 29, 2021 •

edited by YuhanLiin

Loading