Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CATE vs Propensity #6

Closed
soliverc opened this issue Jan 6, 2020 · 5 comments
Closed

CATE vs Propensity #6

soliverc opened this issue Jan 6, 2020 · 5 comments

Comments

@soliverc
Copy link

soliverc commented Jan 6, 2020

I am just following the tutorial, and I have generated two columns: CATE and Propensity. The tutorial recommends selecting users with a high uplift score which is CATE.

Is the Propensity column any use to us at all? Or can I just disregard it? The propensity may be a positive number, and CATE could be negative some times. I'm not sure how to interpret the scores when this happens.

@Minyus
Copy link
Owner

Minyus commented Jan 7, 2020

Propensity score is used to compute CATE (uplift score).
The range of propensity score, which is estimated probability to be treated, is between 0 and +1.
CATE is difference of probability values, so the range is between -1 and +1.

You can find explanation at:
https://github.com/Minyus/causallift/blob/develop/README.md

@Jami1141
Copy link

Is it necessary to calculate propensity?
I have an A/B test, therefore, I know which samples are treated and which are not.
Later I plan to use Causallift model for later predictions on new data. If I do not need propensity for now since I use A/B test, do I need it for prediction?
May I ask you to explain what is this propensity for and what does it mean?

Thanks

@Minyus
Copy link
Owner

Minyus commented Mar 21, 2020

For A/B test (RCT) data, propensity score estimation is not needed, so you can set enable_ipw False.

CausalLift(train_df, test_df, enable_ipw=False)

For observational data (data not from A/B Test or RCT), treatment should have been chosen based on a different probability (propensity score) for each sample, so IPW (Inverse Probability Weighting) using propensity score can be used optionally.

CausalLift(train_df, test_df, enable_ipw=True)

@Jami1141
Copy link

Thanks for your respond. So it means when I use A/B test data for training the model, I do not need to have propensity but later when I want to use mode for later prediction I have to put enable_ipw= True
That is true?

@Minyus
Copy link
Owner

Minyus commented Mar 22, 2020

That is true?

No.

I added explanation in the following sections in README.md.

https://github.com/Minyus/causallift#how-causallift-works
https://github.com/Minyus/causallift#how-to-run-inferrence-prediction-of-cate-for-new-data-with-treatment-and-outcome-unknown

enable_ipw flag is used only during training.

@Minyus Minyus closed this as completed Mar 28, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants