Is it possible to use Snorkel for Regression problem? #1072

Open
ff-priya opened this issue Mar 11, 2019 · 4 comments
Labels: feature request, no-stale

Comments

@ff-priya

In my problem, the output is an integer. I have some rules that tell me the range in which that number is likely to fall. For example, if the actual output for some input is 1000, my rules can tell me it should be between 800 and 1200.

I have a lot of data for which I can give an approximate value of the output, but only a very small dataset that gives the exact value. I can write rules that give the range in which the output value will lie.
Can I use Snorkel for this?

@ajratner
Contributor

Hi @ff-priya,

If you are asking whether Snorkel can handle a traditional regression problem, as your title suggests (i.e., predicting values in some continuous range), the answer is no. This is a direction of future work we are pursuing, though, so stay tuned!

If you are asking whether you could have Snorkel output discretized ranges of values (e.g. {"800-1200", "1200-1600"}), then yes, seems like you could indeed do that!

Hope this helps,
Alex
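
As a rough illustration of the discretized-range idea above, here is a minimal sketch in plain Python. The bucket edges, the record fields (`size`, `tier`), and the two rules are all hypothetical, and the majority vote is only a naive stand-in for a learned label model. In later Snorkel releases (0.9 and up), functions like these would typically be wrapped with the `labeling_function` decorator and combined by `LabelModel` with `cardinality` set to the number of buckets.

```python
from bisect import bisect_right
from collections import Counter

ABSTAIN = -1  # Snorkel's convention for "this rule has no opinion"

# Hypothetical bucket edges; bucket k covers [EDGES[k], EDGES[k + 1]).
EDGES = [0, 400, 800, 1200, 1600]
BUCKETS = [f"{lo}-{hi}" for lo, hi in zip(EDGES, EDGES[1:])]


def bucket_of(value):
    """Return the index of the bucket containing value, or ABSTAIN."""
    if not EDGES[0] <= value < EDGES[-1]:
        return ABSTAIN
    return bisect_right(EDGES, value) - 1


# Hypothetical rules: each estimates the output from one field and
# votes for the bucket that estimate falls in.
def lf_size(x):
    return bucket_of(10 * x["size"]) if "size" in x else ABSTAIN


def lf_tier(x):
    estimate = {"small": 300, "medium": 1000, "large": 1500}.get(x.get("tier"))
    return ABSTAIN if estimate is None else bucket_of(estimate)


LFS = [lf_size, lf_tier]


def majority_bucket(x):
    """Naive stand-in for a label model: most common non-abstain vote."""
    votes = [v for v in (lf(x) for lf in LFS) if v != ABSTAIN]
    if not votes:
        return None  # every rule abstained
    return BUCKETS[Counter(votes).most_common(1)[0][0]]
```

For instance, `majority_bucket({"size": 95, "tier": "medium"})` picks the "800-1200" bucket because both rules vote for it. Narrower buckets give finer-grained output but leave fewer rule votes per bucket, so the edges are a design choice to tune against the small exactly-labeled set.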

@github-actions

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.

@rjurney

rjurney commented Nov 7, 2019

@ajratner do you have a sense of how Snorkel might approach this problem, both in terms of writing LFs for buckets at the moment and in terms of Snorkel supporting regression in the future? I guess LFs would return continuous values if Snorkel supported this? At the moment it seems challenging to map LFs to ranges. I guess exploratory data analysis would turn up something?

@vincentschen added the no-stale label Nov 18, 2019
@panoschal

Supposing one wants a float between 0 and 1 as output, I am wondering if this would make sense: assign labels 0 and 1 with LFs as in binary classification and, after training the model, use the attached confidence scores (with return_probs) as the continuous output variable.
This way the LFs can't return floats, so you lose some of the accuracy you could otherwise provide there, but maybe with enough LFs this makes sense.
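
The idea above can be sketched in plain Python as follows. This is only an illustration under stated assumptions: the record fields (`errors`, `uptime`, `complaints`), the thresholds, and the three rules are hypothetical, and the unweighted vote fraction is a crude stand-in for the probabilities a trained label model would return (a real label model would also weight each rule by its estimated accuracy).

```python
ABSTAIN, LOW, HIGH = -1, 0, 1  # binary labels plus Snorkel's abstain value


# Hypothetical binary rules over a dict-like record.
def lf_many_errors(x):
    return HIGH if x.get("errors", 0) > 5 else ABSTAIN


def lf_good_uptime(x):
    return LOW if x.get("uptime", 0.0) > 0.95 else ABSTAIN


def lf_complaints(x):
    return HIGH if x.get("complaints", 0) >= 2 else ABSTAIN


LFS = [lf_many_errors, lf_good_uptime, lf_complaints]


def soft_score(x, lfs=LFS, prior=0.5):
    """Fraction of non-abstaining rules voting HIGH, used as a 0-1 score.

    Falls back to `prior` when every rule abstains.
    """
    votes = [v for v in (lf(x) for lf in lfs) if v != ABSTAIN]
    if not votes:
        return prior
    return sum(votes) / len(votes)
```

Note that a score like this measures confidence that the label is 1 rather than a calibrated estimate of the target value, so it would need checking against the small set of exact values before being treated as a regression output.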


5 participants