option to allow repeats #90

gureckis · 2014-04-23T06:12:56Z

currently psiturk defaults to database-level blocking of workers such that a worker who appears in the database can't take the task again. in some cases this is not desired behavior. we should create a config option 'allow_repeat' which toggles this. also, might be good to allow option 'always_show_instructions' because sometimes if a worker has already done a task once they don't need to read the instructions part again.
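A minimal sketch of how the two proposed options might be read, assuming they live in an INI-style config file. The section name, the `allow_repeats` spelling (the one used later in this thread), and both defaults are assumptions for illustration, not confirmed psiTurk behavior.

```python
import configparser

# Hypothetical config text: the section name and both option names are
# assumptions based on the proposal above, not a confirmed psiTurk layout.
EXAMPLE_CONFIG = """
[HIT Configuration]
allow_repeats = true
always_show_instructions = false
"""


def load_repeat_options(text):
    """Return (allow_repeats, always_show_instructions) as booleans."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    section = "HIT Configuration"
    # Assumed default: keep blocking repeat workers when the option is absent.
    allow_repeats = parser.getboolean(section, "allow_repeats", fallback=False)
    # Assumed default: keep showing instructions when the option is absent.
    always_show = parser.getboolean(
        section, "always_show_instructions", fallback=True
    )
    return allow_repeats, always_show


print(load_repeat_options(EXAMPLE_CONFIG))
```

Falling back to the blocking behavior means an existing config file with neither option would behave as it does today.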

lindauer · 2015-09-16T14:58:38Z

I tried to make this change (at least in a hacky hard-coded way) last night and my changes seemed to have no effect. Upon further investigation, it seems like the blocking might be occurring via a copy of experiment.py on the ad server, rather than the local one that I am changing. Is that an accurate assessment? If so, is there a way to bypass the ad server or otherwise test a change like this?

deargle · 2017-02-08T21:22:51Z

Reopening because it's still not implemented on the psiTurk ad server.

gureckis · 2017-02-09T04:27:10Z

moving discussion of this over here.

i added a new field to the psiturk cloud for 'allow_repeats'. i don't think it broke anything but let me know (please test your branch @deargle). it is a property of ads both in the sandbox and regular ad server now. however the sandbox ad server basically ignores this field because it always lets you repeat (helpful for debugging). if you don't specify the field, the live ad server should assume that you want to block repeats.

now the question is what the behavior should be concerning allow repeats. the logic in this section of the code can be kind of complex with so many different conditions that can come up (in addition, the status codes are affected by the hitId/assignmentId values as well). what do people want?

quick summary: I think there are four conditions:

STARTED = subject started the experiment, meaning they got past the instructions part. psiturk includes a way to indicate this because often instructions include the critical manipulation and once you "START" you can't start over.
COMPLETED = subject is finished with the whole task, displays a thanks page
status >= SUBMITTED: currently they get an error message for trying to repeat. I am thinking this one should be removed if allow_repeats is enabled. but what should happen instead?
either NOT_ACCEPTED or ALLOCATED: participant has not yet agreed to the consent form. they might not have even accepted the HIT yet. this just displays the ad.

i guess the proposal is that everything stays the same except if allow_repeats is turned on you can see the ad if your status >= SUBMITTED.
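A rough sketch of that proposal as a single decision function. The numeric status values, the function name `page_for`, and the returned page names are all assumptions made for illustration; only the four conditions and the `allow_repeats` exception come from the summary above.

```python
from enum import IntEnum


class Status(IntEnum):
    # Numeric values are assumed for illustration; only the ordering
    # (SUBMITTED above COMPLETED above STARTED) matters here.
    NOT_ACCEPTED = 0
    ALLOCATED = 1
    STARTED = 2
    COMPLETED = 3
    SUBMITTED = 4


def page_for(status, allow_repeats):
    """Pick which page to show (page names are hypothetical)."""
    if status >= Status.SUBMITTED:
        # The only branch that changes: repeats see the ad again.
        return "ad" if allow_repeats else "error_already_done"
    if status == Status.COMPLETED:
        return "thanks"
    if status == Status.STARTED:
        # Past the instructions, so no starting over.
        return "cannot_restart"
    # NOT_ACCEPTED or ALLOCATED: just display the ad.
    return "ad"


print(page_for(Status.SUBMITTED, allow_repeats=True))
```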

but is that right? I'm thinking that maybe the issue lies in the psiturk client check_worker_status function. here it looks in your local db for a particular worker and returns information about this person. although the function itself is passed the workerId and assignmentId from psiturk.org, the latter is currently ignored.
https://github.com/NYUCCL/psiTurk/blob/master/psiturk/experiment.py#L180

since i've never done this, the question is whether you get a new assignmentId as well if you do the task multiple times. so really there needs to be something more complex in check_worker_status where if allow_repeats is false, it looks for the workerId ignoring assignmentId and behaves as usual. In contrast, if allow_repeats is true then you always look up workerId and assignmentId, and so repeats with different assignmentIds just act as usual?

phew.
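The lookup rule described above could be sketched like this, using a plain list of dicts in place of the real database table. The record field names and the helper names are hypothetical, and this is not the actual check_worker_status code.

```python
def matching_records(records, worker_id, assignment_id, allow_repeats):
    """Return the rows that count as 'this participant' for the lookup."""
    if allow_repeats:
        # Repeats allowed: a new assignmentId is treated as a fresh start.
        return [
            r for r in records
            if r["workerid"] == worker_id and r["assignmentid"] == assignment_id
        ]
    # Repeats blocked: any earlier row for this worker counts.
    return [r for r in records if r["workerid"] == worker_id]


def is_new_participant(records, worker_id, assignment_id, allow_repeats):
    """True when the lookup finds no matching rows."""
    matches = matching_records(records, worker_id, assignment_id, allow_repeats)
    return len(matches) == 0


# One worker who already submitted assignment A1 (status value is illustrative).
records = [{"workerid": "W1", "assignmentid": "A1", "status": 4}]
print(is_new_participant(records, "W1", "A2", allow_repeats=True))
print(is_new_participant(records, "W1", "A2", allow_repeats=False))
```

With the same table, the first call treats the worker as new because the assignmentId differs, while the second finds the earlier row and does not.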

deargle · 2017-02-09T05:12:08Z

> please test your branch @deargle

Just tested, had great success. ad id if that's helpful.

> i guess the proposal is that everything stays the same except if allow_repeats is turned on you can see the ad if your status >= SUBMITTED.

The way @jhamrick implemented it is nice. She only changed 19 lines in experiment.py to get it working. It's not a simple check for if status >= SUBMITTED, because people who quit early aren't allowed back in, even if repeats are allowed.

> since i've never done this, the question is whether you get a new assignmentId as well if you do the task multiple times. so really there needs to be something more complex in check_worker_status where if allow_repeats is false, it looks for the workerId ignoring assignmentId and behaves as usual. In contrast, if allow_repeats is true then you always look up workerId and assignmentId, and so repeats with different assignmentIds just act as usual?

Her code does just what you're describing. If I understand you.

I'm sure I missed something important in what you said.

gureckis · 2017-02-09T05:13:53Z

the sandbox ad you just posted did not set allow_repeats to true, was that what you expected?

deargle · 2017-02-09T05:18:37Z

Yes. Here's another that does set it to true.

gureckis · 2017-02-09T05:34:09Z

perfect, your tests work well.

yes, @jhamrick's approach has the right flavor, especially the part that selectively checks against either workerId or workerId+assignmentId. my only concern about her code is what happens if you first run the task with allow_repeats false and then true, or vice versa? running against the same table, you'd have to do a bit more error checking to avoid problems.

deargle · 2017-02-09T17:09:12Z

> my only concern about her code is what happens if you first run the task with allow_repeats false and then true, or vice versa?

Switching back and forth should be fine in theory, yeah? Whether allow_repeats is True or False, it checks if the number of matches (workerId or workerId+assignmentId) == 0. This should work regardless of the pattern of past settings for allow_repeats for an experiment.

gureckis · 2017-02-09T17:13:41Z

it's a little confusing to me. if you allow_repeats (so there are multiple copies of the same worker in the db) and then disallow repeats, what status should be returned for the worker when the ad server queries? basically, in the disallow repeats case the check in check_worker_status is whether the worker has done the task before. could there be a situation where the worker accepted but did not complete multiple assignments? in this case they should technically be allowed to run in the new assignment even though the number of matches at the worker level is >0. it's more like the number of matches where the status is above some threshold maybe... these are perhaps unlikely cases but certainly possible.

if allow_repeats = False after a period where it is True then use the highest status associated with that worker as the exclusion criteria.
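A small sketch of that highest-status rule, again over a list of dicts rather than the real table. The field names, the helper names, and the threshold value of 4 (standing in for SUBMITTED) are assumptions for illustration.

```python
# Assumed numeric value for SUBMITTED; illustrative only.
SUBMITTED = 4


def highest_status(records, worker_id):
    """Highest status across all rows for a worker, or None if no rows."""
    statuses = [r["status"] for r in records if r["workerid"] == worker_id]
    return max(statuses, default=None)


def is_excluded(records, worker_id, threshold=SUBMITTED):
    """Exclude only when some earlier row reached the threshold."""
    top = highest_status(records, worker_id)
    return top is not None and top >= threshold


records = [
    # W1 accepted two assignments but never got past status 1.
    {"workerid": "W1", "assignmentid": "A1", "status": 1},
    {"workerid": "W1", "assignmentid": "A2", "status": 1},
    # W2 submitted once, then accepted another assignment.
    {"workerid": "W2", "assignmentid": "A3", "status": 4},
    {"workerid": "W2", "assignmentid": "A4", "status": 1},
]
print(is_excluded(records, "W1"), is_excluded(records, "W2"))
```

Under this rule W1 has more than zero matches at the worker level but is still allowed in, which is the accepted-but-never-completed case raised above.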

deargle · 2017-02-09T21:21:31Z

I think `allow_repeats` permits workers to retake an experiment only if they completed the most recent attempt. That means that check_worker_status should return the status for the most recent db entry for a given worker. At least, that's the way I think it should work. I'll review the code later when not on mobile.


deargle · 2017-02-11T01:31:03Z

Forget everything I said in the previous comment here, there's no checking of the previous submission. Maybe that can be implemented later, but as it stands, it's a fresh start every time. I vote we go ahead with the code as it currently is (including this commit) until moar feature requests are made.

I'll update the docs if you give a 👍

deargle · 2017-02-15T04:03:50Z

@gureckis docs are ready. Is the ad server ready to go --

wait, hold that, I guess the takeaway from this whole conversation is that all the work is done in check_worker_status, and that the ad server doesn't need to do anything with the allow_repeats param sent to it. Oh well, we can pass it anyways.

I'll merge.

gureckis · 2017-02-15T04:05:59Z

I think that is right. The new ad server field isn't terrible to have though in case we want to alter some of the error messages in the future based on the type of experiment it is.

This is one of those features for which making an automated test is a little tricky, i think.

gureckis · 2017-02-15T04:07:47Z

Oh actually that reminds me that in my unfinished branch on this I included something like always_show_instructions, which allows you to configure whether the instruction phase should run every time or only the first time the task is completed. the idea being that on repeats you might want to skip the instruction phase (if you have it). possibly a new feature request, but it naturally ties to the allow repeats issue.

gureckis added the enhancement label Apr 23, 2014

gureckis mentioned this issue Sep 6, 2014

Allow participants to do experiment more than once #143

Closed

jhamrick mentioned this issue Feb 29, 2016

Allow repeat participants #214

Merged

braingineer mentioned this issue Jan 11, 2017

adding easy heroku deployment with ssl support #254

Open

deargle closed this as completed in #214 Feb 8, 2017

deargle reopened this Feb 8, 2017

gureckis added a commit that referenced this issue Feb 9, 2017

Issue #90: proposal for dealing with repeats

3d77ccf

if allow_repeats = False after a period where it is True then use the highest status associated with that worker as the exclusion criteria.

ahafri mentioned this issue Jul 6, 2018

Allow_repeats prevent same condition for a given worker #322

Open

deargle closed this as completed Jun 28, 2019
