WIP/ENH: adding support for categorial factors #527

dengemann · 2012-10-08T14:01:45Z

This relates to our recent discussion on the mailing list

added privat _recode function that internally recodes the x factor
added additional positional argument, a dict that allows the user to specify the remapping done by _recode
i would have prefered a kwarg but this however messes up the ax.plot below. The options i see whithin this approach are a) allowing users to optinoally pass a tuple like (x, x_levels). so no additional positional argument is required and b) explicitly passing a dict with plotting parameters instead of **kwargs

Wdyt?

- added privat _recode function that internally recodes the x factor - added additional positional argument, a dict that allows the user to specify the remapping done by _recode - i would have prefered a kwarg but this however messes up the ax.plot below. The options i see whithin this approach are a) allowing users to optinoally pass a tuple like (x, x_levels). so no additional positional argument is required and b) explicitly passing a dict with plotting parameters instead of **kwargs Wdyt?

jseabold · 2012-10-24T21:02:58Z

statsmodels/graphics/factorplots.py

@@ -4,7 +4,7 @@
 import utils


-def interaction_plot(x, trace, response, func=np.mean, ax=None, plottype='b',
+def interaction_plot(x, trace, response, x_levels, func=np.mean, ax=None, plottype='b',


Are we okay with adding args like this? I don't much mind, but it breaks backwards compatibility.

jseabold · 2012-10-24T21:21:38Z

Do you think we really need the x_levels argument? Couldn't we just check the dtype in the plot and call _recode with some default levels e.g., range(n_unique)? Thoughts?

dengemann · 2012-10-24T22:00:14Z

On 24.10.2012, at 23:21, Skipper Seabold notifications@github.com wrote:

Hi,

Do you think we really need the x_levels argument?

would be happy to drop it -- feels unnatural to me. although on the other hand side it's really explicit --- less magic

Couldn't we just check the dtype in the plot and call _recode with some default levels e.g., range(n_unique)? Thoughts?

yes, makes sense. the thing that made me hesitate with something like this was that users might associate certain levels with 'hierarchical' meaning, which might not always be obvious from looking at the data. Fo instance you might want to put your control condition on zero (left) and your treatment condition on one (right).
makes sense?

D

—
Reply to this email directly or view it on GitHub.

jseabold · 2012-10-24T22:02:14Z

Sure. We can update the ticklabels with the categories though, so this may alleviate some of this - they'll never see the levels. Now if you really want to control treatment on left, etc. you might be better off rolling your own plot?

dengemann · 2012-10-24T22:45:54Z

On 25.10.2012, at 00:02, Skipper Seabold notifications@github.com wrote:

Sure. We can update the ticklabels with the categories though, so this may alleviate some of this - they'll never see the levels.

yes, sure -- setting ticklabels from categorials would rock
Now if you really want to control treatment on left, etc. you might be better off rolling your own plot?

point taken.
—

Reply to this email directly or view it on GitHub.

josef-pkt · 2012-10-25T16:55:19Z

just a generic comment:

It takes me 5 minutes to understand what the argument names mean, even with reading the doc string.

dengemann · 2012-10-25T17:01:56Z

indeed, something got messed up in the doc string.

i'll update the commit in the course of the next days to reflect the current state of the discussion.
thanks!

On 25.10.2012, at 18:55, Josef Perktold notifications@github.com wrote:

just a generic comment:

It takes me 5 minutes to understand what the argument names mean, even with reading the doc string.

—
Reply to this email directly or view it on GitHub.

dengemann · 2012-10-25T17:09:40Z

... or did you refer to the arg names in general (pre-commit)?

On 25.10.2012, at 18:55, Josef Perktold notifications@github.com wrote:

just a generic comment:

It takes me 5 minutes to understand what the argument names mean, even with reading the doc string.

—
Reply to this email directly or view it on GitHub.

josef-pkt · 2012-10-25T17:27:37Z

in general, I think already before your changes.
Mainly I didn't understand what "trace" means, why we have a letter x, but y is "response"

(factor1, factor2, response)
(x1, x2_levels, response)
(endog, exog, groups)

in general: x1 could be continuous if we have continuous-categorical interaction.

I'm reading the function completely out of context and never tried it, so it's not obvious to me what this means, except for the basic doc string example.

I don't have a comment about the pull request directly, since I haven't figured out the levels and labels yet. (busy with other things.)

dengemann · 2012-11-14T18:56:28Z

Closing this one, continued on clean PR.

dengemann added 2 commits October 8, 2012 15:48

ENH/WIP: adding simple example demonstrating categorial factorplots

7c61c31

jseabold reviewed Oct 24, 2012
View reviewed changes

dengemann mentioned this pull request Nov 14, 2012

ENH: support categorial x-axis factors in interaction_plot #569

Closed

dengemann closed this Nov 14, 2012

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP/ENH: adding support for categorial factors #527

WIP/ENH: adding support for categorial factors #527

dengemann commented Oct 8, 2012

jseabold Oct 24, 2012

jseabold commented Oct 24, 2012

dengemann commented Oct 24, 2012

jseabold commented Oct 24, 2012

dengemann commented Oct 24, 2012

josef-pkt commented Oct 25, 2012

dengemann commented Oct 25, 2012

dengemann commented Oct 25, 2012

josef-pkt commented Oct 25, 2012

dengemann commented Nov 14, 2012

WIP/ENH: adding support for categorial factors #527

WIP/ENH: adding support for categorial factors #527

Conversation

dengemann commented Oct 8, 2012

jseabold Oct 24, 2012

Choose a reason for hiding this comment

jseabold commented Oct 24, 2012

dengemann commented Oct 24, 2012

jseabold commented Oct 24, 2012

dengemann commented Oct 24, 2012

josef-pkt commented Oct 25, 2012

dengemann commented Oct 25, 2012

dengemann commented Oct 25, 2012

josef-pkt commented Oct 25, 2012

dengemann commented Nov 14, 2012