Ability to specify categories / ranges in spatial markov method #721

stuartlynn · 2015-11-30T20:35:06Z

Right now the spatial markov methods calculate the quantile bins for the markov chain. It would be great to have the option to pass through either a list of bins or categories for the data. So for example

 bins = [0,10, 20 ,30 ,40 ,50 ,60 ,70, 80, 90, 100]
 sm = pysal.Spatial_Markov(data, weights, bins= bins, fixed = True)

or

categories = [ 'republican', 'democrat']
sm = pysal.Spatial_Markov(data, weights, categories= categories, fixed = True)

Where the data for a geometry in the first would be numbers in the range 0 ... 100 which would get mapped to the specified bins and in the second example each step in the input data would be either republican or democrat and the chain would calculate the probabilities of switching between the two.

The text was updated successfully, but these errors were encountered:

ljwolf · 2015-11-30T20:59:01Z

@sjsrey I think this might be done well by breaking off l403-l410 into a _classify() method that could get passed over by providing the categories ahead of time. If we make this a separate callable, it'd also make it easy to subclass into your own classifier method, defining

class YourMarkov(pysal.Spatial_Markov):
... init-ing goes here ...
    def _classify(self, k):
           return np.random.randint(0,k, size=len(self.y))

or what have you.

sjsrey · 2015-11-30T21:15:39Z

@ljwolf yes, adding a general _classify() method would allow for user specified classes.

There are a few things we should keep in mind, however, related to the request from @stuartlynn. First, is that the spatial markov method assumes the attribute is continuous not discrete as it uses the spatial lag. For discrete values, the interpretation of the lag is not straightforward.

Second, the current implement uses quantiles so that the alternative and null hypotheses are clear. H0 is that there is only one kxk transition matrix, H1 has k (kxk) transition matrices (i.e., different transition dynamics for different strata based on the lag class). So as k grows, one has to estimate many more parameters under H1 - which might be an issue depending on sample size.

We could change this so that the number of classes/strata for the lag does not have to be equal to the number of states in the chain.

ljwolf · 2015-11-30T21:36:15Z

Ahh, I see. So, in a naive dichotomous case, you'd need to figure out some reasonable way to ensure that somelag_spatial_categoricalwas intelligible. Seems reasonable to use mode, but ties could get weird... interesting.

ljwolf · 2016-07-16T22:24:17Z

Lag categorical was submitted, but we still need to allow for this abstraction.

ljwolf · 2017-07-11T22:41:36Z

This'll move to giddy and is still of active interest.

sjsrey added this to the Release + 1 milestone Jan 21, 2016

sjsrey added the Enhancement label Jan 21, 2016

ljwolf mentioned this issue Feb 19, 2016

Lag Categorical & Find Bins #745

Merged

ljwolf mentioned this issue Apr 19, 2017

spatial markov when data are already in discrete form #936

Closed

ljwolf closed this as completed Jul 11, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ability to specify categories / ranges in spatial markov method #721

Ability to specify categories / ranges in spatial markov method #721

stuartlynn commented Nov 30, 2015

ljwolf commented Nov 30, 2015

sjsrey commented Nov 30, 2015

ljwolf commented Nov 30, 2015

ljwolf commented Jul 16, 2016

ljwolf commented Jul 11, 2017

Ability to specify categories / ranges in spatial markov method #721

Ability to specify categories / ranges in spatial markov method #721

Comments

stuartlynn commented Nov 30, 2015

ljwolf commented Nov 30, 2015

sjsrey commented Nov 30, 2015

ljwolf commented Nov 30, 2015

ljwolf commented Jul 16, 2016

ljwolf commented Jul 11, 2017