### Motivation

Very simple motivation: 
- When agents choose between two options, do stronger preferences imply faster choices?
- Can we test this?
- If not, what models can we use to describe their behavior?
- Can we test those models too?

### Time-Preference Monotonicity [to be renamed]

Suppose an agent chooses between alternatives in $X$, and that choosing between $x$ and $y$ takes a random amount of time $\tau_{xy}$. For any $x, y \in X$:

- let $F^{xy}(t)$ be the probability that the agent makes a choice by time $t$.

- let $p^{xy}(t)$ be the probability that the agent picks $x$ conditional on stopping at time $t$.

- let $p(x, y)$ denote the marginal probability that the agent picks $x$ over $y$.

**Time-Preference Monotonicity Axiom v1 (needs a better name)** 
For alternatives $x, y, z \in X$, suppose without loss of generality that $p(x, y), p(x, z) \ge 0.5$. Then $p(x, y) \ge p(x, z)$ implies $F^{xy}(t) \ge F^{xz}(t)$ for all $t \ge 0$.

**Time-Preference Monotonicity Axiom v2 (stronger version)** 
For alternatives $x, w, y, z \in X$, suppose without loss of generality that $p(x, y) \ge p(w, z) \ge 0.5$. Then $F^{xy}(t) \ge F^{wz}(t)$ for all $t \ge 0$.

Intuitively, v1 says that if the agent prefers $x$ to $y$ more strongly than they prefer $x$ to $z$, they also choose between $x$ and $y$ more quickly. In particular, $\tau_{xz} \succcurlyeq \tau_{xy}$ (weak first-order stochastic dominance).

We aim to make a few contributions:

1. Show that common choice models satisfy this axiom
    - The DDM does, we think
2. Develop simple, frequentist tests of this axiom. This is simple except for the multiple testing problem.
    - This test will ideally quantify deviations from the axiom, because in the real world the null will definitely be false
    - This is pretty simple and, for some alternatives, may have more power in finite samples than nonparametric competitors
3. Extend common models to accommodate some violations of this axiom
    - In particular, we want to extend the DDM
4. Develop tests for those extensions as well
    - This is to-do!
    
    
Other models:
  - Accumulator models
  - Full DDM
      - Boundary constant, randomized starting point, randomized drift, random delay

### Characterization of DDM Models

The DDM from [Prof. Strzalecki's paper](https://www.pnas.org/content/117/52/33141.short) assumes we have utilities $u : X \to \mathbb{R}$ and a boundary $b : \mathbb{R}_{+} \to \mathbb{R}_{+}$ such that

$$p^{xy}(t) = p^*(t, u(x) - u(y), b) $$
$$F^{xy}(t) = F^*(t, u(x) - u(y), b). $$

Note the boundary and the volatility are fixed over different pairs of alternatives $x, y \in X$.
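To get a feel for these objects, here is a minimal simulation sketch. It assumes a constant boundary, a start at zero, and an Euler discretization, none of which come from the paper; `simulate_ddm`, `empirical_summary`, `cdf_at`, and all parameter values are hypothetical names and choices made for illustration.

```python
import math
import random

def simulate_ddm(drift, boundary=1.0, alpha=1.0, dt=0.01, t_max=20.0, rng=random):
    """Simulate one Euler path of Z_t = drift*t + alpha*B_t until |Z_t| >= boundary.

    Returns (chose_x, stopping_time); chose_x is None if no boundary is hit by t_max.
    Assumes a constant boundary (an illustrative simplification).
    """
    z, t = 0.0, 0.0
    step_sd = alpha * math.sqrt(dt)
    while t < t_max:
        z += drift * dt + step_sd * rng.gauss(0.0, 1.0)
        t += dt
        if abs(z) >= boundary:
            return z > 0, t
    return None, t_max

def empirical_summary(drift, n_paths=2000, seed=0, **kwargs):
    """Return (estimated choice probability, sorted stopping times, n_paths)."""
    rng = random.Random(seed)
    results = [simulate_ddm(drift, rng=rng, **kwargs) for _ in range(n_paths)]
    stopped = [(c, t) for c, t in results if c is not None]
    p_hat = sum(c for c, _ in stopped) / len(stopped)
    times = sorted(t for _, t in stopped)
    return p_hat, times, n_paths

def cdf_at(times, n_paths, t):
    """Empirical F(t): share of all simulated paths that stopped by time t."""
    return sum(s <= t for s in times) / n_paths

# Hypothetical utility differences: a "strong" and a "weak" preference.
p_strong, times_strong, n = empirical_summary(drift=1.0)
p_weak, times_weak, _ = empirical_summary(drift=0.2)
print("choice probabilities:", p_strong, p_weak)
print("F(1.0):", cdf_at(times_strong, n, 1.0), cdf_at(times_weak, n, 1.0))
```

Under these assumptions, the larger utility difference should show both a higher choice probability and a larger share of decisions made by any given time.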

**Theorem**: If $p, F$ admit a DDM representation, then they satisfy the Time-Preference Monotonicity Axiom (TPMA).

Proof: in three steps.

*Claim 1 (trivial claim)*: $p(x, y) \ge p(x, z)$ if and only if $\delta_{xy} = u(x) - u(y) \ge u(x) - u(z) = \delta_{xz}$.

Proof: Let $Z_t^{xy} = \delta_{xy} t + \alpha B_t$, where $\alpha$ is the volatility and $B_t$ is a standard Brownian motion. The process $Z_t^{xy}$ *jointly* stochastically dominates the process $Z_t^{xz} = \delta_{xz} t + \alpha B_t$: driving both with the same $B_t$ gives a (trivial) coupling such that $\delta_{xy} t + \alpha B_t \ge \delta_{xz} t + \alpha B_t$ for all $t$. So if the agent chooses $x$ over $z$, then they also choose $x$ over $y$.

*Claim 2*: $\delta_{xy} \ge \delta_{xz}$ if and only if $|Z_t^{xy}| \succcurlyeq |Z_t^{xz}|$ for each fixed $t$.


Proof: note $\delta_{xy} \ge \delta_{xz} \ge 0$ by assumption, so $|\delta_{xy}| \ge |\delta_{xz}|$. The rest is just properties of folded normals.

Marginally, $Z_t^{xy} \sim \mathcal{N}(\delta_{xy} t, \alpha^2 t)$. It suffices to show that if $Z \sim \mathcal{N}(\mu, \sigma^2)$, then $\mathbb{P}(|Z| > c)$ is increasing in $|\mu|$ for every $c > 0$. Equivalently, we can show the CDF of $|Z|$ is decreasing in $|\mu|$.

The CDF of a folded normal at $c > 0$ is
$$\frac{1}{2} \left[\text{erf}\left(\frac{c+\mu}{\sigma \sqrt{2}} \right) + \text{erf}\left(\frac{c-\mu}{\sigma \sqrt{2}} \right) \right] $$

We can take $\sigma = 1 / \sqrt{2}$ without loss of generality (rescale $c$ and $\mu$).

The derivative with respect to $\mu$ is
$$\propto \exp(-(c+\mu)^2) - \exp(-(c-\mu)^2) $$

This is negative when $\mu > 0$ and positive when $\mu < 0$, so the CDF is decreasing in $|\mu|$.
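As a quick numerical sanity check of this monotonicity, the sketch below evaluates the folded-normal CDF formula above at a fixed threshold for several values of $\mu$. The function name `folded_normal_cdf` and the specific values of `c`, `sigma`, and `mus` are illustrative choices, not part of the argument.

```python
import math

def folded_normal_cdf(c, mu, sigma):
    """P(|Z| <= c) for Z ~ N(mu, sigma^2) and c >= 0, using the erf formula."""
    s = sigma * math.sqrt(2.0)
    return 0.5 * (math.erf((c + mu) / s) + math.erf((c - mu) / s))

c, sigma = 1.0, 1.0                 # arbitrary illustrative threshold and scale
mus = [0.0, 0.5, 1.0, 2.0]          # increasing |mu|
cdf_values = [folded_normal_cdf(c, mu, sigma) for mu in mus]

for mu, value in zip(mus, cdf_values):
    print(f"mu = {mu:>4}: P(|Z| <= {c}) = {value:.4f}")
```

The printed values should fall as $|\mu|$ grows, and flipping the sign of $\mu$ leaves each value unchanged.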

*This claim isn't quite strong enough to prove the stopping time part.*


*Claim 3*: We can construct a coupling of $Z^{xy}$ and $Z^{xz}$ such that $|Z^{xy}| \ge |Z^{xz}|$ for all $t$ simultaneously.

Sketch:

1. Brownian motion is the limit of a discrete-time Gaussian process. For example, suppose we have $Z^{xy}_{2 \epsilon} = Z^{xy}_{\epsilon} + \mathcal{N}(\delta_{xy} \epsilon, \alpha^2 \epsilon)$, etc. So we'll start by constructing a version which is coupled such that at the discrete time points $\epsilon, 2 \epsilon, 3 \epsilon, \dots$, we have $|Z^{xy}_{k \epsilon}| \ge |Z^{xz}_{k \epsilon}|$ for all $k$.



2. Fix $\epsilon$. We can ensure that $|Z^{xy}_{\epsilon}| \ge |Z_{\epsilon}^{xz}|$ by the previous analysis. 

AHA! What if:
$$\mathbb{P}\left(|Z^{xy}_{\epsilon} + N(\delta, \epsilon)| \ge c \,\middle|\, |Z_{\epsilon}^{xy}|\right)$$
is increasing in $|Z_{\epsilon}^{xy}|$?

This is a mixture of folded normals. It's conceivable the mixture stochastically dominates a regular folded normal.

(Note that $|Z_{\epsilon}^{xy} + N(0, \epsilon)|$ is distributed as a folded normal with location parameter $|Z_{\epsilon}^{xy}|$, no matter the sign of $Z_{\epsilon}^{xy}$.)

3. Take the limit as $\epsilon \to 0$: this is easier said than done, but we'll muddle through. We have a few options:
    
    - We can use the same tools that people used to prove the existence of Brownian motion (wavelets, etc)
    
    - We can be clever with CDFs or something to show that as $\epsilon \to 0$, the probability that $Z^{xz}$ crosses the boundary before $Z^{xy}$ goes to zero. I actually prefer this option.
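To see why Claim 3 needs more than the trivial coupling from Claim 1, the sketch below drives two discretized processes with the same Gaussian shocks. It is an illustration under assumptions (constant boundary, Euler steps, hypothetical drifts 1.0 and 0.2), and `coupled_exit` is a made-up helper name. With shared shocks the higher-drift path always sits weakly above the lower-drift path, which orders the choices, but when the shocks are negative the lower-drift path reaches the lower boundary first, so absolute values are not ordered path by path.

```python
import math
import random

def coupled_exit(drift_hi, drift_lo, boundary=1.0, alpha=1.0, dt=0.01,
                 n_steps=5000, rng=random):
    """Run both processes on the SAME shocks; freeze each one when it exits.

    Returns ((choice_hi, step_hi), (choice_lo, step_lo)), where choice is
    +1 (upper boundary), -1 (lower boundary), or 0 if no exit within n_steps.
    """
    z_hi = z_lo = 0.0
    out_hi = out_lo = None
    sd = alpha * math.sqrt(dt)
    for k in range(1, n_steps + 1):
        shock = sd * rng.gauss(0.0, 1.0)
        if out_hi is None:
            z_hi += drift_hi * dt + shock
            if abs(z_hi) >= boundary:
                out_hi = (1 if z_hi > 0 else -1, k)
        if out_lo is None:
            z_lo += drift_lo * dt + shock
            if abs(z_lo) >= boundary:
                out_lo = (1 if z_lo > 0 else -1, k)
        if out_hi is not None and out_lo is not None:
            break
    return out_hi or (0, n_steps), out_lo or (0, n_steps)

rng = random.Random(1)
pairs = [coupled_exit(1.0, 0.2, rng=rng) for _ in range(1000)]

# Claim 1 coupling: whenever the low-drift path picks x, so does the high-drift path.
choice_order_ok = all(hi[0] == 1 for hi, lo in pairs if lo[0] == 1)

# Share of paths on which the LOW-drift process stops strictly earlier.
lo_stops_first = sum(lo[1] < hi[1] for hi, lo in pairs if lo[0] != 0) / len(pairs)

print("choices ordered on every path:", choice_order_ok)
print("share of paths where the low-drift process stops first:", lo_stops_first)
```

A positive share in the last line is consistent with the remark that the fixed-$t$ folded-normal comparison is not strong enough, and that the coupling in Claim 3 has to be built differently from the shared-noise one.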

### Tests of the axiom

#### The simplest test(s)

1. Once you know $p(x, y) \ge p(x, z) \ge 0.5$ with confidence, this is just a test of stochastic dominance. There are a myriad of ways to get $p$-values for stochastic dominance tests:
    - $t$-test, because $\tau_{xz} \succcurlyeq \tau_{xy}$ implies the one-sided null $\mathbb{E}[\tau_{xy}] \le \mathbb{E}[\tau_{xz}]$.
    - more sophisticated tests https://www.jstor.org/stable/3082041?seq=1

2. You can test this for all $x, y, z \in X$ (or random subsets if that's too computationally expensive) and then apply your favorite multiple testing correction to these $p$-values

3. There are also other clever things you can do to narrow down which trios to test 
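The recipe above can be sketched as follows. This is only an illustration: it uses a Welch-type statistic with a normal approximation (a stand-in for a proper $t$ reference distribution), a hand-written Holm step-down correction as one possible multiple testing correction, and made-up response times. The names `one_sided_welch_p`, `holm_reject`, and `trios` are hypothetical.

```python
import math
from statistics import NormalDist, mean, variance

def one_sided_welch_p(times_xy, times_xz):
    """Approximate p-value for H0: E[tau_xy] <= E[tau_xz] vs H1: E[tau_xy] > E[tau_xz].

    Uses a normal approximation to the Welch statistic (an assumption made
    to keep the sketch dependency-free).
    """
    se = math.sqrt(variance(times_xy) / len(times_xy)
                   + variance(times_xz) / len(times_xz))
    z = (mean(times_xy) - mean(times_xz)) / se
    return 1.0 - NormalDist().cdf(z)

def holm_reject(p_values, level=0.05):
    """Holm step-down procedure; returns rejection flags in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    reject = [False] * m
    for rank, i in enumerate(order):
        if p_values[i] <= level / (m - rank):
            reject[i] = True
        else:
            break
    return reject

# Made-up response times (seconds) for trios with p(x, y) >= p(x, z) >= 0.5.
# Each entry holds (times for the x-vs-y choice, times for the x-vs-z choice).
trios = {
    ("a", "b", "c"): ([0.8, 0.9, 1.0, 1.1, 0.7, 0.95], [1.2, 1.3, 1.1, 1.4, 1.25, 1.35]),
    ("a", "c", "d"): ([2.0, 2.2, 2.1, 1.9, 2.3, 2.15], [1.0, 1.1, 0.9, 1.2, 1.05, 0.95]),
}
p_values = [one_sided_welch_p(xy, xz) for xy, xz in trios.values()]
rejections = holm_reject(p_values)

for trio, p, r in zip(trios, p_values, rejections):
    print(trio, f"p = {p:.4f}", "reject" if r else "do not reject")
```

In this toy data the first trio is consistent with the axiom (the $x$ vs $y$ choice is faster) and the second is not, so only the second should be flagged.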

#### Possibly clever-er tests

1. Idea: define an estimand which equals $0$ under the null. Then, estimate it. Here are some interpretable but probably mathematically inconvenient examples:


- $$\max_{x,y,z \in X \text{ s.t. } p(x,y) \ge p(x,z) \ge 0.5} \; \max_{t \in \mathbb{R}_{+}} \left[ F^{xz}(t) - F^{xy}(t) \right] $$
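For a single trio with $p(x, y) \ge p(x, z) \ge 0.5$, a plug-in version of the inner maximum can be sketched as below. It replaces each $F$ with an empirical CDF and searches over the observed time points, where the step functions can change. The helper names `ecdf` and `max_violation` are hypothetical, and the plug-in maximum is biased upward in finite samples, so this is a starting point rather than a finished estimator.

```python
from bisect import bisect_right

def ecdf(sample):
    """Return the empirical CDF of a sample as a function of t."""
    data = sorted(sample)
    return lambda t: bisect_right(data, t) / len(data)

def max_violation(times_xy, times_xz):
    """Plug-in estimate of max_t [F^{xz}(t) - F^{xy}(t)].

    The difference is 0 at t = 0, so the estimate is floored at 0.
    """
    F_xy, F_xz = ecdf(times_xy), ecdf(times_xz)
    grid = sorted(set(times_xy) | set(times_xz))
    return max(0.0, max(F_xz(t) - F_xy(t) for t in grid))

# Toy stopping times: the first call is consistent with the axiom, the second is not.
consistent = max_violation([1.0, 2.0, 3.0], [4.0, 5.0, 6.0])
violating = max_violation([2.0, 4.0], [1.0, 3.0])
print(consistent, violating)
```

Taking the outer maximum over trios would then amount to applying `max_violation` to each qualifying trio and keeping the largest value.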

### Classes of models which accommodate violations of this axiom

Basic idea: in addition to utility, there's a parameter governing how difficult it is to gain information about a particular alternative.

**Broader Axiom [this really needs a name]:** Let $\rho : X \times X \to [0,1]$ be the pairwise stochastic choice function and let $F^{xy} : \mathbb{R}_{+} \to [0,1]$ give the probability of making a choice by time $t$, for each $x, y \in X$. This axiom states that there exist functions

$$u : X \to \mathbb{R} $$
$$a : X \to \mathbb{R}_{+} $$

such that

\begin{equation}
\rho(x, y) \ge \rho(w, z) \Leftrightarrow u(x) - u(y) \ge u(w) - u(z) 
\end{equation}

and

\begin{equation}
F^{xy}(t) \ge F^{wz}(t) \ \ \forall t \ge 0 \Leftrightarrow \frac{|u(x) - u(y)|}{a(x) + a(y)} \ge \frac{|u(w) - u(z)|}{a(w) + a(z)}
\end{equation}


*Note: the functional form of this axiom might change a bit.*
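A small numerical illustration of how the two conditions can pull apart: the sketch below uses made-up values of $u$ and $a$ (all numbers and the helper name `speed_index` are hypothetical) for which the pair $(x, y)$ has the larger utility gap but the smaller ratio, so under this axiom it would be chosen more decisively yet more slowly than $(w, z)$.

```python
def speed_index(u, a, x, y):
    """The ratio |u(x) - u(y)| / (a(x) + a(y)) that orders decision speed."""
    return abs(u[x] - u[y]) / (a[x] + a[y])

# Hypothetical utilities and information-difficulty parameters.
u = {"x": 2.0, "y": 0.0, "w": 1.0, "z": 0.0}
a = {"x": 3.0, "y": 3.0, "w": 0.5, "z": 0.5}

gap_xy = u["x"] - u["y"]                  # orders rho(x, y) against rho(w, z)
gap_wz = u["w"] - u["z"]
index_xy = speed_index(u, a, "x", "y")    # orders F^{xy} against F^{wz}
index_wz = speed_index(u, a, "w", "z")

print("utility gaps:", gap_xy, gap_wz)
print("speed indices:", index_xy, index_wz)
```

Here the choice ranking and the speed ranking disagree, which is exactly the kind of pattern the earlier monotonicity axiom (v2) rules out.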

- What if acquiring information about some objects is so costly that people just give up on them?

- This axiom is only valuable if the previous axiom is violated

    - See in what directions your first axiom is violated
    - This will inform your speculation on what model will accommodate this violation
    - In stochastic transitivity section, look at $3.38$, think about distance metrics

**DDM Model which (less trivially) satisfies this axiom**: 

Extending the DDM from [Prof. Strzalecki's paper](https://www.pnas.org/content/117/52/33141.short), suppose we have utilities $u : X \to \mathbb{R}$, a boundary $b : \mathbb{R}_{+} \to \mathbb{R}_{+}$, and alternative-specific volatilities $\alpha^x$ such that

$$p^{xy}(t) = p^*(t, u(x) - u(y), b, \alpha^x + \alpha^y) $$
$$F^{xy}(t) = F^*(t, u(x) - u(y), b, \alpha^x + \alpha^y). $$

Note the volatility is no longer fixed over different pairs of alternatives $x, y \in X$.
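To illustrate what pair-specific volatility can do, the sketch below uses the standard closed forms for a drift-diffusion process started at zero with a constant symmetric boundary $\pm b$: the probability of hitting the upper boundary first is $1 / (1 + e^{-2 \delta b / \alpha^2})$ and the mean exit time is $(b / \delta) \tanh(\delta b / \alpha^2)$. The constant boundary is an assumption made here for tractability, `alpha` stands for the pair-level volatility $\alpha^x + \alpha^y$, and the numbers are made up.

```python
import math

def choice_prob(delta, b, alpha):
    """P(upper boundary hit first) for drift delta, volatility alpha, boundary +/- b."""
    return 1.0 / (1.0 + math.exp(-2.0 * delta * b / alpha**2))

def mean_decision_time(delta, b, alpha):
    """Expected exit time from (-b, b), starting at 0."""
    if delta == 0:
        return (b / alpha) ** 2          # driftless limit
    return (b / delta) * math.tanh(delta * b / alpha**2)

b = 1.0
# Hypothetical pairs: (x, y) has the larger drift but a much noisier signal.
p_xy, t_xy = choice_prob(1.0, b, 2.0), mean_decision_time(1.0, b, 2.0)
p_wz, t_wz = choice_prob(0.5, b, 1.0), mean_decision_time(0.5, b, 1.0)

print(f"(x, y): p = {p_xy:.3f}, mean time = {t_xy:.3f}")
print(f"(w, z): p = {p_wz:.3f}, mean time = {t_wz:.3f}")
```

In this example $(w, z)$ is chosen more decisively but also more slowly on average, so the two rankings no longer move together once volatility varies across pairs. Mean times are only a necessary check for the full stochastic-dominance ordering of $F$.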

- Look at Exercise 10.14

- Think about an agent optimizing---make a story