---
title: Landscape Shrouded in Clouds
render-on-save: true
author:
  - name: Tom Cunningham
citation: true
date: 2025-08-02
date-modified: last-modified
fig-align: center
fig-height: 1
# draft: true
bibliography: ai.bib
reference-location: margin
# engine: knitr
format:
   html:
      toc: true
      toc-depth: 2
      toc-location: left
---




<style>p { text-indent: -2em; margin-left: 2em; }</style>


#           Model: Exploring a Landscape.




```{tikz}
#| column: margin
#| fig-width: 3
#| caption: caption
\begin{tikzpicture}[scale=6]
   \draw[<->, line width=1] (0,1) -- (0,0) node[midway,above,rotate=90] {payoff}
         -- (1,0) node[midway,below,align=center]{action};

   \fill[gray!50!white] (0,.3)--(.2,.3)--(.2,.7)--(0,.7)--cycle;
      \fill (.1,.5) circle(.01);
   \fill[gray!50!white] (.2,.3)--(.4,.3)--(.4,.7)--(.2,.7)--cycle;
      \fill (.3,.5) circle(.01);
   \fill[gray!50!white] (.4,.4)--(.6,.4)--(.6,.65)--(.4,.65)--cycle;
      \fill (.5,.525) circle(.01);
   \fill[gray!50!white] (.6,.3)--(.8,.3)--(.8,.7)--(.6,.7)--cycle;
      \fill (.7,.5) circle(.01);
   \fill[gray!50!white] (.8,.3)--(1,.3)--(1,.7)--(.8,.7)--cycle;
      \fill (.9,.5) circle(.01);
\end{tikzpicture}
```





Assumptions:

1. **Each person chooses some design $x$.** They get payoff $y(x)$, but the function $y(\cdot)$ is unknown, so they don't observe the payoff of a design until they try it out. You can interpret the action as a blueprint for a house, a business plan, a computer program, an agricultural practice, a novel, or a song. Everything below also applies when the payoff depends on the state $z$, $v(x|z)$, so that the action is context-specific, e.g. replying to an email, operating a car, operating a machine, or writing copy to advertise some product.

2. **You keep making the same choice over time.**

3. **Everyone is playing an independent game, but can observe each other.** We're all subsistence farmers living side-by-side, and our actions don't directly affect each other. Suppose we can all observe each other's choices but not their payoffs.

Implications:

1. **Exploration will decrease over time.** A single agent will stop exploring at some point and choose the same $x$ every period. E.g. we see that societies adopt certain practices in agriculture, architecture, clothes, and cooking, and then settle on those once they have exhausted local improvements.

2. **Everyone will do the same thing (herding).** If actions are observable then it is rational for each person to imitate others' actions, and so within a society everyone's actions will tend to be clustered in a neighborhood (if you're walking through a minefield, you want to follow someone else's footsteps).[^Aumann] 

3. **There will be inefficiently little exploration.** When you try out a new $x$ your neighbors benefit because they can learn from your experience, but you don't capture that benefit. We will thus have a million farmers all doing the same thing, when it would be better if some of them experimented. We can collectively organize this: (A) sponsor people to run experiments; (B) let people register a claim on some $x$ and charge others to use it (intellectual property protection).

4. **Bigger societies will find better designs.** Bigger societies will have (A) more random variation to learn from; (B) more ability to collectively organize to explore.

5. **People will become experts.** Some people will learn the local neighborhood of their payoff-space and can charge for that expertise: artisans, architects, artists, doctors.



[^Aumann]: Define equilibrium as when everyone chooses the same $x$ every period. This follows if there are common priors, common knowledge of rationality, and a regularity assumption on beliefs. Choosing some $\hat{x}$ implies that, for every $x\neq\hat{x}$, $E[y(\hat{x})] - E[y(x)]\geq 0$. Assume these inequalities are always strict, meaning $E[y(x)]$ is one-to-one (injective) in $x$. Then if two people choose different $x$ they must disagree about one of those inequalities, violating common knowledge of rationality (Aumann's agreeing-to-disagree).


##          Assumptions on landscape

**Assume $v(\bm{x})$ is not concave.** If it were concave (a convex optimization problem) then local exploration would gradually converge to the global maximum; with multiple peaks it can get stuck on a local one.
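
A minimal sketch of why this assumption matters, under illustrative assumptions: designs are points on a one-dimensional grid, "local exploration" is greedy hill climbing that moves to a better neighbour, and the two payoff functions `single_peak` and `rugged` are hypothetical examples, not part of the model above.

```python
import math

def hill_climb(payoff, start, n_points):
    """Greedy local exploration on the grid 0..n_points-1.

    Move to the better neighbour until no neighbour is strictly better.
    """
    x = start
    while True:
        neighbours = [n for n in (x - 1, x + 1) if 0 <= n < n_points]
        best = max(neighbours, key=payoff)
        if payoff(best) <= payoff(x):
            return x  # local optimum reached
        x = best

def local_optima(payoff, n_points):
    """Set of designs where exploration stops, over every possible starting design."""
    return {hill_climb(payoff, s, n_points) for s in range(n_points)}

N_POINTS = 101

def single_peak(i):
    # Hypothetical single-peaked landscape with its maximum at design 70.
    return -((i - 70) ** 2)

def rugged(i):
    # Hypothetical multi-peaked landscape: a wave plus a gentle upward trend.
    return math.sin(i / 5.0) + 0.01 * i

print("single-peaked:", sorted(local_optima(single_peak, N_POINTS)))
print("rugged:       ", sorted(local_optima(rugged, N_POINTS)))
```

On the single-peaked landscape every starting point ends at the same design; on the rugged one, where you end up depends on where you started, which is what makes exploration beyond the local neighbourhood valuable.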

##          Observations

- **Agricultural Yield is Exploration.** Agricultural societies slowly accumulated crop management practices that have raised yields (irrigation, rotation, fertilizers). The invention of printing made knowledge diffuse more quickly, & so more people caught up to the knowledge frontier (e.g. Diderot's encyclopedia). Organized research advanced the frontier. People argue over whether intellectual property was a positive or negative for innovation.

- **Technologies of Reproduction.** You could extend the model such that you pay a lower cost when your action is an exact copy of an existing action (a reproduction). Certain technologies made it cheap to reproduce existing things: writing, printing, photography, audio recording, video recording. This increases welfare but makes outcomes more homogeneous: the distribution of actions becomes very spiky.

- **Science Maps out the Landscape.** Normal progress consists of mapping out individual points on the landscape. Scientific progress is different - it maps out big areas. Newtonian mechanics tells you the stability of any bridge; modern chemistry tells you the properties of any combination of ingredients.

##          Applications to AI

There are multiple things AI can do here: (1) predict the outcome given each context and action ($v(\bm{z},\bm{x})$); (2) find an action $\bm{x}$ which maximizes $v(\bm{z},\bm{x})$; (3) predict the typical human action ($\bm{x}$) given the context ($\bm{z}$). The last is imitative.
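
The three tasks can be told apart on a toy dataset. This is only a sketch: the `observations` log, the context and action labels, and the function names are all hypothetical, and each task is implemented in the crudest possible way (cell means and counts) rather than with a learned model.

```python
from collections import Counter

# Hypothetical log of (context z, action x, payoff v) triples.
observations = [
    ("rainy", "irrigate", 1.0),
    ("rainy", "irrigate", 2.0),
    ("rainy", "wait", 4.0),
    ("dry", "irrigate", 5.0),
    ("dry", "irrigate", 6.0),
    ("dry", "wait", 1.0),
]

def predict_payoff(z, x):
    """(1) Predict the outcome: mean observed payoff for this context and action."""
    payoffs = [v for (zi, xi, v) in observations if zi == z and xi == x]
    return sum(payoffs) / len(payoffs) if payoffs else None

def best_action(z):
    """(2) Optimize: the observed action with the highest predicted payoff in context z."""
    actions = {xi for (zi, xi, _) in observations if zi == z}
    return max(actions, key=lambda x: predict_payoff(z, x))

def typical_action(z):
    """(3) Imitate: the action that people most often chose in context z."""
    counts = Counter(xi for (zi, xi, _) in observations if zi == z)
    return counts.most_common(1)[0][0]

print(best_action("rainy"), typical_action("rainy"))
```

In the `"rainy"` context the imitative answer and the payoff-maximizing answer differ, which is the gap between task (3) and task (2).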

   - **Human-level classification (content moderation, radiologists).** You train a model on human labels to classify inputs. Now classification becomes very cheap. Expect this to be a substitute for low-skilled employees: borderline cases would still be escalated to the high-skilled employees.
   
   - **Super-human classification (sexing chickens; MRI).** You see a chick $z$, you choose whether to raise it to a chicken ($x\in\{0,1\}$), and you get payoff if the chicken is female. Humans can't tell the difference between male and female chicks by sight, but suppose AI models figure out how to do it. This is a pure yield increase for chicken ranchers because you save the cost of raising male chicks.

   - **Synthesize media (text, music, video, ads).** Here the function $v(\cdot)$ represents human appreciation. Thus the function is known in a certain sense but the knowledge is implicit, and so in practice we explore the space and we have experts (writers, musicians, artists) who can create content that gets a good reaction.
   
   - **Playing chess.**
   - **Folding proteins.**
   - **Writing code (imitative).** 
   - **Face recognition.** 
   - **Creating pictures.**
   - **Driving a car.**
   - **Writing boilerplate text.**
   - **Suggesting responses to customer queries.**
   - **Satisfying constraints.**
   - **LLM answering factual questions.** 


|                                                            | linear fit? | skill-biased? |
| ---------------------------------------------------------- | ----------- | ------------- |
| estimating probability of death (or loan default)          | yes         |               |
| agricultural management (irrigation, rotation, fertilizer) | yes         |               |
| classifying whether photo is porn                          | no          |               |
| classifying whether text is hate speech, spam, etc.        | no          |               |
| classifying sex of chicken                                 | no          |               |
| answering a customer question about a product              | no          |               |
| driving a car                                              | no          |               |
| answering a factual question                               | no          |               |
|                                                            |             |               |


###            applications

Types of designs.
: - *design for consumption*: a story, a novel, a song, a picture, a wood-carving.
: - *design for function*: computer code, agricultural practice (what to use as fertilizer, when to water plants), a letter, conversation with a customer, blueprint for a building.
: - *tacit knowledge of practices.* How to operate a machine, how to stitch a collar, how to pitch a sale to a client.


LLMs aggregate existing knowledge.
: LLMs don't map out new points on the landscape, but aggregate existing knowledge. They are similar to encyclopedias or search engines. We'd expect each of these to raise the quality of decisions, but also to lower the returns to expertise: e.g. experts on Cobol, on rare diseases, on asbestos management.

LLMs translate between two idioms.
: They can serve as interfaces between business logic and humans. They can write emails & probably soon will have telephone conversations.

   Much human work can be thought of as *translation*: (1) customer support tells you how your situation fits into the policy; (2) insurance adjuster translates your bent fender into policy language; (3) literal translators/interpreters.


   Many occupations are translating between human and machine. People who serve as interfaces between a mainframe and the customer: call center agents, gate agents, insurance adjusters, bank tellers, tax agents, rental car service clerk. They talk to the customer and type stuff into a computer.

   Slow replacement with self-service: ATMs, self-service kiosks, automated phone systems, websites, phone apps.

   It's been difficult to automate policy agents, but some societies have been successful: Sweden's tax filing by SMS, Japanese vending machines, ATMs instead of bank tellers.

#           Application to Intellectual Property


**Key points:**

   1. **We have IP protection because the landscape is shrouded in clouds.**

   2. **AI illuminates the whole landscape.** An implication is we should substantially loosen IP law, so it's not just a race to acquire land.

   3. **Copying occurs in a pre-AI world, but in the post-AI world there's no real distinction between creation and copying.**

   4. **LLMs could be prevented from producing exact matches with a hash function.** (bloom filter?)
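
A rough sketch of how the Bloom-filter idea in point 4 could work, assuming "exact match" means sharing a run of `n` consecutive words with a protected text. The class, the function names, and the parameter values are all hypothetical choices for illustration. A Bloom filter never misses an item that was added, but it can occasionally report a false match, and it only catches verbatim overlap, not paraphrase.

```python
import hashlib

class BloomFilter:
    """Compact set membership: no false negatives, rare false positives."""

    def __init__(self, n_bits=1 << 20, n_hashes=5):
        self.n_bits = n_bits
        self.n_hashes = n_hashes
        self.bits = bytearray(n_bits // 8 + 1)

    def _positions(self, item):
        # Derive n_hashes bit positions by salting SHA-256 with a seed.
        for seed in range(self.n_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.n_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item):
        return all(self.bits[pos // 8] & (1 << (pos % 8))
                   for pos in self._positions(item))

def ngrams(text, n):
    """All runs of n consecutive lower-cased words in text."""
    words = text.lower().split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]

def contains_protected_span(candidate, protected, n):
    """True if any n-word run of candidate is (probably) in the protected set."""
    return any(gram in protected for gram in ngrams(candidate, n))

# Register the 5-word runs of a protected text.
protected = BloomFilter()
for gram in ngrams("it was the best of times it was the worst of times", 5):
    protected.add(gram)

print(contains_protected_span("she wrote that it was the best of times indeed", protected, 5))
print(contains_protected_span("a new sentence about farmers choosing which crops to plant", protected, 5))
```

The filter stores only bits, not the protected texts themselves, so the check is cheap even for a very large corpus.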


--------------------------------------------

We should carefully distinguish *current IP law*, *future IP law*, and *ideal IP law*.
: Much of the debate is about how to interpret current IP law, but current IP law was written as a response to a specific situation. We should also ask how IP law and its interpretation are likely to change, and what the ideal IP law would be.

Artefacts that can have intellectual property protection.
: - Chemical composition of a drug
: - Process used to manufacture a light bulb
: - The likeness of a cartoon character
: - A brand name
: - A photograph
: - The text of a news article
: - The lyrics of a song
: - The facts reported in a news article (sometimes)
: - A software algorithm


We can make two assumptions about the landscape:
: 1. Snowflake/atomic: $y(x)$ has no structure, each $y(x)$ is a random draw.
: 2. Smooth: $y(x)$ has structure, so as you observe more you'll make better decisions.
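
The difference between the two assumptions can be seen in how much a neighbouring design tells you. A minimal sketch, with assumptions labelled: the snowflake landscape is modelled as independent normal draws, and the smooth landscape as a random walk over a one-dimensional grid (a discretized Brownian path); the function names and the `step_sd` parameter are hypothetical.

```python
import random

def snowflake_landscape(n, rng):
    """Each payoff is an independent draw: neighbours tell you nothing."""
    return [rng.gauss(0.0, 1.0) for _ in range(n)]

def smooth_landscape(n, rng, step_sd=0.1):
    """Payoffs follow a random walk: nearby designs have similar payoffs."""
    payoffs, level = [], 0.0
    for _ in range(n):
        level += rng.gauss(0.0, step_sd)
        payoffs.append(level)
    return payoffs

def neighbour_correlation(y):
    """Pearson correlation between y(x) and y(x+1) across the grid."""
    a, b = y[:-1], y[1:]
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    cov = sum((p - mean_a) * (q - mean_b) for p, q in zip(a, b))
    var_a = sum((p - mean_a) ** 2 for p in a)
    var_b = sum((q - mean_b) ** 2 for q in b)
    return cov / (var_a * var_b) ** 0.5

rng = random.Random(0)  # fixed seed so the run is reproducible
print("snowflake:", neighbour_correlation(snowflake_landscape(5000, rng)))
print("smooth:   ", neighbour_correlation(smooth_landscape(5000, rng)))
```

In the snowflake case the correlation is close to zero, so observing someone else's design teaches you nothing about nearby designs; in the smooth case it is close to one, so each observation also informs its neighbourhood.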

Model: unique snowflakes.
: Suppose each work is its own unique snowflake. This is a common way of modelling intellectual property (I think the Josh Gans paper assumes this). Implications:
   1. _Copying is binary._ You either copy or you don't; there's no partial copying.
   2. _The marginal value of information is equal to the average value._ (???)


With AI everything is illuminated.
: Now suppose that AI illuminates the entire landscape, i.e. the mapping $v(x)$ becomes fully known to everybody.
: E.g. we can suddenly observe (1) of all possible drugs, how effective is each; (2) of all possible lyrics, how resonant is each; (3) of all possible paintings, how attractive is each.

Distinction: whether the value comes from the world or from human response.
: It's worth distinguishing two sources of uncertainty in $v(x)$: whether $v(\cdot)$ measures the effect of $x$ on the outside world, or the effect of $x$ on human responses.
: 1. About the world: $v(x)$ represents efficacy of a drug, or the speed of a sorting algorithm.
: 2. About human responses: $v(x)$ represents memorability of a poem, the click-through-rate of an advertisement, the beauty of an image.

Prediction: there will be less imitation in the post-AI world.
: If the full landscape is disclosed then there's no longer a *reason* to imitate. You can just choose the $x$ which maximizes $v(x)$. We will still see *clustering*, in that people will choose similar values of $x$, but only because they maximize $v(\cdot)$, not because they're imitating each other.

--------------------------------------------

Twist: familiarity changes the value.
: In some cases the use of an input x will change the output v(x). E.g. after people have been exposed to a particular phrase or a particular cartoon character then that particular realization becomes more attractive in the future. As a consequence the landscape will have ridges, creases bearing the imprint of particular cases (distinct from the ridges due to finite training data).

Good application: genre novels.
: Suppose we have a set of 10,000 novels which are romance or western or fantasy. We can think of the probability distribution from which they're drawn.
: The probability distribution will have a ridge around actual novels but also a lot of structure away from them. A model trained on 10,000 novels from a trillion could avoid direct quotes (e.g. with a Bloom filter) but still pick up the sense.

The marginal value of information is close to zero.
: The returns to information are highly concave: think of each observation as a dot on the landscape, where the value of an additional dot falls with something like $1/\sqrt{N}$ once $N$ dots are already known. As a consequence, if we pay people the marginal value of their information, the payments will be very small. (Euler's theorem: if a function has constant returns to scale, then the sum of payments to factors will exactly equal total product; with decreasing returns the payments sum to less than total product.)
: This result only holds in the *landscape* world, not in the *snowflake* world.
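
A worked version of the claim, as a sketch rather than a result. Assume, taking the square-root scaling above literally, that the total value of $N$ observations is $V(N) = c\sqrt{N}$ for some constant $c > 0$; both $V$ and $c$ are illustrative symbols introduced here. Then the marginal value of one more observation, and the total bill if every contributor is paid that marginal value, are:

$$
V'(N) = \frac{c}{2\sqrt{N}}, \qquad N \cdot V'(N) = \frac{c\sqrt{N}}{2} = \tfrac{1}{2} V(N).
$$

So the payment per observation shrinks toward zero as $N$ grows, and marginal-value payments add up to only half of the total value. More generally, if $V(N) = cN^{\alpha}$ with $0 < \alpha \le 1$, the payments sum to $\alpha V(N)$, which equals total value only in the constant-returns case $\alpha = 1$ covered by Euler's theorem.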


--------------------------------------------


> "Data are considered discoverable 'Facts,' not original works in themselves, and are thus not copyrightable. The methods of compilation, analysis, annotation arrangement, or selection of data, which may be novel, unique, or proprietary, can be protected under copyright."


#           Related Literature

Rugged landscape.
: Callander (2011, AER) "Searching and Learning by Trial and Error": outcomes are the realized path of a Brownian motion over the choice space; optimal experimentation is history-dependent and can settle at local optima.
: Callander, Lambert, Matouschek (2025 WP) “Innovation and Competition on a Rugged Technological Landscape”
   > "Innovation in this market is irregular with frequent changes of direction and cycles between frontier and niche innovation. We show how the ruggedness of the technological landscape itself deters innovation, generating less entry and product differentiation, narrower markets, and more intense competition than in a world of certainty."
: Carnehl & Schneider (2025, Ecma) "A Quest for Knowledge"
   > "Researchers select a question and how intensely to study it. The novelty of a question determines both the value and difficulty of discovering its answer. We show that the benefits of discoveries are nonmonotone in novelty. Knowledge expands endogenously step-by-step over time."

Endogenous growth.
: There are a number of classic models of endogenous innovation (Romer, Lucas, Aghion, Krugman). However I believe they all assume that each innovation is idiosyncratic, there is no metric of distance between different techniques. In the model discussed here there's a seam of innovations gradually being mined. In a typical "optimal control" problem there's also explore-exploit but it's usually convex so it's not especially hard to converge on the global optimum.