# Post-Digital Transformation, Decision Making and Intellectual Debt

### [Neil D. Lawrence](http://inverseprobability.com), University of Cambridge

### 2022-04-26

**Abstract**: Digital transformation has offered the promise of moving
from a manual decision-making world to a world where decisions can be
rational, data-driven and automated. The first step to digital
transformation is mapping the world of atoms (material, customers,
logistic networks) into the world of bits.

I’ll discuss how the artificial systems we have developed operate in a
fundamentally different way to our own intelligence. I’ll describe how
this difference in operational capability leads us to misunderstand the
nature and influence of decisions made by machine intelligence.

Developing this understanding is important in integrating human
decisions with those from the machine. These ideas are designed to help
with the challenge of ‘post digital transformation’: doing business in a
digital world.


::: {.cell .markdown}


# Introduction

## Pre-Read Material

Please watch [this
excerpt](https://www.youtube.com/watch?v=ubq3ayuG2EY) from the Lex
Fridman podcast, an interview with the roboticist Rodney Brooks. Please
read this [blog post by Jonathan Zittrain on Intellectual
Debt](https://medium.com/berkman-klein-center/from-technical-debt-to-intellectual-debt-in-ai-e05ac56a502c).

## Setup

In [None]:
import matplotlib.pyplot as plt
plt.rcParams.update({'font.size': 22})


## notutils


This small helper package provides various notebook utilities used
below.

The software can be installed using

In [None]:
%pip install notutils

from a notebook cell, or with `pip install notutils` from a command
prompt where you can access your Python installation.

The code is also available on GitHub:
<https://github.com/lawrennd/notutils>

Once `notutils` is installed, it can be imported in the usual manner.

In [None]:
import notutils

## The Gartner Hype Cycle


<img src="https://inverseprobability.com/talks/./slides/diagrams//Gartner_Hype_Cycle.svg" class="" width="80%" style="vertical-align:middle;">

Figure: <i>The Gartner Hype Cycle places technologies on a graph that
relates the expectations we have of a technology to its actual
influence. Early hope for a new technology is often displaced by
disillusionment due to the time it takes for a technology to be usefully
deployed.</i>

The [Gartner Hype Cycle](https://en.wikipedia.org/wiki/Hype_cycle) tries
to assess where an idea is in terms of maturity and adoption. It splits
the evolution of technology into a technological trigger, a peak of
expectations followed by a trough of disillusionment and a final
ascension into a useful technology. It looks rather like a classical
control response to a final set point.
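The control analogy can be made concrete with a small sketch. It evaluates the standard unit step response of an underdamped second-order system, which overshoots the set point, dips below it and then settles. The damping ratio and natural frequency are illustrative assumptions, not values fitted to any technology.

```python
import math

def step_response(t, damping=0.2, natural_freq=1.0):
    """Unit step response of an underdamped second-order system (0 < damping < 1)."""
    damped_freq = natural_freq * math.sqrt(1.0 - damping ** 2)
    decay = math.exp(-damping * natural_freq * t)
    return 1.0 - decay * (
        math.cos(damped_freq * t)
        + damping / math.sqrt(1.0 - damping ** 2) * math.sin(damped_freq * t)
    )

# Evaluate the response on a grid of times from 0 to 40 (arbitrary units).
times = [0.1 * i for i in range(401)]
response = [step_response(t) for t in times]

peak = max(response)                   # analogue of the peak of expectations
peak_index = response.index(peak)
trough = min(response[peak_index:])    # analogue of the trough of disillusionment
final = response[-1]                   # settles close to the set point of 1
```

With these assumed parameters the response rises above the set point, falls back below it, and then converges, which mirrors the trigger, peak, trough and plateau of the hype cycle.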

## Cycle for ML Terms


## Google Trends


In [None]:
%pip install pytrends

In [None]:
import mlai.plot as plot

In [None]:
plot.google_trends(terms=['artificial intelligence', 'big data', 'data mining', 'deep learning', 'machine learning'], 
                  initials='ai-bd-dm-dl-ml', 
                  diagrams='./data-science')

In [None]:
import notutils as nu
from ipywidgets import IntSlider

In [None]:
nu.notebook.display_plots('ai-bd-dm-dl-ml-google-trends{sample:0>3}.svg', 
                            './data-science/', sample=IntSlider(0, 0, 4, 1))

<img src="https://inverseprobability.com/talks/./slides/diagrams//data-science/ai-bd-dm-dl-ml-google-trends.svg" class="" width="80%" style="vertical-align:middle;">

Figure: <i>Google trends for ‘artificial intelligence,’ ‘big data,’
‘data mining,’ ‘deep learning,’ ‘machine learning’ as different
technological terms gives us insight into their popularity over
time.</i>

Google trends gives us insight into the interest for different terms
over time.

Examining Google trends for ‘artificial intelligence,’ ‘big data,’ ‘data
mining,’ ‘deep learning’ and ‘machine learning’ we can see that
‘artificial intelligence’ *may* be entering a plateau of productivity,
‘big data’ is entering the trough of disillusionment, and ‘data mining’
seems to be deeply within the trough. On the other hand, ‘deep learning’
and ‘machine learning’ appear to be ascending to the peak of inflated
expectations having experienced a technology trigger.

For deep learning that technology trigger was the ImageNet result of
2012 (Krizhevsky et al., n.d.). This step change in performance on
object detection in images was achieved through convolutional neural
networks, popularly known as ‘deep learning.’

# What is Machine Learning?


What is machine learning? At its most basic level machine learning is a
combination of

$$\text{data} + \text{model} \stackrel{\text{compute}}{\rightarrow} \text{prediction}$$

where *data* is our observations. They can be actively or passively
acquired (meta-data). The *model* contains our assumptions, based on
previous experience. That experience can be other data, it can come from
transfer learning, or it can merely be our beliefs about the
regularities of the universe. In humans our models include our inductive
biases. The *prediction* is an action to be taken or a categorization or
a quality score. The reason that machine learning has become a mainstay
of artificial intelligence is the importance of predictions in
artificial intelligence. The data and the model are combined through
computation.

In practice we normally perform machine learning using two functions. To
combine data with a model we typically make use of:

**a prediction function**: a function which is used to make the
predictions. It includes our beliefs about the regularities of the
universe, our assumptions about how the world works, e.g., smoothness,
spatial similarities, temporal similarities.

**an objective function**: a function which defines the cost of
misprediction. Typically, it includes knowledge about the world’s
generating processes (probabilistic objectives) or the costs we pay for
mispredictions (empirical risk minimization).

The combination of data and model through the prediction function and
the objective function leads to a *learning algorithm*. The class of
prediction functions and objective functions we can make use of is
restricted by the algorithms they lead to. If the prediction function or
the objective function are too complex, then it can be difficult to find
an appropriate learning algorithm. Much of the academic field of machine
learning is the quest for new learning algorithms that allow us to bring
different types of models and data together.
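As a minimal sketch of how these pieces fit together, the example below pairs a straight-line prediction function with a mean squared error objective and uses gradient descent as the learning algorithm. The toy data, the function names and the step size are all assumptions chosen for illustration.

```python
# Toy data generated from y = 2x + 1 with no noise (an illustrative assumption).
x = [0.0, 1.0, 2.0, 3.0, 4.0]
y = [1.0, 3.0, 5.0, 7.0, 9.0]

def prediction(x_i, m, c):
    """Prediction function: a straight line, encoding an assumption of linearity."""
    return m * x_i + c

def objective(m, c):
    """Objective function: mean squared error between predictions and targets."""
    return sum((prediction(x_i, m, c) - y_i) ** 2 for x_i, y_i in zip(x, y)) / len(x)

def fit(learning_rate=0.05, iterations=2000):
    """Learning algorithm: gradient descent on the objective."""
    m, c = 0.0, 0.0
    n = len(x)
    for _ in range(iterations):
        errors = [prediction(x_i, m, c) - y_i for x_i, y_i in zip(x, y)]
        grad_m = 2.0 * sum(e * x_i for e, x_i in zip(errors, x)) / n
        grad_c = 2.0 * sum(errors) / n
        m -= learning_rate * grad_m
        c -= learning_rate * grad_c
    return m, c

m_hat, c_hat = fit()
```

Here the learning algorithm is simple because both functions are simple. A more complex prediction or objective function would need a more elaborate optimizer.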

A useful reference for state of the art in machine learning is the UK
Royal Society Report, [Machine Learning: Power and Promise of Computers
that Learn by
Example](https://royalsociety.org/~/media/policy/projects/machine-learning/publications/machine-learning-report.pdf).

You can also check my blog post on [What is Machine
Learning?](http://inverseprobability.com/2017/07/17/what-is-machine-learning).

## Artificial Intelligence and Data Science


Artificial intelligence has the objective of endowing computers with
human-like intelligent capabilities. Examples include understanding an image
(computer vision) or the contents of some speech (speech recognition),
the meaning of a sentence (natural language processing) or the
translation of a sentence (machine translation).

### Supervised Learning for AI

The machine learning approach to artificial intelligence is to collect
and annotate a large data set from humans. The problem is characterized
by input data (e.g. a particular image) and a label (e.g. is there a car
in the image yes/no). The machine learning algorithm fits a mathematical
function (I call this the *prediction function*) to map from the input
image to the label. The parameters of the prediction function are set by
minimizing an error between the function’s predictions and the true
data. The mathematical function that encapsulates this error is known
as the *objective function*.

This approach to machine learning is known as *supervised learning*.
Various approaches to supervised learning use different prediction
functions, objective functions or different optimization algorithms to
fit them.

For example, *deep learning* makes use of *neural networks* to form the
predictions. A neural network is a particular type of mathematical
function that allows the algorithm designer to introduce invariances
into the function.

An invariance is an important way of including prior understanding in a
machine learning model. For example, in an image, a car is still a car
regardless of whether it’s in the upper left or lower right corner of
the image. This is known as translation invariance. A neural network
encodes translation invariance in *convolutional layers*. Convolutional
neural networks are widely used in image recognition tasks.
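A one-dimensional sketch shows the idea. Sliding a filter along the input (the cross-correlation that deep learning libraries call convolution) gives a feature map that shifts when the pattern shifts. Taking the maximum over positions then gives an output that does not depend on where the pattern sits. The filter and signals are hypothetical stand-ins for a car detector and two images.

```python
def correlate(signal, kernel):
    """Slide the kernel along the signal ('valid' cross-correlation)."""
    k = len(kernel)
    return [
        sum(signal[i + j] * kernel[j] for j in range(k))
        for i in range(len(signal) - k + 1)
    ]

kernel = [1.0, 2.0, 1.0]                         # hypothetical pattern detector
left = [0, 0, 1, 2, 1, 0, 0, 0, 0, 0, 0, 0]      # pattern near the start
right = [0, 0, 0, 0, 0, 0, 0, 1, 2, 1, 0, 0]     # same pattern, shifted by five

features_left = correlate(left, kernel)
features_right = correlate(right, kernel)

# Global max pooling discards position, so the two outputs agree.
pooled_left = max(features_left)
pooled_right = max(features_right)
```

The feature maps differ only by a shift, while the pooled responses are identical. That is the sense in which the filter detects the pattern wherever it appears.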

An alternative structure is known as a recurrent neural network (RNN).
RNNs encode temporal structure. They use auto-regressive
connections in their hidden layers, so they can be seen as time series
models which have non-linear auto-regressive basis functions. They are
widely used in speech recognition and machine translation.
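The auto-regressive connection can be sketched with a single hidden unit whose state at each step is a non-linear function of its previous state and the current input. The weights and the choice of `tanh` are assumptions for illustration; practical RNNs use vector-valued states and learned weights.

```python
import math

def rnn_states(inputs, w_h=0.5, w_x=1.0, h0=0.0):
    """Run a one-unit recurrent layer: h_t = tanh(w_h * h_{t-1} + w_x * x_t)."""
    h = h0
    states = []
    for x_t in inputs:
        h = math.tanh(w_h * h + w_x * x_t)   # new state depends on the old state
        states.append(h)
    return states

# The same input values in a different order give different final states,
# because the hidden state carries a fading memory of the sequence.
states_a = rnn_states([1.0, 0.0, 0.0])
states_b = rnn_states([0.0, 0.0, 1.0])
```

In the first sequence the early pulse decays through the recurrence, while in the second the pulse arrives last and dominates the final state.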

Machine learning has been deployed in speech recognition (e.g. Alexa,
with deep neural networks and convolutional neural networks for speech
recognition) and in computer vision (e.g. Amazon Go, with convolutional
neural networks for person recognition and pose detection).

The field of data science is related to AI, but philosophically
different. It arises because we are increasingly creating large amounts
of data through *happenstance* rather than active collection. In the
modern era data is laid down by almost all our activities. The objective
of data science is to extract insights from this data.

Classically, in the field of statistics, data analysis proceeds by
assuming that the question (or scientific hypothesis) comes before the
data is created. E.g., if I want to determine the effectiveness of a
particular drug, I perform a *design* for my data collection. I use
foundational approaches such as randomization to account for
confounders. This made a lot of sense in an era where data had to be
actively collected. The reduction in cost of data collection and storage
now means that many data sets are available which weren’t collected with
a particular question in mind. This is a challenge because bias in the
way data was acquired can corrupt the insights we derive. We can perform
randomized controlled trials (referred to as A/B testing in modern
internet parlance) to verify our conclusions, but
the opportunity is to use data science techniques to better guide our
question selection or even answer a question without the expense of a
full randomized controlled trial.
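A small simulation illustrates how happenstance data can mislead and why randomization helps. All the numbers are assumptions: a drug with a true benefit of 1.0 on a recovery score, a severity confounder that lowers the score by 3.0, and sicker patients being more likely to receive the drug in the observational setting.

```python
import random

TRUE_EFFECT = 1.0  # assumed benefit of the drug on a recovery score

def simulate(n, randomized, rng):
    """Return the naive estimate: mean outcome of treated minus untreated."""
    treated_outcomes, control_outcomes = [], []
    for _ in range(n):
        severe = rng.random() < 0.5             # confounder: illness severity
        if randomized:
            p_treat = 0.5                       # coin flip, independent of severity
        else:
            p_treat = 0.9 if severe else 0.1    # sicker patients get the drug more often
        treated = rng.random() < p_treat
        outcome = TRUE_EFFECT * treated - 3.0 * severe + rng.gauss(0.0, 1.0)
        (treated_outcomes if treated else control_outcomes).append(outcome)
    return (sum(treated_outcomes) / len(treated_outcomes)
            - sum(control_outcomes) / len(control_outcomes))

rng = random.Random(0)
observational_estimate = simulate(20000, randomized=False, rng=rng)
randomized_estimate = simulate(20000, randomized=True, rng=rng)
```

Under these assumptions the observational comparison makes a beneficial drug look harmful, because treated patients were sicker to begin with. The randomized comparison recovers an estimate close to the true effect.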

# Embodiment and Intellectual Debt

<img class="" src="https://inverseprobability.com/talks/./slides/diagrams//art/sistine-chapel-ceiling.jpg" style="width:100%">

Figure: <i>The ceiling of the Sistine Chapel.</i>

[Patrick Boyde](https://www.mmll.cam.ac.uk/pb127)’s talks on the Sistine
Chapel focussed on both the structure of the chapel ceiling, describing
the impression of height it was intended to give, as well as the
significance and positioning of each of the panels and the meaning of
the individual figures.

<img class="" src="https://inverseprobability.com/talks/./slides/diagrams//art/the-creation-of-man-michelangelo.jpg" style="width:80%">

Figure: <i>Photo of Detail of Creation of Man from the Sistine chapel
ceiling.</i>

One of the most famous panels, central in the ceiling, is the
creation of man. Here, God in the guise of a pink-robed bearded man
reaches out to a languid Adam.

The representation of God in this form seems typical of the time,
because elsewhere in the Vatican Museums there are similar
representations.

<img class="" src="https://inverseprobability.com/talks/./slides/diagrams//art/the-creation-of-man-detail-god-michelangelo.jpg" style="width:80%">

Figure: <i>Photo detail of God.</i>

<https://commons.wikimedia.org/wiki/File:Michelangelo,_Creation_of_Adam_04.jpg>

For a time, an [image of the
terminator](https://www.flickr.com/photos/tom-margie/2144882415/sizes/o/)
was included at the head of all articles about AI.

<img class="" src="https://inverseprobability.com/talks/./slides/diagrams//ai/terminator-image.jpg" style="width:70%">

Figure: <i>Image of James Cameron’s terminator. Images like this have
been used to illustrate articles about artificial intelligence.</i>

Sometimes, this image is even combined with that of God to create what
[Beth Singler](https://bvsingler.com), a digital anthropologist who is a
JRF at Homerton College, refers to as the creation meme (Singler, 2020).

<img class="" src="https://inverseprobability.com/talks/./slides/diagrams//people/beth-singler.jpg" style="width:80%">

Figure: <i>Beth Singler is a digital anthropologist who holds a JRF at
Homerton College. She has explored parallels between the Michelangelo
image of creation and our own notion of robotic creation.</i>

So in a very real sense, we can see that both God and AI are viewed by
us as embodied intelligences, whether creator or created. We show these
other-intelligences in a humanoid form.

Krizhevsky, A., Sutskever, I., Hinton, G.E., n.d. ImageNet
classification with deep convolutional neural networks. pp. 1097–1105.

Singler, B., 2020. The AI creation meme: A case study of the new
visibility of religion in artificial intelligence discourse. Religions
11. <https://doi.org/10.3390/rel11050253>