# **Mathematical Neuron by McCulloch and Pitts:**

> ![](https://miro.medium.com/v2/resize:fit:640/format:webp/1*fDHlg9iNo0LLK4czQqqO9A.png)

> ![](https://av-eks-lekhak.s3.amazonaws.com/media/__sized__/article_images/36bb334b-4393-4c4c-82a8-ecac23d4ac70-thumbnail_webp-600x300.webp)

The McCulloch-Pitts ($MCP$) neuron model, proposed in 1943 by Warren McCulloch and Walter Pitts, is widely regarded as the starting point of Artificial Neural Networks ($ANNs$). It provides the first computational and mathematical abstraction of a biological neuron and shows that networks of these simple units can implement any Boolean logic function. A finite network of such units is equivalent in power to a finite automaton; matching a Turing Machine additionally requires access to unbounded external memory (a tape).

In **1943**, Warren **McCulloch** (neuroscientist) and Walter **Pitts** (logician) asked a radical question:

> *`Can the activity of biological neurons be described using formal logic and mathematics?`*

Their answer was the **first mathematical model of a neuron**, published in:

> *“A Logical Calculus of the Ideas Immanent in Nervous Activity”*

This paper did **three revolutionary things**: 
   1. Reduced neurons to logical units
   2. Showed networks of neurons can compute logical functions
   3. Connected neuroscience with computation and logic

#### **Biological assumptions behind the $MCP$ neuron:** 

They made **`strong simplifying assumptions`** to make the problem tractable:   
   1. Neurons are either **`active or inactive`** (`binary`)
   2. Synapses are either **`excitatory or inhibitory`**
   3. All excitatory inputs contribute equally
   4. Inhibitory inputs veto firing completely
   5. A neuron fires if total excitation reaches or exceeds a threshold

**These assumptions are biologically crude, but mathematically powerful.** 

**Step 1: Represent neural activity mathematically:**

**Binary neuron state:**

A neuron’s output is either:   
> $y \in \{0,1\}$

* $1$ = neuron fires
* $0$ = neuron does not fire

Inputs are also binary:

> $x_i \in \{0,1\}$

This corresponds to the idea that neurons either spike or not in a small time window.

**Step 2: Represent synapses as weights:**

They assigned a **`weight`** to each input connection:

* Excitatory synapses: positive contribution
* Inhibitory synapses: negative contribution (or veto)

Let:

> $w_i \in \{+1, -\infty\}$

(In practice, inhibition was treated as a logical veto rather than a numeric value.)

**Step 3: Summation of inputs (integration):**

The neuron computes the total excitation:

> $S = \sum_i w_i x_i$ 

This reflects biological integration at the soma.

**Step 4: Threshold decision (firing rule):**

The neuron fires if the total excitation reaches or exceeds a threshold $\theta$:

> $y =
\begin{cases}
1 & \text{if } S \ge \theta \\
0 & \text{otherwise}
\end{cases}$

This is a **Heaviside step function** (with the convention $H(0) = 1$, so that $S = \theta$ counts as firing):

> $y = H\left(\sum_i w_i x_i - \theta\right)$

This is the full $MCP$ neuron rule.
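
As a rough illustration, the rule above can be sketched in a few lines of Python. The function names `heaviside` and `mcp_neuron` are illustrative choices, not taken from the original paper, and the sketch assumes the convention $H(0) = 1$ so that it agrees with the condition $S \ge \theta$.

```python
from typing import Sequence


def heaviside(z: float) -> int:
    """Step function with the convention H(0) = 1 (assumed here)."""
    return 1 if z >= 0 else 0


def mcp_neuron(x: Sequence[int], w: Sequence[float], theta: float) -> int:
    """Return 1 if the weighted sum of the binary inputs reaches theta, else 0."""
    if len(x) != len(w):
        raise ValueError("x and w must have the same length")
    s = sum(wi * xi for wi, xi in zip(w, x))  # S = sum_i w_i x_i
    return heaviside(s - theta)               # y = H(S - theta)


# Hypothetical inputs: three excitatory synapses, threshold 2
print(mcp_neuron([1, 0, 1], [1, 1, 1], theta=2))  # S = 2 -> prints 1
print(mcp_neuron([1, 0, 0], [1, 1, 1], theta=2))  # S = 1 -> prints 0
```

Keeping the step function separate from the summation mirrors the two stages in the text: integration first, then the all-or-none decision.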

#### **Logical interpretation (key insight):**

**McCulloch and Pitts showed that:**

* A neuron with suitable weights and threshold can implement **$AND$**, **$OR$**, and **$NOT$**
* Networks of such neurons can implement **any Boolean function**

**Example: $AND$ gate:**

Let:    
   * $x_1, x_2 \in \{0,1\}$
   * $w_1 = w_2 = 1$
   * $\theta = 2$

Then:   
> $y = H(x_1 + x_2 - 2)$ 

Only fires if both inputs are $1$.
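
The same check can be run over every input combination. The sketch below is only an illustration: the helper `fires` is a hypothetical name, and the $OR$ threshold ($\theta = 1$) and the $NOT$ encoding (weight $-1$, threshold $0$) are common textbook choices assumed here rather than values given above.

```python
from itertools import product


def fires(inputs, weights, theta):
    """MCP rule: output 1 when the weighted input sum reaches theta."""
    return int(sum(w * x for w, x in zip(weights, inputs)) >= theta)


def AND(x1, x2):
    return fires((x1, x2), (1, 1), theta=2)   # values from the example above


def OR(x1, x2):
    return fires((x1, x2), (1, 1), theta=1)   # assumed threshold


def NOT(x):
    # Assumption: inhibition modelled numerically as weight -1, threshold 0
    return fires((x,), (-1,), theta=0)


# Enumerate the full truth table for the two-input gates
for x1, x2 in product((0, 1), repeat=2):
    print(x1, x2, "AND:", AND(x1, x2), "OR:", OR(x1, x2))
print("NOT 0:", NOT(0), "NOT 1:", NOT(1))
```

Only the threshold differs between $AND$ and $OR$: the weights stay the same, and $\theta$ sets how many active inputs are required.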

#### **What the $MCP$ neuron cannot do:**

Despite its power, the $MCP$ neuron has major limitations:

* No learning (weights are fixed)
* No graded outputs
* No time dynamics
* No noise
* No plasticity
* No spike timing

It is a **`logical neuron`**, not a learning neuron.

----

## **Mathematical Derivation:**

In the early 1940s, scientists did not yet have a formal link between:

* **Biology** (neurons in the brain)
* **Logic** (Boolean reasoning)
* **Computation** (machines that calculate)

McCulloch (a neuroscientist) and Pitts (a logician) asked a profound question:

> **`Can the activity of neurons be described using mathematics and logic?`**

Their goal was **`not`** to build a learning machine, but to establish a theoretical connection between these three fields.

They began with **simplified biological facts**, deliberately ignoring complexity to isolate computation:

1. Neurons communicate via **discrete spikes**
2. A neuron either fires or does not fire in a short time window
3. Multiple inputs converge onto a neuron
4. Some inputs excite, others inhibit
5. Neurons fire only when input exceeds a threshold

These observations motivated a **`binary, logical abstraction`**.

**1. First Abstraction: Binary Neural Activity:**   

**Biological intuition:**      
   * A neuron either emits a spike or it does not
   * This can be approximated as a binary state

**Mathematical representation:** Define neuron activity as: 

> $y \in \{0,1\}$

Where:   
   * $1$ = neuron fires
   * $0$ = neuron does not fire

Similarly, each input neuron:

> $x_i \in \{0,1\}$

This is the **`starting foundation`**.

**2. Second Abstraction: Synapses as Fixed Connections:**

**Biological intuition:**   
   * Neurons connect through synapses
   * Some synapses increase firing likelihood (excitatory)
   * Others prevent firing (inhibitory)

**Mathematical representation:** Each input has a **fixed weight**:

> $w_i$ 

* Excitatory synapse → positive weight
* Inhibitory synapse → strong negative effect (conceptually a veto)

At this stage:    
   * **Weights do not change**
   * There is **no learning**

**3. Third Abstraction: Neural Integration as Summation:**

**Biological intuition:** The neuron’s soma **collects and integrates** all incoming signals.

**Mathematical necessity:** The simplest way to combine multiple binary inputs is **addition**:

> $S = \sum_i w_i x_i$ 

This sum represents total excitation.

This step is not arbitrary. Summation is the **simplest operation** that:   
   * Counts active inputs
   * Allows unequal influence via weights
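
A small numeric sketch of both points, with made-up input values: under unit weights the sum simply counts the active inputs, while unequal weights (a generalization beyond the equal-contribution assumption of the original model) let some inputs matter more. The name `total_excitation` and all values are hypothetical.

```python
def total_excitation(x, w):
    """Compute S = sum_i w_i * x_i for binary inputs x and fixed weights w."""
    return sum(wi * xi for wi, xi in zip(w, x))


x = [1, 0, 1, 1]          # hypothetical binary input pattern
unit_w = [1, 1, 1, 1]     # equal excitatory weights
mixed_w = [1, 1, 2, -1]   # hypothetical unequal weights, one of them negative

count_active = total_excitation(x, unit_w)  # 1 + 0 + 1 + 1
weighted = total_excitation(x, mixed_w)     # 1*1 + 1*0 + 2*1 + (-1)*1

print(count_active)  # 3, the number of active inputs
print(weighted)      # 2
```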

**4. Fourth Abstraction: Threshold-Based Firing:**

**Biological intuition:**   
Neurons fire only if their membrane potential crosses a threshold.

**Mathematical formulation:**  
Introduce a threshold parameter ( $\theta$ ):

> $y =
\begin{cases}
1 & \text{if } \sum_i w_i x_i \ge \theta \\
0 & \text{otherwise}
\end{cases}$ 

This is a **Heaviside step function** (taking $H(0) = 1$, so the neuron fires when the sum equals $\theta$):

> $y = H\left(\sum_i w_i x_i - \theta\right)$ 

This completes the mathematical neuron.

**Handling inhibition (important historical detail):**

In McCulloch–Pitts’ original formulation:

* Any active inhibitory input **completely prevents firing**
* This was treated as a logical veto (absolute inhibition), not a numeric subtraction

In modern notation, this is approximated by:

* Assigning large negative weights, or
* Explicit logical constraints

This reflects their logical (not statistical) mindset.
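
The two treatments of inhibition can be compared side by side. This is a minimal sketch under stated assumptions: unit excitatory weights, a non-negative threshold, and a penalty chosen to exceed the largest possible excitation. The function names and the `penalty` parameter are hypothetical.

```python
from itertools import product


def mcp_veto(excitatory, inhibitory, theta):
    """Absolute inhibition: any active inhibitory input blocks firing."""
    if any(inhibitory):
        return 0
    return int(sum(excitatory) >= theta)


def mcp_negative_weight(excitatory, inhibitory, theta, penalty=None):
    """Numeric approximation: each inhibitory input carries a large negative weight."""
    if penalty is None:
        # Assumption: one more than the maximum possible excitation is "large enough"
        penalty = len(excitatory) + 1
    s = sum(excitatory) - penalty * sum(inhibitory)
    return int(s >= theta)


# Compare both rules on every pattern of 3 excitatory and 2 inhibitory inputs
agree = all(
    mcp_veto(e, i, theta) == mcp_negative_weight(e, i, theta)
    for e in product((0, 1), repeat=3)
    for i in product((0, 1), repeat=2)
    for theta in range(0, 4)          # non-negative thresholds only
)
print(agree)  # True
```

The agreement depends on the assumptions: with a negative threshold, or a penalty that is too small, the numeric version could fire even while an inhibitory input is active.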

**Final McCulloch–Pitts neuron equation:**

Putting all abstractions together:

> $\boxed{
y = H\left(\sum_i w_i x_i - \theta\right)
}$ 

Where:   
   * Inputs are binary
   * Weights are fixed
   * Output is binary
   * Threshold enforces all-or-none firing

**Logical interpretation (the key breakthrough):**

McCulloch and Pitts proved that:

* A single neuron can implement basic logic gates:
  * AND
  * OR
  * NOT
* Networks of such neurons can compute **any Boolean function**

**Example: AND gate:**

Let:   
   * $w_1 = w_2 = 1$
   * $\theta = 2$

Then:

> $y = H(x_1 + x_2 - 2)$   

This fires **only when both inputs are 1**.

This result connected:   
> **Neural activity ↔ logic ↔ computation**
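
To make the step from single units to networks concrete, here is a hedged sketch of $XOR$, a function that no single threshold unit can compute but that a two-layer network of $MCP$ units can. The wiring shown is one common construction, not necessarily the one used in the original paper, and the helper name `unit` is hypothetical.

```python
def unit(excitatory, inhibitory, theta):
    """One MCP unit with unit excitatory weights and absolute inhibition."""
    if any(inhibitory):
        return 0
    return int(sum(excitatory) >= theta)


def xor(x1, x2):
    # Layer 1: each hidden unit detects "one input on, the other off"
    h1 = unit([x1], [x2], theta=1)   # x1 AND NOT x2
    h2 = unit([x2], [x1], theta=1)   # x2 AND NOT x1
    # Layer 2: OR of the two hidden units
    return unit([h1, h2], [], theta=1)


for a in (0, 1):
    for b in (0, 1):
        print(a, b, "->", xor(a, b))
```

Each hidden unit uses the inhibitory veto to exclude the case where both inputs are active, and the output unit combines the two remaining cases.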

**What this model intentionally ignores:**

The MCP neuron is **not biologically realistic**. It ignores:    
   * Learning and plasticity
   * Spike timing
   * Noise and stochasticity
   * Continuous firing rates
   * Temporal dynamics
   * Structural changes

These were intentionally excluded to focus on **`computational capability`**.

**Why this model was revolutionary:**

The McCulloch–Pitts neuron:   
   1. Was the **first mathematical model of a neuron**
   2. Showed brains could be understood computationally
   3. Laid foundations for:   
      * Perceptrons
      * Neural networks
      * Computational neuroscience
      * Theoretical computer science

It preceded learning rules but **made learning possible later**.

**Historical progression (important context):**

| Year  | Contribution                         |
| ----- | ------------------------------------ |
| 1943  | McCulloch–Pitts neuron (computation) |
| 1949  | Hebbian learning (plasticity)        |
| 1957  | Perceptron (learning + neuron)       |
| 1980s | Backpropagation                      |
| 2010s | Deep learning                        |


> **The McCulloch–Pitts neuron is a logically derived mathematical abstraction that models a neuron as a binary threshold unit, establishing the first formal bridge between biological neurons and computation.**