# Introduction to PyTorch and Tensors

## What is PyTorch?

PyTorch is an open-source deep learning framework developed by Facebook's AI Research lab (now Meta AI).

It is popular among researchers and developers because of its dynamic computational graph, ease of use, and strong community support.

Several large companies, including Facebook and Tesla, currently use PyTorch to develop and deploy models.

In [None]:
# Run shell commands from a notebook cell with the %pip magic or a leading "!"
%pip install --upgrade typing_extensions

In [None]:
!python -c "import torch; print(torch.__version__)"

In [1]:
# Importing PyTorch
import torch
#torch.__version__



## So, what are tensors?

Tensors are a fundamental concept in both mathematics and machine learning, especially in frameworks like PyTorch and TensorFlow. In essence, tensors are multi-dimensional arrays, similar to matrices but generalized to more dimensions. They are used to represent data in various forms and are the basic data structures that machine learning models operate on!


### Creating our first tensors 

Creating tensors in PyTorch is straightforward and flexible. PyTorch provides multiple ways to create tensors, whether you're starting with raw data, generating them programmatically, or creating them with specific properties like all zeros or ones. Let's look at some of those ways:

In [2]:
# Here we create our first tensor
scalar = torch.tensor(8)
scalar

tensor(8)

In [3]:
# You will note that it has 0 dimensions, which makes it a scalar!
scalar.ndim

0

In [55]:
scalar.shape

torch.Size([])

In [4]:
scalar.dtype

torch.int64

In [5]:
# Vector
vector = torch.tensor([1, 2])
vector

tensor([1, 2])

In [6]:
# Our Vector has 1 dimension
vector.ndim

1

In [7]:
# Checking the shape of our vector
vector.shape

torch.Size([2])

In [8]:
# Matrix
matrix = torch.tensor([[7, 8], 
                       [9, 10]])
matrix

tensor([[ 7,  8],
        [ 9, 10]])

In [9]:
# Check number of dimensions
matrix.ndim

2

In [10]:
matrix.shape

torch.Size([2, 2])

In [65]:
# Creating a 3-dimensional tensor
tensor1 = torch.tensor([[[1, 2, 3],
                         [4, 5, 6],
                         [7, 8, 9]]])
tensor1

tensor([[[1, 2, 3],
         [4, 5, 6],
         [7, 8, 9]]])

In [12]:
# Check number of dimensions of tensor1
tensor1.ndim

3

In [56]:
# Index into the first (and only) 3x3 block, then take its first row
tensor1[0][0]

tensor([1, 2, 3])

In [66]:
tensor1.size()

torch.Size([1, 3, 3])

In [57]:
# Check shape of tensor1 (Python names are case-sensitive, so TENSOR1 would raise a NameError)
tensor1.shape

torch.Size([1, 3, 3])


### General Interpretation of a 3D Tensor

A 3D tensor has three dimensions, which are often interpreted as follows:

#### First Dimension (Depth, Batch, or Number of Samples):
- This dimension usually represents the number of distinct elements or samples in the tensor.
- For example, in a batch of images, this dimension might represent the batch size (number of images in the batch). If the tensor represents sequences (like in NLP tasks), this might represent the number of sequences.
- **Example:** If the shape is `(10, 3, 224)`, the first dimension (`10`) might represent 10 samples, such as 10 images or 10 sequences.

#### Second Dimension (Height, Channel, or Features):
- This dimension often represents a secondary characteristic of each sample, such as the number of channels in an image (e.g., RGB channels), the height of an image, or the number of features in a dataset.
- For instance, in an image, this could be the number of channels (such as 3 for RGB images). In sequence data, it could be the number of features at each time step.
- **Example:** If the shape is `(10, 3, 224)`, the second dimension (`3`) might represent 3 color channels (Red, Green, Blue) in an image.

#### Third Dimension (Width, Sequence Length, or Time Steps):
- The third dimension often represents the size along another axis, such as the width of an image, the length of a sequence, or the number of time steps in time-series data.
- **Example:** If the shape is `(10, 3, 224)`, the third dimension (`224`) might represent 224 pixels in the width of each image.
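
To make the interpretation above concrete, here is a minimal sketch that builds a tensor with the example shape `(10, 3, 224)`. The variable names (`batch`, `first_sample`) and the meaning given to each axis are illustrative assumptions; PyTorch itself attaches no meaning to the dimensions.

```python
import torch

# Hypothetical batch: 10 samples, 3 channels, 224 values per channel
batch = torch.rand(size=(10, 3, 224))

# Unpack the shape into one name per dimension
num_samples, num_channels, width = batch.shape
print(num_samples, num_channels, width)

# Indexing the first dimension selects a single sample of shape (3, 224)
first_sample = batch[0]
print(first_sample.shape)
```

Indexing along the first dimension removes it, which is why one sample has only two dimensions left.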


### After creating some tensors using lists, let's create a random one

In [15]:
# Create a random tensor of size (3, 3)
random_tensor = torch.rand(size=(3, 3))
random_tensor, random_tensor.dtype

(tensor([[0.0808, 0.6364, 0.1270],
         [0.1542, 0.8252, 0.0930],
         [0.9542, 0.0133, 0.9310]]),
 torch.float32)

In [16]:
# Create a random tensor of size (224, 224, 3)
random_image_size_tensor = torch.rand(size=(224, 224, 3))
print(random_image_size_tensor.shape)
print(random_image_size_tensor.ndim)
random_image_size_tensor

torch.Size([224, 224, 3])
3


tensor([[[0.8023, 0.6207, 0.2072],
         [0.3877, 0.2163, 0.4232],
         [0.7206, 0.6265, 0.6526],
         ...,
         [0.9757, 0.8268, 0.4453],
         [0.7634, 0.0160, 0.3627],
         [0.1130, 0.7588, 0.4059]],

        [[0.4723, 0.3214, 0.6312],
         [0.6828, 0.1845, 0.2029],
         [0.9471, 0.5140, 0.1541],
         ...,
         [0.9398, 0.0582, 0.3742],
         [0.7664, 0.3659, 0.7184],
         [0.6649, 0.2767, 0.5236]],

        [[0.5273, 0.2737, 0.6709],
         [0.7114, 0.5359, 0.7940],
         [0.1410, 0.6360, 0.8520],
         ...,
         [0.8002, 0.0542, 0.7108],
         [0.5038, 0.5085, 0.4542],
         [0.3839, 0.0114, 0.3596]],

        ...,

        [[0.5887, 0.4968, 0.3234],
         [0.5287, 0.8317, 0.7605],
         [0.7856, 0.4349, 0.7196],
         ...,
         [0.7313, 0.3336, 0.4174],
         [0.2900, 0.0526, 0.4169],
         [0.4407, 0.8681, 0.7638]],

        ...

In [70]:
random_image_size_tensor[0][0][1]

tensor(0.6207)

### Using torch.zeros 

In [71]:
# Tensor filled with zeros
zeros = torch.zeros(size=(3, 3))
zeros, zeros.dtype

(tensor([[0., 0., 0.],
         [0., 0., 0.],
         [0., 0., 0.]]),
 torch.float32)

We can create a tensor of all ones in the same way, using [`torch.ones()`](https://pytorch.org/docs/stable/generated/torch.ones.html) instead.

In [72]:
# Tensor filled with ones
ones = torch.ones(size=(3, 3))
ones, ones.dtype

(tensor([[1., 1., 1.],
         [1., 1., 1.],
         [1., 1., 1.]]),
 torch.float32)

### Using torch.arange

In [73]:
# Create a range of values from 0 to 9 (the end value, 10, is excluded)
zero_to_ten = torch.arange(start=0, end=10, step=1)
zero_to_ten

tensor([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])

In [74]:
# We can also create a tensor of zeros with the same shape and dtype as another tensor
ten_zeros = torch.zeros_like(input=zero_to_ten)
ten_zeros

tensor([0, 0, 0, 0, 0, 0, 0, 0, 0, 0])

In [77]:
zeros = torch.zeros(size=(1,10))
zeros

tensor([[0., 0., 0., 0., 0., 0., 0., 0., 0., 0.]])

In [76]:
ten_zeros1 = torch.zeros(10)
ten_zeros1

tensor([0., 0., 0., 0., 0., 0., 0., 0., 0., 0.])
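
The creation functions above differ in their default dtype: `torch.zeros()` and `torch.ones()` produce `float32`, while `torch.arange()` with integer arguments and the `*_like` functions follow their inputs. The sketch below illustrates this; the variable names are hypothetical.

```python
import torch

# arange excludes the end value, so this yields 0, 2, 4, 6, 8
evens = torch.arange(start=0, end=10, step=2)

# *_like functions copy both shape and dtype from the input tensor
ones_like_evens = torch.ones_like(evens)

# An explicit dtype overrides the float32 default of torch.zeros
int_zeros = torch.zeros(size=(2, 3), dtype=torch.int64)

print(evens)
print(ones_like_evens.dtype)
print(int_zeros.dtype)
```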

### Basic Tensor operations

Here we look at some basic tensor operations to become familiar with tensor manipulation, since neural networks are built almost entirely out of them.

In [23]:
# Create a tensor of values and add a number to it
tensor = torch.tensor([1, 2, 3])
tensor + 10

tensor([11, 12, 13])

In [24]:
# Multiply it by 10
tensor * 10

tensor([10, 20, 30])

In [25]:
# Tensors don't change unless reassigned
tensor

tensor([1, 2, 3])

In [26]:
# Subtracting and reassigning the result to the same variable
tensor = tensor - 10
tensor

tensor([-9, -8, -7])

In [27]:
# Adding and reassigning the result to the same variable
tensor = tensor + 10
tensor

tensor([1, 2, 3])

In [28]:
# Can also use torch functions
torch.multiply(tensor, 10)

tensor([10, 20, 30])

In [29]:
# Original tensor is still unchanged 
tensor

tensor([1, 2, 3])

However, it's more common to use operator symbols like `*` instead of `torch.mul()` (`torch.multiply()` is an alias for it).

In [30]:
# Element-wise multiplication (each element multiplies its equivalent, index 0->0, 1->1, 2->2)
print(tensor, "*", tensor)
print("Equals:", tensor * tensor)

tensor([1, 2, 3]) * tensor([1, 2, 3])
Equals: tensor([1, 4, 9])
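
Element-wise operations also work between tensors of different shapes when those shapes can be broadcast together, and PyTorch offers in-place variants that do modify the original tensor. The following is a small sketch of both; the names `grid`, `row`, and `counter` are made up for illustration.

```python
import torch

grid = torch.tensor([[1, 2, 3],
                     [4, 5, 6]])
row = torch.tensor([10, 20, 30])

# row has shape (3,), so it is broadcast across both rows of the (2, 3) tensor
added = grid + row
print(added)

# Methods ending in an underscore operate in place and change the tensor itself
counter = torch.tensor([1, 2, 3])
counter.add_(10)
print(counter)
```

In-place methods avoid allocating a new tensor, but reassignment (`tensor = tensor + 10`) is usually easier to reason about when learning.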


### Matrix Multiplication: A Brief Overview

Matrix multiplication involves multiplying two matrices to produce a third matrix. Given matrices $A$ (size $m \times n$) and $B$ (size $n \times p$), the resulting matrix $C = A \times B$ has dimensions $m \times p$. Each element $c_{ij}$ in $C$ is computed as:

$$
c_{ij} = \sum_{k=1}^{n} A_{ik} \times B_{kj}
$$

**Example:**

For matrices $A$ and $B$:

$$
A = \begin{bmatrix} 1 & 2 \\ 3 & 4 \end{bmatrix}, \quad B = \begin{bmatrix} 5 & 6 \\ 7 & 8 \end{bmatrix}
$$

The product $C = A \times B$ is:

$$
C = \begin{bmatrix} 19 & 22 \\ 43 & 50 \end{bmatrix}
$$

**Key Points:**
- Matrix multiplication is not commutative: in general, $A \times B \neq B \times A$.
- It is associative and distributive over addition.

In PyTorch, you can perform matrix multiplication with `torch.mm(A, B)` or the `@` operator.
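
As a quick check of the worked example above, this sketch reproduces $C$ in PyTorch, computes the top-left entry by hand from the summation formula, and confirms that swapping the operands gives a different result.

```python
import torch

A = torch.tensor([[1, 2], [3, 4]])
B = torch.tensor([[5, 6], [7, 8]])

# Top-left entry from the formula: c_11 = 1*5 + 2*7
c_11 = A[0, 0] * B[0, 0] + A[0, 1] * B[1, 0]

C = torch.mm(A, B)
print(c_11)
print(C)

# torch.mm and the @ operator agree for 2-D tensors
print(torch.equal(C, A @ B))

# Swapping the operands changes the result (not commutative)
print(torch.equal(A @ B, B @ A))
```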


In [31]:
import torch
tensor = torch.tensor([1, 2, 3])
tensor.shape

torch.Size([3])

In [32]:
# Element-wise multiplication
tensor * tensor

tensor([1, 4, 9])

In [78]:
# torch.mm() only accepts 2-D tensors, so passing two 1-D tensors raises an error
torch.mm(tensor, tensor)

RuntimeError: self must be a matrix

In [79]:
# The "@" operator (torch.matmul) accepts 1-D tensors and returns their dot product
tensor @ tensor

tensor(14)

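Matrix multiplication only works when the inner dimensions match: a `(3, 2)` tensor cannot be multiplied by another `(3, 2)` tensor, but it can be multiplied by a `(2, 3)` tensor. For two tensors of the same non-square shape, such as the `tensor_A` and `tensor_B` created below, we can make the inner dimensions match.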

One of the ways to do this is with a **transpose** (switch the dimensions of a given tensor).

You can perform transposes in PyTorch using either:
* `torch.transpose(input, dim0, dim1)` - where `input` is the desired tensor to transpose and `dim0` and `dim1` are the dimensions to be swapped.
* `tensor.T` - where `tensor` is the desired tensor to transpose.

Let's try the latter.

In [36]:
# Create tensor_A and tensor_B, both of shape (3, 2)
tensor_A = torch.tensor([[7, 8],
                         [9, 10],
                         [11, 12]])
tensor_B = torch.tensor([[7, 8],
                         [9, 10],
                         [11, 12]])

In [37]:
# View tensor_A and tensor_B
print(tensor_A)
print(tensor_B)

tensor([[ 7,  8],
        [ 9, 10],
        [11, 12]])
tensor([[ 7,  8],
        [ 9, 10],
        [11, 12]])


In [86]:
# View tensor_A and tensor_B.T
print(tensor_A)
print(tensor_B.T)

tensor([[ 7,  8],
        [ 9, 10],
        [11, 12]])
tensor([[ 7,  9, 11],
        [ 8, 10, 12]])


In [83]:
tensor_B = tensor_B.T
tensor_B

tensor([[ 7,  9, 11],
        [ 8, 10, 12]])

In [90]:
tensor_A * tensor_B # Error: (*) is element-wise, and shapes (3, 2) and (2, 3) cannot be broadcast together

RuntimeError: The size of tensor a (2) must match the size of tensor b (3) at non-singleton dimension 1

In [87]:
tensor_A @ tensor_B

tensor([[113, 143, 173],
        [143, 181, 219],
        [173, 219, 265]])

In [89]:
torch.mm(tensor_A, tensor_B)

tensor([[113, 143, 173],
        [143, 181, 219],
        [173, 219, 265]])
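
The shape rule behind these results can be checked directly. The sketch below uses two hypothetical `(3, 2)` tensors, `left` and `right`, and shows how transposing either operand determines the output shape.

```python
import torch

left = torch.rand(size=(3, 2))
right = torch.rand(size=(3, 2))

# (3, 2) @ (2, 3) -> (3, 3): the inner dimensions (2 and 2) match
outer_shape = torch.matmul(left, right.T).shape

# (2, 3) @ (3, 2) -> (2, 2): the inner dimensions (3 and 3) match
inner_shape = torch.matmul(left.T, right).shape

# For a 2-D tensor, torch.transpose(input, 0, 1) gives the same result as .T
same = torch.equal(torch.transpose(right, 0, 1), right.T)

print(outer_shape, inner_shape, same)
```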

### Exploring our tensor





In [39]:
# Create a tensor
x = torch.arange(0, 100, 10)
x

tensor([ 0, 10, 20, 30, 40, 50, 60, 70, 80, 90])

In [94]:
print(f"Minimum: {x.min()}")
print(f"Maximum: {x.max()}")
print(f"Mean: {x.type(torch.float32).mean()}") # mean() requires a floating-point dtype, so cast first
print(f"Sum: {x.sum()}")

Minimum: 0
Maximum: 90
Mean: 45.0
Sum: 450


In [41]:
print(f"Index where max value occurs: {x.argmax()}")
print(f"Index where min value occurs: {x.argmin()}")

Index where max value occurs: 9
Index where min value occurs: 0


### Tensor Manipulation: Reshaping, Stacking, and Squeezing

#### Reshaping
- **Reshaping** is the process of changing the shape (dimensions) of a tensor without altering its data. This is useful when you need to adjust the structure of a tensor to fit into a model or a specific operation.
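- You can reshape a tensor using `torch.reshape()` or `tensor.view()`. `view()` always returns a tensor that shares memory with the original and requires a compatible memory layout, while `reshape()` returns a view when possible and a copy otherwise.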

In [42]:
# Example
tensor = torch.arange(12)
tensor

tensor([ 0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 11])

In [43]:
reshape_tensor = tensor.reshape(3, 4)  # Reshape to 3x4 matrix
reshape_tensor

tensor([[ 0,  1,  2,  3],
        [ 4,  5,  6,  7],
        [ 8,  9, 10, 11]])

In [44]:
reshaped_tensor = tensor.view(3, 4)  # Same 3x4 result using view()
reshaped_tensor

tensor([[ 0,  1,  2,  3],
        [ 4,  5,  6,  7],
        [ 8,  9, 10, 11]])
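
Two details are worth seeing in action: passing `-1` lets PyTorch infer one dimension, and a view shares memory with the tensor it came from. The sketch below uses a hypothetical tensor named `base`.

```python
import torch

base = torch.arange(12)

# -1 asks PyTorch to infer that dimension (12 elements / 3 rows = 4 columns)
inferred = base.reshape(3, -1)
print(inferred.shape)

# view() shares memory with the original, so writing through the view
# also changes the original tensor
viewed = base.view(3, 4)
viewed[0, 0] = 100
print(base[0])
```

If an independent copy is needed, calling `.clone()` on the result is one way to get it.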

#### Stacking

- **Stacking** involves joining a sequence of tensors along a new dimension. This operation is useful when you want to combine multiple tensors into a single tensor while adding an extra dimension.
- You can stack tensors using `torch.stack()`.

In [45]:
tensor1 = torch.tensor([1, 2, 3])
tensor2 = torch.tensor([4, 5, 6])
stacked_tensor = torch.stack([tensor1, tensor2])  # Shape will be (2, 3)
print(f"Shape of tensor1 is: {tensor1.shape}")
print(f"Shape of tensor2 is: {tensor2.shape}")
print(f"Shape of stacked_tensor is: {stacked_tensor.shape}")
stacked_tensor

Shape of tensor1 is: torch.Size([3])
Shape of tensor2 is: torch.Size([3])
Shape of stacked_tensor is: torch.Size([2, 3])


tensor([[1, 2, 3],
        [4, 5, 6]])
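
`torch.stack()` takes a `dim` argument that controls where the new dimension is inserted. For comparison, `torch.cat()` joins tensors along an existing dimension without adding one. The names `a` and `b` below are illustrative.

```python
import torch

a = torch.tensor([1, 2, 3])
b = torch.tensor([4, 5, 6])

# New dimension at position 0: each input becomes a row, shape (2, 3)
stacked_rows = torch.stack([a, b], dim=0)

# New dimension at position 1: each input becomes a column, shape (3, 2)
stacked_cols = torch.stack([a, b], dim=1)

# cat joins along the existing dimension, so the result stays 1-D, shape (6,)
joined = torch.cat([a, b], dim=0)

print(stacked_rows.shape, stacked_cols.shape, joined.shape)
```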

#### Squeezing

- **Squeezing** removes dimensions of size 1 from a tensor, effectively reducing the dimensionality without altering the data. This is particularly useful for eliminating unnecessary dimensions that can result from certain operations.
- You can squeeze a tensor using `torch.squeeze()`.

In [46]:
tensor = torch.tensor([[[1], [2], [3]]])  # Shape is (1, 3, 1)
print(f"Shape of tensor is: {tensor.shape}")
squeezed_tensor = tensor.squeeze()  # Shape will be (3,)
print(squeezed_tensor)

Shape of tensor is: torch.Size([1, 3, 1])
tensor([1, 2, 3])


In [47]:
# squeeze() only removes dimensions of size 1; a (2, 3) tensor has none, so it is returned unchanged
stacked_tensor_squeezed = stacked_tensor.squeeze()
stacked_tensor_squeezed

tensor([[1, 2, 3],
        [4, 5, 6]])
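
`squeeze()` can also target a single dimension, and `unsqueeze()` does the opposite by inserting a dimension of size 1. The sketch below, using a hypothetical tensor called `column`, shows both.

```python
import torch

column = torch.tensor([[[1], [2], [3]]])  # shape (1, 3, 1)

# Remove only the chosen size-1 dimension
only_first = column.squeeze(dim=0)  # shape (3, 1)
only_last = column.squeeze(dim=2)   # shape (1, 3)

# Remove every size-1 dimension, then add one back at position 0
flat = column.squeeze()            # shape (3,)
restored = flat.unsqueeze(dim=0)   # shape (1, 3)

print(only_first.shape, only_last.shape, flat.shape, restored.shape)
```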

## Indexing (selecting data from tensors)

Sometimes you'll want to select specific data from tensors (for example, only the first column or second row).

To do so, you can use indexing.

If you've ever done indexing on Python lists or NumPy arrays, indexing in PyTorch with tensors is very similar.

In [100]:
# Create a tensor 
#import torch
x = torch.arange(1, 10).reshape(1,3,3)
x, x.shape

(tensor([[[1, 2, 3],
          [4, 5, 6],
          [7, 8, 9]]]),
 torch.Size([1, 3, 3]))

Indexing values goes outer dimension -> inner dimension (check out the square brackets).

In [49]:
# Let's index bracket by bracket
print(f"First square bracket:\n{x[0]}") 
print(f"Second square bracket: {x[0][0]}") 
print(f"Third square bracket: {x[0][0][0]}")

First square bracket:
tensor([[1, 2, 3],
        [4, 5, 6],
        [7, 8, 9]])
Second square bracket: tensor([1, 2, 3])
Third square bracket: 1


You can also use `:` to specify "all values in this dimension" and then use a comma (`,`) to add another dimension.

In [50]:
# Get all values of 0th dimension and the 0 index of 1st dimension
x[:, 0]

tensor([[1, 2, 3]])

In [51]:
# Get all values of 0th & 1st dimensions but only index 1 of 2nd dimension
x[:, :, 1]

tensor([[2, 5, 8]])

In [52]:
# Get all values of the 0th dimension but only index 1 of the 1st and 2nd dimensions
x[:, 1, 1]

tensor([5])

In [53]:
# Get index 0 of 0th and 1st dimension and all values of 2nd dimension 
x[0, 0, :] # same as x[0][0]

tensor([1, 2, 3])

In [54]:
x[0, 0, 2]

tensor(3)
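
Indexing and slicing can be combined to pull out columns or sub-blocks. The following sketch rebuilds the same `(1, 3, 3)` tensor under the hypothetical name `grid` so it can run on its own.

```python
import torch

grid = torch.arange(1, 10).reshape(1, 3, 3)

# Index 0 of the 0th dimension, every row, column index 2 -> the last column
last_column = grid[0, :, 2]

# Slices select ranges: the first two rows and first two columns
top_left = grid[0, :2, :2]

# .item() converts a single-element tensor into a plain Python number
center = grid[0, 1, 1].item()

print(last_column)
print(top_left)
print(center)
```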

### Summary

Tensors are the core data structures in PyTorch, acting as multi-dimensional arrays that store data. They are crucial for performing efficient computations, particularly in deep learning. Key concepts and operations include:

- **Reshaping:** Adjusting the dimensions of a tensor to match model requirements using functions like `view()` or `reshape()`.

- **Stacking:** Combining multiple tensors along a new dimension, useful for creating batches or merging data, using `torch.stack()`.

- **Squeezing:** Removing unnecessary dimensions of size 1 to simplify the tensor's shape with `torch.squeeze()`.

- **Tensor Operations:** Performing mathematical and logical operations on tensors, such as element-wise operations, reductions, and broadcasting, which are fundamental for building and training neural networks.

- **Matrix Multiplication:** A vital operation in neural networks, where two matrices are multiplied to produce a third matrix. This is commonly used in layers like fully connected layers and is performed in PyTorch using `torch.mm()` or the `@` operator.

Understanding these concepts allows you to manipulate tensors effectively, enabling the construction and optimization of deep learning models in PyTorch.
