In [3]:
import torch
torch.__version__

'2.8.0+cu126'

In [4]:
# Creating a scalar
scalar = torch.tensor(7)
scalar

tensor(7)

In [5]:
print(scalar.item())

7


In [7]:
# Create a 1D vector
vector = torch.tensor([1.0,2,3,4])
vector

tensor([1., 2., 3., 4.])

In [8]:
vector.dtype

torch.float32

In [9]:
# Create a 2D tensor/matrix
matrix = torch.tensor([[1,2,3], [4,5,6]])
matrix

tensor([[1, 2, 3],
        [4, 5, 6]])

In [10]:
print(f'dim={matrix.ndim}, shape={matrix.shape}')

dim=2, shape=torch.Size([2, 3])


In [11]:
# print matrix
print(matrix.numpy())

[[1 2 3]
 [4 5 6]]


In [12]:
print(vector.dtype)

torch.float32


In [13]:
# Creating random tensors
randImage = torch.rand(size=(3,4,2))
randImage

tensor([[[0.5384, 0.8940],
         [0.8295, 0.4343],
         [0.8502, 0.5893],
         [0.7136, 0.9822]],

        [[0.2743, 0.1570],
         [0.9726, 0.2717],
         [0.5878, 0.5069],
         [0.4627, 0.3829]],

        [[0.7740, 0.8697],
         [0.9614, 0.5312],
         [0.3530, 0.1222],
         [0.4242, 0.7020]]])

In [14]:
print(f'Shape: {randImage.shape}, ndim: {randImage.ndim}, height: {randImage.shape[0]}, width: {randImage.shape[1]}, depth: {randImage.shape[2]}')

Shape: torch.Size([3, 4, 2]), ndim: 3, height: 3, width: 4, depth: 2


In [19]:
# Creating tensor of 1's
ones = torch.ones(size=(2,4))
ones

tensor([[1., 1., 1., 1.],
        [1., 1., 1., 1.]])

In [16]:
# Creating tensors of 0's
zeros = torch.zeros(size=(2,3))
zeros

tensor([[0., 0., 0.],
        [0., 0., 0.]])

In [18]:
# Creating ones and zeros to match an existing tensor's shape
ones_like = torch.ones_like(zeros)
ones_like

tensor([[1., 1., 1.],
        [1., 1., 1.]])

In [20]:
zeros_like = torch.zeros_like(ones)
zeros_like

tensor([[0., 0., 0., 0.],
        [0., 0., 0., 0.]])
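
The `*_like` functions copy more than the shape. As a small sketch (the variable names below are illustrative and not part of the cells above), the result also takes the dtype of the source tensor unless a `dtype` is passed explicitly:

```python
import torch

# Source tensor with a non-default dtype (illustrative name)
int_source = torch.zeros(size=(2, 3), dtype=torch.int16)

# ones_like copies the shape and dtype of the source tensor
ones_from_int = torch.ones_like(int_source)
print(ones_from_int.shape, ones_from_int.dtype)

# Passing dtype explicitly overrides the inherited dtype
ones_as_float = torch.ones_like(int_source, dtype=torch.float32)
print(ones_as_float.dtype)
```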

In [21]:
# arange() and reshape()
aRange = torch.arange(start=0.1, end=5.0, step=0.2)
aRange

tensor([0.1000, 0.3000, 0.5000, 0.7000, 0.9000, 1.1000, 1.3000, 1.5000, 1.7000,
        1.9000, 2.1000, 2.3000, 2.5000, 2.7000, 2.9000, 3.1000, 3.3000, 3.5000,
        3.7000, 3.9000, 4.1000, 4.3000, 4.5000, 4.7000, 4.9000])

In [22]:
matrix = aRange.reshape(shape=(5,5))
matrix

tensor([[0.1000, 0.3000, 0.5000, 0.7000, 0.9000],
        [1.1000, 1.3000, 1.5000, 1.7000, 1.9000],
        [2.1000, 2.3000, 2.5000, 2.7000, 2.9000],
        [3.1000, 3.3000, 3.5000, 3.7000, 3.9000],
        [4.1000, 4.3000, 4.5000, 4.7000, 4.9000]])

In [23]:
import numpy as np

In [25]:
# Creating Tensors from Numpy arrays
npArray = np.arange(1, 21, dtype=np.float32)
npArray

array([ 1.,  2.,  3.,  4.,  5.,  6.,  7.,  8.,  9., 10., 11., 12., 13.,
       14., 15., 16., 17., 18., 19., 20.], dtype=float32)

In [26]:
# Reshape to (5,4)
npArray = npArray.reshape((5, -1))
print(npArray.shape)
npArray

(5, 4)


array([[ 1.,  2.,  3.,  4.],
       [ 5.,  6.,  7.,  8.],
       [ 9., 10., 11., 12.],
       [13., 14., 15., 16.],
       [17., 18., 19., 20.]], dtype=float32)

In [28]:
torchArray = torch.from_numpy(npArray)
torchArray

tensor([[ 1.,  2.,  3.,  4.],
        [ 5.,  6.,  7.,  8.],
        [ 9., 10., 11., 12.],
        [13., 14., 15., 16.],
        [17., 18., 19., 20.]])
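
One detail worth knowing about `torch.from_numpy()`: the returned tensor shares memory with the NumPy array, so changing one changes the other. The sketch below uses illustrative names (`np_values`, `shared`, `copied`) and contrasts it with `torch.tensor()`, which copies the data:

```python
import numpy as np
import torch

np_values = np.arange(1, 5, dtype=np.float32)

shared = torch.from_numpy(np_values)  # shares memory with np_values
copied = torch.tensor(np_values)      # makes an independent copy

# Modify the NumPy array in place
np_values[0] = 100.0

print(shared)  # first element reflects the change
print(copied)  # first element is unchanged
```

If the tensor should stay independent of the array, copying (for example with `torch.tensor()` or `.clone()`) is the safer choice.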

In [29]:
# Casting to integer
torchArray = torchArray.type(torch.int16)
torchArray

tensor([[ 1,  2,  3,  4],
        [ 5,  6,  7,  8],
        [ 9, 10, 11, 12],
        [13, 14, 15, 16],
        [17, 18, 19, 20]], dtype=torch.int16)

In [30]:
print(torchArray.numpy())

[[ 1  2  3  4]
 [ 5  6  7  8]
 [ 9 10 11 12]
 [13 14 15 16]
 [17 18 19 20]]


## Tensor datatypes

There are many different [tensor datatypes available in PyTorch](https://pytorch.org/docs/stable/tensors.html#data-types). Some are specific for CPU and some are better for GPU. Getting to know which is which can take some time.

Generally if you see `torch.cuda` anywhere, the tensor is being used for GPU (since Nvidia GPUs use a computing toolkit called CUDA).

The most common type (and generally the default) is `torch.float32` or `torch.float`, referred to as "32-bit floating point". There are also 16-bit floating point (`torch.float16` or `torch.half`) and 64-bit floating point (`torch.float64` or `torch.double`). And to confuse things even more, there are also 8-bit, 16-bit, 32-bit and 64-bit integers.


In [31]:
# Default datatype for floating point tensors is float32
float_32_tensor = torch.tensor([3.,6.,9.],
                               dtype=None, # defaults to None, so the dtype is inferred from the data (float32 for Python floats)
                               device=None, # defaults to None, which uses the current default device (CPU here)
                               requires_grad=False # if True, operations performed on the tensor are recorded for autograd
                               )
float_32_tensor.shape, float_32_tensor.dtype, float_32_tensor.device

(torch.Size([3]), torch.float32, device(type='cpu'))

In [32]:
float_16_tensor = torch.tensor([3.,6.,9.],
                               dtype = torch.float16)
float_16_tensor.dtype

torch.float16
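
Mixing datatypes is a common source of surprises. As a minimal sketch (tensor names are illustrative), multiplying a `float16` tensor by a `float32` tensor promotes the result to the wider type, and `torch.finfo()` shows how much precision each type offers:

```python
import torch

f32 = torch.tensor([3., 6., 9.])   # float32 by default
f16 = f32.to(torch.float16)        # cast to 16-bit floating point

# Type promotion: the result takes the wider of the two dtypes
result = f16 * f32
print(result.dtype)

# Machine epsilon: smaller means more precision
print(torch.finfo(torch.float16).eps)
print(torch.finfo(torch.float32).eps)
```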

## Getting information from tensors

Once you've created tensors (or someone else or a PyTorch module has created them for you), you might want to get some information from them.

We've seen these before but three of the most common attributes you'll want to find out about tensors are:
* `shape` - what shape is the tensor? (some operations require specific shape rules)
* `dtype` - what datatype are the elements within the tensor stored in?
* `device` - what device is the tensor stored on? (usually GPU or CPU)

Let's create a random tensor and find out details about it.
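
A minimal sketch of such a cell, assuming a tensor named `some_tensor` with shape `(3, 4)` (both the name and the shape are illustrative choices):

```python
import torch

# Create a random tensor (values differ on every run)
some_tensor = torch.rand(size=(3, 4))

# Inspect the three most commonly needed attributes
print(some_tensor)
print(f"Shape of tensor: {some_tensor.shape}")
print(f"Datatype of tensor: {some_tensor.dtype}")
print(f"Device tensor is stored on: {some_tensor.device}")
```

With PyTorch's default settings, the datatype is `torch.float32` and the device is `cpu`.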

## Manipulating tensors (tensor operations)

In deep learning, data (images, text, video, audio, protein structures, etc.) gets represented as tensors. A model learns by investigating those tensors and performing a series of operations (possibly millions or more) on them to create a representation of the patterns in the input data.

These operations are often a wonderful dance between:
* Addition
* Subtraction
* Multiplication (element-wise)
* Division
* Matrix multiplication

And that's it. Sure there are a few more here and there but these are the basic building blocks of neural networks.

In [33]:
# Create a tensor of values and add a number to it
tensor = torch.tensor([1,2,3])
tensor1 = tensor + 10
tensor1

tensor([11, 12, 13])

In [34]:
# Multiply by 10
tensor2 = tensor * 10
tensor2

tensor([10, 20, 30])

In [35]:
tensor - 10

tensor([-9, -8, -7])

PyTorch also has a bunch of built-in functions like [`torch.mul()`](https://pytorch.org/docs/stable/generated/torch.mul.html#torch.mul) (short for multiplication) and [`torch.add()`](https://pytorch.org/docs/stable/generated/torch.add.html) to perform basic operations.

In [37]:
torch.multiply(tensor,10)

tensor([10, 20, 30])
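
`torch.multiply()` used above is an alias of `torch.mul()`. The following sketch (with an illustrative tensor named `values`) checks that the function forms and the operator forms give the same results:

```python
import torch

values = torch.tensor([1, 2, 3])

added = torch.add(values, 10)        # same as values + 10
multiplied = torch.mul(values, 10)   # same as values * 10

print(torch.equal(added, values + 10))
print(torch.equal(multiplied, values * 10))
print(torch.equal(torch.multiply(values, 10), multiplied))
```

The operator forms are usually easier to read; the function forms are handy when extra arguments such as `out=` are needed.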

In [38]:
# Element wise multiplication
print(tensor,'*',tensor)
print('Equals:', tensor * tensor)

tensor([1, 2, 3]) * tensor([1, 2, 3])
Equals: tensor([1, 4, 9])


### Matrix multiplication (is all you need)

One of the most common operations in machine learning and deep learning algorithms (like neural networks) is [matrix multiplication](https://www.mathsisfun.com/algebra/matrix-multiplying.html).

PyTorch implements matrix multiplication in the [`torch.matmul()`](https://pytorch.org/docs/stable/generated/torch.matmul.html) function, which is also available as the tensor method `.matmul()` and the `@` operator.



In [39]:
A = torch.arange(1,7, dtype=torch.float32).reshape((3,-1))

In [40]:
B = torch.rand((2,4))
print(f'B={B}\n shape = {B.shape}')

B=tensor([[0.9131, 0.6379, 0.0664, 0.5794],
        [0.1248, 0.1969, 0.1725, 0.6159]])
 shape = torch.Size([2, 4])


In [41]:
torch.matmul(A,B)

tensor([[1.1627, 1.0317, 0.4114, 1.8113],
        [3.2385, 2.7013, 0.8893, 4.2020],
        [5.3142, 4.3709, 1.3671, 6.5927]])

In [42]:
A.matmul(B)

tensor([[1.1627, 1.0317, 0.4114, 1.8113],
        [3.2385, 2.7013, 0.8893, 4.2020],
        [5.3142, 4.3709, 1.3671, 6.5927]])

In [43]:
A@B

tensor([[1.1627, 1.0317, 0.4114, 1.8113],
        [3.2385, 2.7013, 0.8893, 4.2020],
        [5.3142, 4.3709, 1.3671, 6.5927]])
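
Matrix multiplication only works when the inner dimensions match: `(3, 2) @ (2, 4)` gives `(3, 4)`. The sketch below rebuilds `A` and `B` with the same shapes as above (the values of `B` are random, so they will differ) and shows what happens when the order is swapped:

```python
import torch

A = torch.arange(1, 7, dtype=torch.float32).reshape(3, 2)
B = torch.rand(2, 4)

# Inner dimensions match (2 and 2), result takes the outer dimensions
product = A @ B
print(product.shape)

# Each output element is a dot product of a row of A and a column of B
manual_element = torch.dot(A[0], B[:, 0])
print(torch.allclose(product[0, 0], manual_element))

# Swapping the order breaks the rule: (2, 4) @ (3, 2) has inner dims 4 and 3
mismatch_raised = False
try:
    B @ A
except RuntimeError as err:
    mismatch_raised = True
    print(f"Shape mismatch: {err}")

# Transposing both operands makes the shapes line up: (4, 2) @ (2, 3)
print(torch.allclose(B.T @ A.T, product.T))
```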

### Linear Layer

Neural networks are full of matrix multiplications and dot products.

The [`torch.nn.Linear()`](https://pytorch.org/docs/1.9.1/generated/torch.nn.Linear.html) module (we'll see this in action later on), also known as a feed-forward layer or fully connected layer, implements a matrix multiplication between an input `x` and a weights matrix `W`, followed by the addition of a bias `b`.

$$
y = x\cdot W + b
$$
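
A minimal sketch of this formula in code, assuming a layer with 2 input features and 4 output features (sizes chosen only for illustration). Note that PyTorch stores the layer's `weight` with shape `(out_features, in_features)`, so the `W` in the formula corresponds to `weight.T`:

```python
import torch
from torch import nn

torch.manual_seed(42)  # make the random initial weights reproducible

# Hypothetical layer sizes, chosen for illustration
linear = nn.Linear(in_features=2, out_features=4)

x = torch.arange(1, 7, dtype=torch.float32).reshape(3, 2)
y = linear(x)

print(linear.weight.shape)  # stored as (out_features, in_features)
print(y.shape)

# Reproduce the layer by hand: y = x @ W + b with W = weight.T
manual = x @ linear.weight.T + linear.bias
print(torch.allclose(y, manual))
```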
