In [3]:
%matplotlib inline


What is PyTorch?
================

It’s a Python-based scientific computing package targeted at two
audiences:

-  A replacement for NumPy that uses the power of GPUs
-  A deep learning research platform that provides maximum flexibility
   and speed

Getting Started
---------------

Tensors
^^^^^^^

Tensors are similar to NumPy’s ndarrays, with the addition that
Tensors can also be used on a GPU to accelerate computing.



In [4]:
from __future__ import print_function
import torch

Construct a 5x3 matrix, uninitialized:



In [5]:
x = torch.Tensor(5, 3)
print(x)


1.00000e-32 *
  0.0000  0.0000  0.0000
  0.0000  3.4667  0.0000
  3.5191  0.0000  3.5190
  0.0000  3.5191  0.0000
  0.0000  0.0000  0.0000
[torch.FloatTensor of size 5x3]



Construct a randomly initialized matrix:



In [None]:
x = torch.rand(5, 3)
print(x)

Get its size:



In [None]:
print(x.size())

<div class="alert alert-info"><h4>Note</h4><p>``torch.Size`` is a subclass of tuple, so it supports all tuple operations.</p></div>
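As a small illustration of the note above, the sketch below treats the result of ``size()`` like any other tuple. The variable names (``rows``, ``cols``, ``is_tuple``) are illustrative and not part of the original tutorial.

```python
import torch

x = torch.rand(5, 3)
size = x.size()

# torch.Size subclasses tuple, so unpacking, indexing and len() all work
rows, cols = size
is_tuple = isinstance(size, tuple)

print(rows, cols, len(size), is_tuple)
```

Unpacking like this is a convenient way to read off individual dimensions without indexing into the size by hand.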

Operations
^^^^^^^^^^
There are multiple syntaxes for operations. Let's look at addition as an example.

Addition: syntax 1



In [None]:
y = torch.rand(5, 3)
print(x + y)

Addition: syntax 2



In [None]:
print(torch.add(x, y))

Addition: giving an output tensor



In [None]:
result = torch.Tensor(5, 3)
torch.add(x, y, out=result)
print(result)

Addition: in-place



In [None]:
# adds x to y
y.add_(x)
print(y)

<div class="alert alert-info"><h4>Note</h4><p>Any operation that mutates a tensor in-place is post-fixed with an ``_``.
    For example, ``x.copy_(y)`` and ``x.t_()`` will change ``x``.</p></div>
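To make the naming convention concrete, here is a minimal sketch contrasting ``add`` with ``add_``. The tensors and the helper name ``x_before`` are chosen only for illustration.

```python
import torch

x = torch.ones(2, 2)
y = torch.ones(2, 2)

# out-of-place: returns a new tensor and leaves x untouched
z = x.add(y)
x_before = x.clone()

# in-place (note the trailing underscore): x itself now holds x + y
x.add_(y)

print(torch.equal(x_before, torch.ones(2, 2)))
print(torch.equal(x, z))
```

The out-of-place form is usually the safer default; the in-place form avoids allocating a new tensor when the old value is no longer needed.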

You can use standard NumPy-like indexing with all its bells and whistles!



In [None]:
print(x[:, 1])
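A few more indexing patterns, as a sketch. It assumes a PyTorch release that provides ``torch.arange`` with a ``dtype`` argument and ``Tensor.reshape``; a tensor with known values is used here instead of ``torch.rand`` so the slices are easy to check by eye.

```python
import torch

# rows are [0, 1, 2], [3, 4, 5], ..., [12, 13, 14]
x = torch.arange(15, dtype=torch.float32).reshape(5, 3)

second_column = x[:, 1]    # every row, column 1
first_two_rows = x[:2]     # rows 0 and 1, all columns
corner = x[1:3, ::2]       # rows 1 and 2, columns 0 and 2

print(second_column)
print(first_two_rows)
print(corner)
```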

**Read later:**


  100+ Tensor operations, including transposing, indexing, slicing,
  mathematical operations, linear algebra, random numbers, etc., are described
  `here <http://pytorch.org/docs/torch>`_

Numpy Bridge
------------

Converting a torch Tensor to a numpy array and vice versa is a breeze.

The torch Tensor and numpy array will share their underlying memory
locations, and changing one will change the other.

Converting torch Tensor to numpy Array
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^



In [None]:
a = torch.ones(5)
print(a)

In [None]:
b = a.numpy()
print(b)

See how the NumPy array changes in value when the Tensor is modified in-place.



In [None]:
a.add_(1)
print(a)
print(b)

Converting numpy Array to torch Tensor
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
See how changing the NumPy array changes the torch Tensor automatically.



In [None]:
import numpy as np
a = np.ones(5)
b = torch.from_numpy(a)
np.add(a, 1, out=a)
print(a)
print(b)

All the Tensors on the CPU except a CharTensor support converting to
NumPy and back.

CUDA Tensors
------------

Tensors can be moved onto the GPU using the ``.cuda`` method.



In [None]:
# let us run this cell only if CUDA is available
if torch.cuda.is_available():
    x = x.cuda()
    y = y.cuda()
    print(x + y)
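A hedged sketch of a full round trip: compute on the GPU when one is present and bring the result back to host memory with ``.cpu()``. The fallback branch and the names ``total`` and ``expected`` are assumptions added for illustration, so the cell also runs on machines without CUDA.

```python
import torch

x = torch.rand(5, 3)
y = torch.rand(5, 3)
expected = x + y

if torch.cuda.is_available():
    # compute on the GPU, then copy the result back to host memory
    total = (x.cuda() + y.cuda()).cpu()
else:
    # no GPU present: fall back to the CPU computation
    total = x + y

print(total.is_cuda)
```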

Exercises
------------


1) Initialize random tensors A, B, C of size [2,3], [2,3], [3,3,2].

In [6]:
A = torch.rand(2, 3)
B = torch.rand(2, 3)
C = torch.rand(3, 3, 2)
print(A)
print(B)
print(C)


 0.2244  0.6546  0.0518
 0.3574  0.5349  0.4076
[torch.FloatTensor of size 2x3]


 0.3297  0.5256  0.9737
 0.3903  0.5924  0.1489
[torch.FloatTensor of size 2x3]


(0 ,.,.) = 
  0.0727  0.0214
  0.4441  0.5672
  0.2008  0.4544

(1 ,.,.) = 
  0.3912  0.1632
  0.3100  0.3505
  0.0269  0.5179

(2 ,.,.) = 
  0.2365  0.0821
  0.0904  0.2671
  0.5339  0.1436
[torch.FloatTensor of size 3x3x2]



2) Fill tensor A with all 10s

In [7]:
A.fill_(10)
print(A)


 10  10  10
 10  10  10
[torch.FloatTensor of size 2x3]



3) Fill tensor B with elements sampled from the normal distribution

In [8]:
B.normal_()
print(B)


-0.7217  0.8695 -0.5743
-0.4224 -2.0295  1.2609
[torch.FloatTensor of size 2x3]



4) Point-wise multiply A with B, and put the result into tensor B

In [9]:
B.mul_(A)  # in-place: B now holds the element-wise product A * B
print(B)


 -7.2172   8.6949  -5.7433
 -4.2239 -20.2948  12.6089
[torch.FloatTensor of size 2x3]



5) Print the mean and standard deviation of the elements of B

In [10]:
print(B.mean())
print(B.std())

-2.695901791254679
11.880839133784619
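For reference, the sketch below recomputes both statistics by hand. It assumes that ``std()`` applies Bessel's correction (dividing by ``n - 1``) by default, as current PyTorch releases do. The tensor is a small hand-picked example, not the ``B`` from the exercise.

```python
import math
import torch

B = torch.tensor([[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]])

n = B.numel()
mean = B.sum().item() / n

# sample variance: sum of squared deviations divided by n - 1
var = ((B - mean) ** 2).sum().item() / (n - 1)
manual_std = math.sqrt(var)

print(mean, B.mean().item())
print(manual_std, B.std().item())
```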


6) Fill tensor C with elements sampled from the uniform distribution U(-1,1). Print the dimensions of C.

In [11]:
C.uniform_(-1, 1)
print(C)
print(C.size())


(0 ,.,.) = 
  0.3252  0.9528
  0.7135  0.9721
  0.0497  0.8947

(1 ,.,.) = 
 -0.4838  0.1501
 -0.4662 -0.9419
  0.3977  0.8387

(2 ,.,.) = 
  0.2892  0.7378
 -0.3176  0.1429
 -0.6857  0.0676
[torch.FloatTensor of size 3x3x2]

torch.Size([3, 3, 2])


7) Transpose the second and third dimensions of tensor C, and put the result into tensor C itself (in-place). Print the dimensions of C.

In [12]:
C.transpose_(1, 2)
print(C)
print(C.size())


(0 ,.,.) = 
  0.3252  0.7135  0.0497
  0.9528  0.9721  0.8947

(1 ,.,.) = 
 -0.4838 -0.4662  0.3977
  0.1501 -0.9419  0.8387

(2 ,.,.) = 
  0.2892 -0.3176 -0.6857
  0.7378  0.1429  0.0676
[torch.FloatTensor of size 3x2x3]

torch.Size([3, 2, 3])


8) Show the contiguity property of the tensors

In [13]:
print(A.is_contiguous())
print(B.is_contiguous())
print(C.is_contiguous())

True
True
False
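One way to see why the transposed tensor reports ``False`` is to look at its strides. The sketch below uses a fresh tensor of the same shape as ``C``; the names ``strides_before``, ``strides_after`` and ``D`` are illustrative.

```python
import torch

C = torch.rand(3, 3, 2)
strides_before = C.stride()    # (6, 2, 1): row-major layout

# transpose_ swaps the metadata of dims 1 and 2 without moving any data
C.transpose_(1, 2)
strides_after = C.stride()     # (6, 1, 2): no longer row-major

# contiguous() copies the values into a fresh row-major buffer
D = C.contiguous()

print(strides_before, strides_after)
print(C.is_contiguous(), D.is_contiguous())
```

The values of ``C`` and ``D`` are identical; only the memory layout differs.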


9) Print the second column of the third block of tensor C, i.e. index 2 along the first dimension (note the zero-based indexing)

In [None]:
C[2][:,1]

In [None]:
C[2,:,1]
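The two indexing forms above select the same elements. A quick check, sketched on a fresh random tensor with the same shape as the transposed ``C`` (the names ``chained`` and ``direct`` are illustrative):

```python
import torch

C = torch.rand(3, 2, 3)

chained = C[2][:, 1]    # select block 2 first, then column 1 of that block
direct = C[2, :, 1]     # the same selection in a single indexing expression

print(torch.equal(chained, direct))
```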

10) Perform operation A+B+C (note the broadcasting)

In [18]:
print(A.size())
print(B.size())
print(C.size())

torch.Size([2, 3])
torch.Size([2, 3])
torch.Size([3, 2, 3])


In [48]:
print(A)
print(B)
print(C)


 -4.4344  27.3897  -1.4866
  1.5522 -30.5896  35.2178
[torch.FloatTensor of size 2x3]


 -7.2172   8.6949  -5.7433
 -4.2239 -20.2948  12.6089
[torch.FloatTensor of size 2x3]


(0 ,.,.) = 
 -11.3264  36.7981  -7.1801
  -1.7189 -49.9123  48.7214

(1 ,.,.) = 
 -12.1354  35.6184  -6.8322
  -2.5216 -51.8264  48.6654

(2 ,.,.) = 
 -11.3623  35.7670  -7.9155
  -1.9339 -50.7415  47.8943
[torch.FloatTensor of size 3x2x3]



In [49]:
A+B+C[0]


 -22.9780   72.8827  -14.4100
  -4.3905 -100.7968   96.5481
[torch.FloatTensor of size 2x3]

In [50]:
A+B+C[1]


 -23.7869   71.7030  -14.0620
  -5.1933 -102.7108   96.4921
[torch.FloatTensor of size 2x3]

In [51]:
A+B+C[2]


 -23.0139   71.8516  -15.1454
  -4.6056 -101.6260   95.7210
[torch.FloatTensor of size 2x3]
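The three per-block sums above can also be written as a single expression, assuming a PyTorch version that supports NumPy-style broadcasting. The sketch below uses fresh random tensors rather than the values printed above, and ``per_block`` is a hypothetical name for the stacked manual result.

```python
import torch

A = torch.rand(2, 3)
B = torch.rand(2, 3)
C = torch.rand(3, 2, 3)

# shape (2, 3) is aligned with the trailing dims of (3, 2, 3),
# so A + B is added to each of the three blocks of C
total = A + B + C

# the same result assembled block by block, as in the cells above
per_block = torch.stack([A + B + C[i] for i in range(3)])

print(total.size())
print(torch.allclose(total, per_block))
```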

11) In-place store the result into tensor C

In [52]:
# copy_ writes the broadcast sum into C's existing storage (in-place)
C.copy_(torch.from_numpy(A.numpy() + B.numpy() + C.numpy()))

In [53]:
#C = A+B+C
print(C)


(0 ,.,.) = 
  -22.9780   72.8827  -14.4100
   -4.3905 -100.7968   96.5481

(1 ,.,.) = 
  -23.7869   71.7030  -14.0620
   -5.1933 -102.7108   96.4921

(2 ,.,.) = 
  -23.0139   71.8516  -15.1454
   -4.6056 -101.6260   95.7210
[torch.FloatTensor of size 3x2x3]
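A hedged sketch of one in-place alternative that stays within torch, assuming a PyTorch version with broadcasting support. ``data_ptr()`` is used here only to confirm that ``C`` keeps its original storage; the tensors are fresh random values, not the ones printed above.

```python
import torch

A = torch.rand(2, 3)
B = torch.rand(2, 3)
C = torch.rand(3, 2, 3)
expected = A + B + C

ptr_before = C.data_ptr()

# add_ broadcasts A + B over the first dimension and writes into C itself
C.add_(A + B)

same_storage = C.data_ptr() == ptr_before
print(same_storage)
```

By contrast, an assignment such as ``C = A + B + C`` rebinds the name ``C`` to a newly allocated tensor and leaves the original storage untouched.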

