Closed
Labels
accelerator: tpu · bug · help wanted · waiting on author
Description
🐛 Bug
After #2016 was fixed by PR #2033, the code runs perfectly on a single TPU core (including a specific core index), but it no longer works with 8 TPU cores. After training completes, it raises: RuntimeError: Cannot replicate if number of devices (1) is different from 8.
To Reproduce
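The original notebook isn't reproduced here; below is a minimal sketch of a setup that triggers this error, assuming pytorch-lightning master and torch_xla nightly on an 8-core TPU. The BoringModel-style module, dataset, and all hyperparameters are illustrative, not taken from the issue:

```python
# Hypothetical minimal reproduction; model and data are placeholders.
import torch
from torch.utils.data import DataLoader, Dataset
import pytorch_lightning as pl


class RandomDataset(Dataset):
    """Random tensors standing in for real training data."""

    def __init__(self, size, length):
        self.data = torch.randn(length, size)

    def __getitem__(self, index):
        return self.data[index]

    def __len__(self):
        return len(self.data)


class BoringModel(pl.LightningModule):
    """Single linear layer, just enough to exercise the training loop."""

    def __init__(self):
        super().__init__()
        self.layer = torch.nn.Linear(32, 2)

    def forward(self, x):
        return self.layer(x)

    def training_step(self, batch, batch_idx):
        loss = self(batch).sum()
        return {"loss": loss}

    def configure_optimizers(self):
        return torch.optim.SGD(self.parameters(), lr=0.1)


model = BoringModel()
train_loader = DataLoader(RandomDataset(32, 64), batch_size=2)

# tpu_cores=1 or tpu_cores=[5] (a specific core) works fine.
# tpu_cores=8 fails after training completes with:
# RuntimeError: Cannot replicate if number of devices (1) is different from 8
trainer = pl.Trainer(tpu_cores=8, max_epochs=1)
trainer.fit(model, train_loader)
```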
Expected behavior
Training should run on 8 TPU cores without error, just as it does on a single core.
Environment
- pytorch/xla: nightly
- pytorch-lightning: master
- PyTorch Version (e.g., 1.0): 1.5
- OS (e.g., Linux): Linux
- How you installed PyTorch (conda, pip, source): pip
- Python version: 3.7