# Exploring TensorFlow Datasets We Will Use with Torch
This notebook shows how to load datasets from the TensorFlow Datasets catalog and use them with PyTorch, using the helper code in our GitHub repository.

## Set-up
The utility and helper code we have created lives here:

https://github.com/hlab-repo/data_science_symposium.git

In [None]:
!git clone https://github.com/hlab-repo/data_science_symposium.git

In [2]:
from pprint import pprint

import numpy as np
import torch
from torch.utils.data import DataLoader

from data_science_symposium.datasets import BaseTextIterDataset

## Exploring the base dataset
We can load any text dataset from TensorFlow's catalog with the base `BaseTextIterDataset` class. For this example, let's download the SNLI dataset and see what it looks like. The information required for the API call below is on the dataset's catalog page (the other available datasets are listed in the left-hand sidebar):

https://www.tensorflow.org/datasets/catalog/snli

We need to specify a few arguments so that our `BaseTextIterDataset` instance knows which dataset to download and use: the `tf_dataset` name, the `epoch_size` (useful because this is a large dataset), and which `split_` to use (for example, test or train).

In [None]:
dataset = BaseTextIterDataset(
    tf_dataset='snli', # 570k Eng. sent. pairs with labels: entailment, contradiction, and neutral
    epoch_size=16,     # using a very small epoch size for illustration; 10k+ more typical
    split_='test',     # load the smaller test split for illustration
    )
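
To make the idea of an epoch size concrete, here is a minimal, plain-Python sketch of capping a large stream of examples at a fixed number per pass. It is only an illustration under stated assumptions: `capped_epoch` and the toy `examples` stream are hypothetical names, and this is not necessarily how `BaseTextIterDataset` handles `epoch_size` internally.

```python
from itertools import islice


def capped_epoch(stream, epoch_size):
    """Return an iterator over at most `epoch_size` examples from `stream`.

    Hypothetical helper for illustration only; it is not part of the
    data_science_symposium repository.
    """
    return islice(stream, epoch_size)


# Toy stand-in for a large streaming dataset (made-up records, not real SNLI data).
examples = (
    {"premise": f"premise {i}", "hypothesis": f"hypothesis {i}", "label": i % 3}
    for i in range(1000)
)

# Each "epoch" consumes the next 16 examples instead of the whole stream.
first_epoch = list(capped_epoch(examples, 16))
second_epoch = list(capped_epoch(examples, 16))

print(len(first_epoch))            # number of examples in one capped pass
print(second_epoch[0]["premise"])  # the second pass resumes where the first stopped
```

Capping each pass this way keeps an epoch short even when the underlying dataset has hundreds of thousands of examples, which is why a small value such as 16 is convenient for a quick look at the data.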

In [4]:
dataloader = DataLoader(dataset, batch_size=2) # each pass through dataloader should have 2 batches

In [5]:
for batch in dataloader:
    pprint(batch)

{'hypothesis': [('A girl is entertaining on stage',),
                ('A group of people posing in front of a body of water.',),
                ("The group of people aren't inide of the building.",),
                ('The people are taking a carriage ride.',),
                ('Two girls are getting ready to ride horses.',),
                ('Two youths were pushing each other.',),
                ('A child takes a nap on a couch.',),
                ('A person is consuming food.',),
                ('A boy in light green pajamas plays.',),
                ('The boy is wearing a backpack.',),
                ('The man enjoys dogs.',),
                ('A man is pointing into the ear and wearing a striped '
                 'shirt.',),
                ('a woman and a baby watching',),
                ('There are women outside in this picture',),
                ('The man is walking his dog.',),
                ('The man in the leather jacket is walking to his '
                 'motor