Why I read the fer2013.csv only 13000 more data? #6

renhui19931001 · 2017-03-16T08:14:45Z

I use the script of cvs_to_numpy.py,and download the dataset in https://www.kaggle.com/c/challenges-in-representation-learning-facial-expression-recognition-challenge/data and then ,I try to get the .npy from the script but I only get 13000 more data from it

mysayalHan · 2017-03-31T03:42:14Z

I thought he used opencv to downsample the data. Unless the face in the picture could be found by xml in opencv, it could not be training data.

dearhoper · 2017-04-01T08:11:08Z

The total count of the images in the dataset is 35887. 28709 for Training, 3589 for PublicTest and another 3589 for PrivateTest.
cvs_to_numpy.py filters the images according to whether or not found a face in the image.
So, only 13000+ images through the selection.

sakshamjindal07 · 2017-05-24T07:36:40Z

Hi @dearhoper ,

I was successful in splitting the dataset into train and test and hence, I was able to generate 4 files as listed:

self._images         = np.load('data_set_fer2013.npy')
self._labels           = np.load('data_labels_fer2013.npy')
self._images_test  = np.load('test_set_fer2013.npy')
self._labels_test    = np.load('test_labels_fer2013.npy')

No. of images in data_set = 10809
No. of images in test_set = 3157

The problem which I am facing is that when the training of the model starts, the tflearn generates a log :

[+] Training network

Run id: emotion_recognition
Log directory: /tmp/tflearn_logs/

Training samples: 10809
Validation samples: 10809

I just want to know if you faced a similar issue where you the training sample and validation sample had the same no. of images. I tried to dig into the TFlearn library but could not find any work around.

Can you help me point out the issue with this ?

Regards
Saksham

dearhoper · 2017-05-25T02:11:02Z

Hi @sakshamjindal07 ,

There is a bug in the dataset_loader.py file, See the bold words as follows:
...
def load_from_save(self):
self._images = np.load(join(SAVE_DIRECTORY, SAVE_DATASET_IMAGES_FILENAME))
self._labels = np.load(join(SAVE_DIRECTORY, SAVE_DATASET_LABELS_FILENAME))
self._images_test = np.load(join(SAVE_DIRECTORY, SAVE_DATASET_IMAGES_TEST_FILENAME))
self._labels_test = np.load(join(SAVE_DIRECTORY, SAVE_DATASET_LABELS_TEST_FILENAME))
self._images = self._images.reshape([-1, SIZE_FACE, SIZE_FACE, 1])
self._images_test = self._images_test.reshape([-1, SIZE_FACE, SIZE_FACE, 1])
self._labels = self._labels.reshape([-1, len(EMOTIONS)])
self._labels_test = self._labels_test.reshape([-1, len(EMOTIONS)])

Rloguzzo · 2017-05-30T14:44:28Z

@sakshamjindal07 How did you get the test data?

NikAndrush · 2017-10-24T14:15:19Z

@dearhoper how i can get the test data?

dearhoper · 2017-10-25T07:10:45Z

@NikAndrush
Download the FER2013 dataset and exact its content: https://www.kaggle.com/c/challenges-in-representation-learning-facial-expression-recognition-challenge/data.
These are 3 fields in the fer2013.csv. The "Usage" field means the use of the current image. "Training" is expressed as training data, and "PublicTest" is expressed as test data. (Data marked as "PrivateTest" has not been adopted here.)
You can use cvs_to_numpy.csv to parse the training data and test data.

asthasharma017 · 2017-12-19T00:55:44Z

Is the data set removed from [https://www.kaggle.com/c/challenges-in-representation-learning-facial-expression-recognition-challenge/data.] ?
I am getting 404 error for this.

jingyugao · 2017-12-20T13:22:58Z

@asthasharma017 you should first sign up an account, and then you will get the file.

xhjcxxl mentioned this issue Apr 4, 2018

How to solve the problem about data_set_fer2013.npy etc. #39

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why I read the fer2013.csv only 13000 more data? #6

Why I read the fer2013.csv only 13000 more data? #6

renhui19931001 commented Mar 16, 2017

mysayalHan commented Mar 31, 2017

dearhoper commented Apr 1, 2017

sakshamjindal07 commented May 24, 2017

dearhoper commented May 25, 2017

Rloguzzo commented May 30, 2017

NikAndrush commented Oct 24, 2017

dearhoper commented Oct 25, 2017

asthasharma017 commented Dec 19, 2017

jingyugao commented Dec 20, 2017

Why I read the fer2013.csv only 13000 more data? #6

Why I read the fer2013.csv only 13000 more data? #6

Comments

renhui19931001 commented Mar 16, 2017

mysayalHan commented Mar 31, 2017

dearhoper commented Apr 1, 2017

sakshamjindal07 commented May 24, 2017

[+] Training network

Run id: emotion_recognition Log directory: /tmp/tflearn_logs/

Training samples: 10809 Validation samples: 10809

dearhoper commented May 25, 2017

Rloguzzo commented May 30, 2017

NikAndrush commented Oct 24, 2017

dearhoper commented Oct 25, 2017

asthasharma017 commented Dec 19, 2017

jingyugao commented Dec 20, 2017

Run id: emotion_recognition
Log directory: /tmp/tflearn_logs/

Training samples: 10809
Validation samples: 10809