How should ground truth data be classified? #25

Nate-Wessel · 2018-05-08T03:25:52Z

The purpose of the ground truth data is to test the performance of the algorithm on a known dataset. It seems to me that there are two broad potential approaches to this:

We can classify the data according to what actually happened on the ground, just translated into the required language of discrete trips and activities.
Or we can classify according to what we see in the GPS points, informed by what actually happened on the ground.

The ground truth data we currently have (my own) is a sloppy mix of these.

To give an example, should we include activity locations that we actually visited but that don't look like activities in coordinates.csv, perhaps because of missing or inaccurate data?

The benefit of producing a properly true ground truth is that we can measure how far our algorithm (considered as encompassing the app, the phone, etc.) is from actual reality as interpreted by the one who lived it, or at least from a more traditional activity survey.

The benefit of ground truth as manual classification of input data is that it tells us how far we are from the best possible results we can get from the data we have available.

My Reality > Phone's Reality > Our interpretation of Phone's Reality

The text was updated successfully, but these errors were encountered:

Nate-Wessel · 2018-05-25T13:42:37Z

Discussed with Michael and Felipe. Consensus seemed to be that it's our job to interpret itinerum, itinerum's job to represent reality.

But I think we have to remember then that we have no idea (quantitatively) how well it does that.

Nate-Wessel added the discussion the means of resolving this have yet to be decided label May 8, 2018

Nate-Wessel closed this as completed May 25, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How should ground truth data be classified? #25

How should ground truth data be classified? #25

Nate-Wessel commented May 8, 2018 •

edited

Loading

Nate-Wessel commented May 25, 2018

How should ground truth data be classified? #25

How should ground truth data be classified? #25

Comments

Nate-Wessel commented May 8, 2018 • edited Loading

Nate-Wessel commented May 25, 2018

Nate-Wessel commented May 8, 2018 •

edited

Loading