ISSUE-600: data-shell/data/animals.txt is truncated #722

agrimstrup · 2018-03-03T01:13:05Z

Added records to bring animals.txt up to the 586 lines mentioned in the lesson text. Added ellipses to the file listing to indicate that the file contains more data than is shown.

Before the change, checked the output of the example pipeline:
[arne@localhost data]$ cat animals.txt | head -n 5 | tail -n 3 | sort -r
2012-11-06,rabbit
2012-11-06,deer
2012-11-05,raccoon

Added records and checked the line count and the distribution of values:
[arne@localhost data]$ wc -l animals.txt
586 animals.txt

[arne@localhost data]$ cat animals.txt | sort | uniq -c
43 2012-11-05,bear
35 2012-11-05,deer
44 2012-11-05,fox
44 2012-11-05,rabbit
45 2012-11-05,raccoon
42 2012-11-06,bear
37 2012-11-06,deer
44 2012-11-06,fox
39 2012-11-06,rabbit
35 2012-11-06,raccoon
36 2012-11-07,bear
32 2012-11-07,deer
40 2012-11-07,fox
36 2012-11-07,rabbit
34 2012-11-07,raccoon
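
As a cross-check on the listing above, the per-value counts reported by `uniq -c` should add up to the line count reported by `wc -l`. A minimal sketch, with the fifteen counts copied by hand from the output above (the variable name `total` is only illustrative):

```shell
# Sum the fifteen counts from the `sort | uniq -c` output above.
total=0
for n in 43 35 44 44 45 42 37 44 39 35 36 32 40 36 34; do
  total=$((total + n))
done
echo "$total"   # prints 586
```

The sum agrees with the 586 lines reported by `wc -l animals.txt`.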

Re-ran the example pipeline to confirm that its output is unchanged:
[arne@localhost data]$ cat animals.txt | head -n 5 | tail -n 3 | sort -r
2012-11-06,rabbit
2012-11-06,deer
2012-11-05,raccoon
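
The output is identical because `head -n 5` only reads the first five lines, so records appended after line 5 cannot affect the pipeline. A small sketch of that property, assuming a stand-in file rather than the real animals.txt: lines 3 to 5 are taken from the output shown above, while the remaining records, the temporary directory, and the file names `short.txt` and `long.txt` are made up for illustration.

```shell
# Stand-in for animals.txt: only lines 3-5 come from the output shown above;
# lines 1-2 and the appended records are hypothetical.
tmpdir=$(mktemp -d)
printf '%s\n' \
  '2012-11-05,deer' \
  '2012-11-05,rabbit' \
  '2012-11-05,raccoon' \
  '2012-11-06,rabbit' \
  '2012-11-06,deer' > "$tmpdir/short.txt"

# Longer copy: the same first five lines plus extra appended records.
cp "$tmpdir/short.txt" "$tmpdir/long.txt"
printf '%s\n' '2012-11-07,fox' '2012-11-07,bear' >> "$tmpdir/long.txt"

# Run the example pipeline on both files (LC_ALL=C pins the sort order).
before=$(head -n 5 "$tmpdir/short.txt" | tail -n 3 | LC_ALL=C sort -r)
after=$(head -n 5 "$tmpdir/long.txt" | tail -n 3 | LC_ALL=C sort -r)

if [ "$before" = "$after" ]; then
  echo "pipeline output unchanged"
else
  echo "pipeline output differs"
fi
rm -rf "$tmpdir"
```

This only holds when the new records are appended; inserting or reordering records within the first five lines would change the result.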

gcapes · 2018-03-05T08:48:36Z

If this file is to be updated, the zip file for learners to download also needs updating.

colinmorris · 2018-03-05T22:20:15Z

This looks good to me, though I'd like to get a +1 from another maintainer before merging this in case there's something I'm missing. @shwina, any thoughts?

agrimstrup · 2018-03-05T22:34:22Z

@gcapes I didn't see the zip file in the first pass. An updated version of the zip file has been added to the PR.

gcapes · 2018-03-06T08:53:39Z

Sorry to be late with my thoughts, but is there actually a need to have a larger data set? If the file works for the examples, then it seems fit for purpose.
The suggestion on #600 and #720 to indicate this is a subset of a larger data set seems like it might be a good light-touch fix?

gdevenyi · 2018-03-07T17:01:11Z

I agree that there's no need for a "full" dataset. The truncation can serve fine as an example of a subset.

colinmorris · 2018-03-19T19:17:05Z

I'm personally fine with either fix (increasing the file size to match the existing lesson text, or changing the lesson text to match the existing file size). However, the former has the advantage that we have a pull request that implements it (right here), and the latter is hypothetical.

Is there a downside to merging this? It seems like a strict improvement over the current state of the lesson.

shwina · 2018-04-04T10:37:07Z

+1 from me

gcapes · 2019-01-16T11:26:22Z

Thanks for the PR. This inconsistency looks to have been fixed now.

Thanks again for your contribution.

ISSUE-600: Added additional records to bring the animals.txt file up …

7218caf

…to 586 lines as mentioned in the text. Added ellipses to the file listing to indicate more data exists in the file than listed.

colinmorris requested a review from shwina March 5, 2018 22:20

ISSUE-600 Updated student data packet as recommended in comments

98626db

gcapes closed this Jan 16, 2019
