This repository has been archived by the owner on Jan 18, 2018. It is now read-only.

duplicate entries #94

Closed
nickhilliard opened this issue Feb 24, 2017 · 3 comments

Comments

@nickhilliard

% cat sorted_unique_cf.txt | tr '[A-Z]' '[a-z]' | sort | uniq -d | wc -l
    1213
%

This should return zero; instead, 1213 entries appear more than once after lowercasing.
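
For illustration, here is a minimal sketch of a case-insensitive dedupe after which the check above does return zero. The sample entries and the names `sample`, `deduped`, and `remaining` are hypothetical, and this is only one possible approach, not necessarily the one used to regenerate the list:

```shell
# Hypothetical sample with entries that differ only by case.
sample() { printf '%s\n' Example.com example.com foo.org FOO.ORG bar.net; }

# Fold everything to lowercase, then sort and drop repeats in one step.
deduped=$(sample | tr '[:upper:]' '[:lower:]' | sort -u)

# Re-run the duplicate check from the report against the cleaned list.
remaining=$(printf '%s\n' "$deduped" | tr '[:upper:]' '[:lower:]' | sort | uniq -d | wc -l | tr -d ' ')
echo "duplicates remaining: $remaining"
```

Folding case before `sort -u` matters here because domain names are case-insensitive, so `Example.com` and `example.com` count as the same entry.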

@nevinera

Yeah, he documented that he used | uniq | sort, which I thought was a typo. uniq only collapses adjacent duplicate lines, so it has to run after sort, not before. A similar (simpler) comparison:

$ cat sites-using-cloudflare-master/sorted_unique_cf.txt  | sort | uniq | wc -l
 4287625
$ cat sites-using-cloudflare-master/sorted_unique_cf.txt  | wc -l
 4288852
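
The two counts above differ by 1227 lines. A small sketch of why the order of the pipeline matters, using a hypothetical four-line sample in place of the real file (the names `sample`, `uniq_then_sort`, and `sort_then_uniq` are illustrative only):

```shell
# Hypothetical unsorted sample: each entry appears twice, never adjacently.
sample() { printf '%s\n' b.example a.example b.example a.example; }

# uniq only collapses adjacent repeats, so on unsorted input nothing is removed.
uniq_then_sort=$(sample | uniq | sort | wc -l | tr -d ' ')

# Sorting first makes the duplicates adjacent, so uniq can drop them.
sort_then_uniq=$(sample | sort | uniq | wc -l | tr -d ' ')

echo "uniq | sort: $uniq_then_sort lines"
echo "sort | uniq: $sort_then_uniq lines"
```

On real input, `uniq | sort` removes only the duplicates that happened to be adjacent already, which would be consistent with a list that is mostly but not fully deduplicated.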

@coderobe
Contributor

I have opened a PR with a fixed version, see #99

@pirate
Owner

pirate commented Feb 24, 2017

Thanks @coderobe, will merge that shortly and fix the README methodology.

@pirate pirate closed this as completed Feb 24, 2017