Skip to content

joshwcomeau/script-search

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Site Fetch

TODO

Getting Data

The current data source is a txt file of the top 1,000,000 sites, fetched from https://statvoo.com/dl/top-1million-sites.csv.zip

This needs to be trimmed into a more reasonable number. Use the following command line script to trim it to a more reasonable number, as well as remove line prefixes:

head -50000 data.txt | cut -d, -f2 | cut -d/ -f1 > topsites.txt

About

Find code used on the world's top sites

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages