Unofficial and incomplete lists of various domain names, for research purposes only.
The CZ list is append-only because there's no easy way to filter out expired domains (see this issue). @k47 ran a scan and says roughly half of the domains are dead. Consider yourself warned.
- Initial set by Ondřej Caletka
- A few updates by Patrik Votoček
- Majestic Million
- Alexa Top 1M
- regZone.cz
- HSTS preloaded list
- scans.io dataset by @roycewilliams
- Expired domains archive from Monitoruju.net
- Nette Framework-powered list by @Myiyk
- Certificate Transparency logs
- A list by Vítězslav Lindovský
- A private collection by Vladimír Smitka
- Another private collection by Kamil Vavra
- A list by domainsproject.org
Thanks!
Registrant ID and expiration date from Whois for the domains in tld-cz.txt. Also contains some historical data. If a domain from tld-cz.txt has no record in whois-cz.txt with expire_at > current_date, it should be considered nonexistent.
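The expiry rule above could be applied roughly as in the following sketch. The layout of whois-cz.txt is not described here, so the sketch assumes a tab-separated file with the columns domain, registrant ID and expire_at (as YYYY-MM-DD); adjust the separator and field numbers to the real file. The function name `live_domains` is hypothetical and not part of this repository.

```shell
# Hypothetical helper, not part of the repository.
# ASSUMPTION: the whois file is tab-separated with the columns
#   domain <TAB> registrant_id <TAB> expire_at (YYYY-MM-DD)
# Prints the domains from the list that have at least one whois record
# whose expire_at is later than the given date (default: today).
live_domains() {
    domains="$1"                       # e.g. tld-cz.txt, one domain per line
    whois="$2"                         # e.g. whois-cz.txt
    today="${3:-$(date +%Y-%m-%d)}"    # ISO dates compare correctly as strings
    awk -F '\t' -v today="$today" '
        # First file (whois): remember domains with a future expiration.
        # Historical rows with past dates are simply ignored.
        NR == FNR { if ($3 > today) live[$1] = 1; next }
        # Second file (domain list): keep only the remembered domains.
        ($1 in live)
    ' "$whois" "$domains"
}
```

For example, `live_domains tld-cz.txt whois-cz.txt > tld-cz-live.txt` would write the domains that still look registered to a new file (the output file name is just an illustration).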
Czech government domains (fully-qualified, not just eTLD+1) originally from circa 2018.
Have a list of domains you'd like to add? Feel free to create a pull request!
Here's a short how-to:
- Only 2nd-level domains `foo.tld`, no `bar.foo.tld` (lines should match `^[a-z0-9-]+\.tld$`, so for example `grep --only-matching --perl-regexp "[a-z0-9-]+\.cz" data.txt > yourlist.txt`)
- Use LF newlines, not CRLF (`dos2unix yourlist.txt`)
- Generate a new list, for example with `cat tld-cz.txt yourlist.txt | sort | uniq > tld-cz-new.txt`
- Review `tld-cz-new.txt`, rename to `tld-cz.txt`, pull request it
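The steps above can be combined into a single function, sketched below for the .cz list. This is one possible way to do it, not the maintainers' tooling: the name `merge_list` is hypothetical, `tr -d '\r'` stands in for dos2unix, lines are validated as a whole instead of being extracted with `--only-matching`, and `LC_ALL=C` is an assumption made only to keep the sort order reproducible.

```shell
# Hypothetical helper, not part of the repository.
# Merges candidate domains into an existing list:
#   1. strips CR characters (CRLF -> LF), in place of dos2unix
#   2. keeps only lines that are exactly a 2nd-level .cz domain
#   3. sorts the combined list and removes duplicates
merge_list() {
    existing="$1"      # e.g. tld-cz.txt
    candidates="$2"    # e.g. yourlist.txt or raw data, one name per line
    output="$3"        # e.g. tld-cz-new.txt
    {
        cat "$existing"
        tr -d '\r' < "$candidates" | grep -E '^[a-z0-9-]+\.cz$'
    } | LC_ALL=C sort -u > "$output"   # ASSUMPTION: byte-wise sort order
}
```

For example, `merge_list tld-cz.txt yourlist.txt tld-cz-new.txt` produces the file to review before renaming it to tld-cz.txt. Because the pattern is anchored on both ends, subdomains such as bar.foo.cz are dropped rather than shortened to foo.cz.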
Thank you!