Unofficial and incomplete lists of various domain names, for research purposes only.
The CZ list is an append-only list because there's no easy way to filter out expired domains, see this issue. @k47 ran a scan and says roughly half of the domains are dead. Consider yourself warned.
- Initial set by Ondřej Caletka
- Few updates by Patrik Votoček
- Majestic Million
- Alexa Top 1M
- HSTS preloaded list
- scans.io dataset by @roycewilliams
- Expired domains archive from Monitoruju.net
- Nette Framework-powered list by @Myiyk
- Certificate Transparency logs
- A list by Vítězslav Lindovský
- A private collection by Vladimír Smitka
- Another private collection by Kamil Vavra
- A list by domainsproject.org
Registrant ID and date of expiration from the Whois for tld-cz.txt. Contains also some historical data. If a domain from tld-cz.txt is not in whois-cz.txt with expire_at > current_date it should be considered as not existing.
Czech government domains (fully-qualified, not just eTLD+1) originally from circa 2018.
Want to contribute?
Have a list of domains you'd like to add? Feel free to create a pull request!
Here's a short how-to:
- Only 2nd-level domains foo.tld, no bar.foo.tld (lines should match
^[a-z0-9-]+\.tld$, so for example
grep --only-matching --perl-regexp "[a-z0-9-]+\.cz" data.txt > yourlist.txt)
- Use LF newlines, not CRLF (
- Generate a new list, for example with
cat tld-cz.txt yourlist.txt | sort | uniq > tld-cz-new.txt
tld-cz-new.txt, rename to
tld-cz.txt, pull request it