PyDomainExtractor
PyDomainExtractor copied to clipboard
Add link to 10 million domains file
From the README
The test was conducted on a file containing 1 million random urls (Mar. 13rd 2022)
It would be good to add a hyperlink to the specific list of random URLs used in the benchmarks (could be a GitHub gist or a separate repository to avoid increasing the filesize of this repository).
Hi and thanks for asking. This was the original project I took the list from: https://www.domcop.com/openpagerank/what-is-openpagerank And this is the link to the domains file: https://www.domcop.com/files/top/top10milliondomains.csv.zip