Extraction of TLD from urls and sorting domains and subdomains for each TLD file
I have a list of million urls. I need to extract the TLD for each url and create multiple files for each TLD. For example collect all urls with .com as tld and dump that in 1 file, another file for .edu tld and so on. Further within each file, I have to sort it alphabetically by domains and then by subdomains etc. Can anyone give me a head start for implementing this in perl? Use URI to parse the URL, Use its host method to get the host, Use Domain::PublicSuffix 's get_root_domain to parse the host name. Use the tld or suffix method to get the real TLD or the pseudo TLD. use feature qw( say );