url1=\'www.google.com\'
url2=\'http://www.google.com\'
url3=\'http://google.com\'
url4=\'www.google\'
url5=\'http://www.google.com/images\'
url6=\'https://www.youtub
A more elegant solution would be using urlparse:
from urllib.parse import urlparse
def get_hostname(url, uri_type='both'):
"""Get the host name from the url"""
parsed_uri = urlparse(url)
if uri_type == 'both':
return '{uri.scheme}://{uri.netloc}/'.format(uri=parsed_uri)
elif uri_type == 'netloc_only':
return '{uri.netloc}'.format(uri=parsed_uri)
The first option includes https
or http
, depending on the link, and the second part netloc
includes what you were looking for.