Detect URLs in a string and wrap with “<a

后端 未结 2 1374
伪装坚强ぢ
伪装坚强ぢ 2021-01-04 20:24

I am looking to write something that seems like it should be easy enough, but for whatever reason I\'m having a tough time getting my head around it.

I am looking to

2条回答
  •  渐次进展
    2021-01-04 20:59

    The "regex magic" you need is just sub (which does a substitution):

    def encode_string_with_links(unencoded_string):
      return URL_REGEX.sub(r'\1', unencoded_string)
    

    URL_REGEX could be something like:

    URL_REGEX = re.compile(r'''((?:mailto:|ftp://|http://)[^ <>'"{}|\\^`[\]]*)''')
    

    This is a pretty loose regex for URLs: it allows mailto, http and ftp schemes, and after that pretty much just keeps going until it runs into an "unsafe" character (except percent, which you want to allow for escapes). You could make it more strict if you need to. For example, you could require that percents are followed by a valid hex escape, or only allow one pound sign (for the fragment) or enforce the order between query parameters and fragments. This should be enough to get you started, though.

提交回复
热议问题