Filter out HTML tags and resolve entities in python

前端 未结 8 1782
暗喜
暗喜 2020-12-03 00:11

Because regular expressions scare me, I\'m trying to find a way to remove all HTML tags and resolve HTML entities from a string in Python.

8条回答
  •  情深已故
    2020-12-03 00:38

    How about parsing the HTML data and extracting the data with the help of the parser ?

    I'd try something like the author described in chapter 8.3 in the Dive Into Python book

提交回复
热议问题