发表新帖

发表新帖

How to remove all html tags from downloaded page

后端未结

关注

 7  2012

鱼传尺愫 2020-12-31 17:32

I have downloaded a page using urlopen. How do I remove all html tags from it? Is there any regexp to replace all <*> tags?

7条回答

无人及你 (楼主)

2020-12-31 18:29

You could use html2text which is supposed to make a readable text equivalent from an HTML source (programatically with Python or as a command-line tool). Thus I may extrapolate your needs from your question...

0 讨论(0)

查看其它7个回答
发布评论:

提交评论
- 加载中...

热议问题