Is there a way to use readability (text extraction algorithm) and a custom algorithm in python to extract links from text?
问题 Is there a way to use readability (text extraction algorithm) and a custom algorithm in python to extract links from text? I'd like to figure out a way of extracting links that are in the body of text. 1.) I use readability in python https://github.com/gfxmonk/python-readability 2.) I'd like to somehow compare the extracted text to the original html text in order to extract links in the actual body of an article. 回答1: Well, it looks like it returns a BeautifulSoup tree. So you should be able