How to find the comment tag <!--…--> with BeautifulSoup?

前端 未结 2 560
醉话见心
醉话见心 2020-12-01 12:54

I tried soup.find(\'!--\') but it doesn\'t seem to work. Thanks in advance.

Edit: Thanks for the tip on how to find all comments. I have a follow up question. How d

2条回答
  •  一向
    一向 (楼主)
    2020-12-01 13:16

    You can find all the comments in a document with via the findAll method. See this example showing how to do exactly what you're trying to do Removing elements:

    In brief, you want this:

    comments = soup.findAll(text=lambda text:isinstance(text, Comment))
    

    Edit: If you're trying to search within the columns, you can try:

    import re
    comments = soup.findAll(text=lambda text:isinstance(text, Comment))
    for comment in comments:
      e = re.match(r'([^<]*)', comment.string).group(1)
      print e
    

提交回复
热议问题