Beautiful Soup if Class “Contains” or Regex?

后端 未结 3 760
离开以前
离开以前 2020-12-05 10:46

If my class names are constantly different say for example:

listing-col-line-3-11 dpt 41
listing-col-block-1-22 dpt 41
listing-col-line-4-13 CWK 12


        
相关标签:
3条回答
  • 2020-12-05 11:03

    Yu can try this:

    regex = re.compile('.*listing-col-.*')
    for EachPart in soup.find_all("div", {"class" : regex}):
            print EachPart.get_text()
    
    0 讨论(0)
  • 2020-12-05 11:05

    BeautifulSoup supports CSS selectors which allow you to select elements based on the content of particular attributes. This includes the selector *= for contains.

    The following will return all div elements with a class attribute containing the text 'listing-col-':

    for EachPart in soup.select('div[class*="listing-col-"]'):
        print EachPart.get_text()
    
    0 讨论(0)
  • 2020-12-05 11:13

    You could avoid regex by using partial matching with gazpacho...

    Input:

    html = """\
    <div class="listing-col-line-3-11 dpt 41">A</div>
    <div class="listing-col-block-1-22 dpt 41">B</div>
    <div class="listing-col-line-4-13 CWK 12">C</div>
    """
    

    Partial matching code:

    from gazpacho import Soup
    
    soup = Soup(html)
    divs = soup.find("div", {"class": "listing-col-"}, partial=True)
    [div.text for div in divs]
    

    Output:

    ['A', 'B', 'C']
    
    0 讨论(0)
提交回复
热议问题