'ascii' codec error in BeautifulSoup

南旧 2020-12-21 12:27

I am using BeautifulSoup to scrape data from an HTML page. Until yesterday everything was fine, but now I am getting this error:

'ascii' codec can't en

2 answers
  • 2020-12-21 13:04

    A wild stab in the dark: you're reading a page that doesn't explicitly declare an encoding and yet is not 7-bit ASCII?
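
One way to test that guess is to check whether the raw page bytes are pure 7-bit ASCII. The following is only a sketch: `is_seven_bit_ascii` is a hypothetical helper and the two sample pages are made up for illustration, not taken from the question.

```python
def is_seven_bit_ascii(data: bytes) -> bool:
    """Return True if every byte in `data` is below 0x80."""
    try:
        data.decode("ascii")
        return True
    except UnicodeDecodeError:
        return False


# Made-up sample pages (assumptions, not the asker's data).
plain = b"<html><body>hello</body></html>"
accented = "<html><body>caf\u00e9</body></html>".encode("utf-8")

print(is_seven_bit_ascii(plain))
print(is_seven_bit_ascii(accented))
```

If the check fails for a page that used to pass, the page content has probably started to include non-ASCII characters.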

  • 2020-12-21 13:12

    A wild guess:

    Try specifying the encoding of the page?

    # BeautifulSoup 3 spelling; in bs4 the keyword is from_encoding.
    soup = BeautifulSoup(page, fromEncoding="utf-8")  # substitute the page's actual encoding
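
An alternative to passing the encoding to the parser is to decode the downloaded bytes yourself and hand the resulting text to BeautifulSoup. This is a minimal sketch using only the standard library; `decode_page` is a hypothetical helper, and the Latin-1 sample is an assumption chosen to show what a wrong guess looks like.

```python
def decode_page(raw: bytes, encoding: str = "utf-8") -> str:
    """Decode page bytes with an explicit encoding.

    Undecodable bytes become U+FFFD instead of raising an exception.
    """
    return raw.decode(encoding, errors="replace")


# Made-up sample: a page actually served as Latin-1.
raw = "<p>Z\u00fcrich</p>".encode("latin-1")

right = decode_page(raw, "latin-1")  # correct encoding: text round-trips
wrong = decode_page(raw, "utf-8")    # wrong encoding: replacement character appears

# With bs4 installed, the decoded text could then be parsed with
# BeautifulSoup(right, "html.parser").
```

A replacement character (U+FFFD) in the decoded text is a hint that the assumed encoding does not match the page.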
    

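    This can also be a problem with the Python installation itself. If you print non-ASCII characters without BeautifulSoup, do you see the same error? If so, you need to set the default encoding. Note that this works on Python 2 only, and that `sys` must be reloaded first because `setdefaultencoding` is removed from it at startup:

    import sys
    reload(sys)  # Python 2 only: restores setdefaultencoding, which site.py deletes at startup
    sys.setdefaultencoding("utf-8")  # or whatever you want the default encoding to be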
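
Changing the interpreter-wide default is a blunt tool. A narrower option is to encode text explicitly at the point where it is written out. The sketch below is written for Python 3 and uses a made-up sample string; it reproduces an 'ascii' codec failure and then avoids it by choosing UTF-8 explicitly.

```python
text = "caf\u00e9"  # made-up sample containing one non-ASCII character

message = ""
try:
    # Forcing ASCII on non-ASCII text raises UnicodeEncodeError.
    text.encode("ascii")
except UnicodeEncodeError as exc:
    message = str(exc)

# Encoding explicitly as UTF-8 succeeds for any Unicode text.
encoded = text.encode("utf-8")

print(message)
print(encoded)
```

The same idea applies when writing scraped text to a file: opening it with an explicit `encoding="utf-8"` argument avoids relying on the platform default.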
    