Decoding HTML entities with Python

前端 未结 4 1020
野趣味
野趣味 2020-12-04 16:46

I\'m trying to decode HTML entries from here NYTimes.com and I cannot figure out what I am doing wrong.

Take for example:

\"U.S. Adviser’         


        
4条回答
  •  借酒劲吻你
    2020-12-04 17:49

    >>> from HTMLParser import HTMLParser
    >>> print HTMLParser().unescape('U.S. Adviser’s Blunt Memo on Iraq: '
    ...                             'Time ‘to Go Home’')
    U.S. Adviser’s Blunt Memo on Iraq: Time ‘to Go Home’
    

    The function is undocumented in Python 2. It is fixed in Python 3.4+: it is exposed as html.unescape() there.

提交回复
热议问题