I am trying to convert an HTML block to text using Python.
Input:
I needed a way to do this on a client's system without downloading additional libraries. I never found a good solution, so I wrote my own. Feel free to use it if you like.
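import urllib  # not used by html2text itself

def html2text(strText):
    str1 = strText
    # Keep only the document body. NOTE: the tag names inside the string
    # literals were stripped from this post; "<body" and "</body>" are assumed.
    int2 = str1.lower().find("<body")
    if int2>0:
        str1 = str1[int2:]
    int2 = str1.lower().find("</body>")
    if int2>0:
        str1 = str1[:int2]
    # Each tag in list1 is replaced by the character at the same index in list2
    # (chr(13) = carriage return, chr(9) = tab). Only 'span>' and 'li>' survived
    # in the post; the other six entries are assumed.
    list1 = ['<br>', '<tr', '<td', '</p>', 'span>', 'li>', '</h', 'div>']
    list2 = [chr(13), chr(13), chr(9), chr(13), chr(13), chr(13), chr(13), chr(13)]
    bolFlag1 = True  # False while inside an element whose content is skipped
    bolFlag2 = True  # False while inside a tag
    strReturn = ""
    for int1 in range(len(str1)):
        str2 = str1[int1]
        for int2 in range(len(list1)):
            if str1[int1:int1+len(list1[int2])].lower() == list1[int2]:
                strReturn = strReturn + list2[int2]
        # The post is cut off inside the next comparison. Everything from
        # here on is an assumed completion, not the author's code.
        if str1[int1:int1+7].lower() == '<script':
            bolFlag1 = False
        if str1[int1:int1+9].lower() == '</script>':
            bolFlag1 = True
        if str2 == '<':
            bolFlag2 = False
        if bolFlag1 and bolFlag2:
            strReturn = strReturn + str2
        if str2 == '>':
            bolFlag2 = True
    return strReturn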
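For comparison, the same "no additional libraries" goal can probably also be met with the standard library's html.parser module. The following is only a minimal sketch, assuming Python 3; the names TextExtractor and html_to_text and the choice of which tags produce a newline or a tab are my own assumptions, not part of the answer above.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text, skipping script/style/noscript content."""

    SKIP = {"script", "style", "noscript"}  # content of these is dropped
    BLOCK = {"p", "div", "li", "tr", "h1", "h2", "h3", "h4", "h5", "h6"}

    def __init__(self):
        super().__init__()  # convert_charrefs=True decodes entities like &amp;
        self.parts = []
        self.skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self.skip_depth += 1
        elif tag == "br":
            self.parts.append("\n")
        elif tag == "td":
            self.parts.append("\t")  # separate table cells with a tab

    def handle_endtag(self, tag):
        if tag in self.SKIP:
            if self.skip_depth > 0:
                self.skip_depth -= 1
        elif tag in self.BLOCK:
            self.parts.append("\n")  # end of a block-level element

    def handle_data(self, data):
        if self.skip_depth == 0:
            self.parts.append(data)


def html_to_text(html):
    """Return the visible text of an HTML string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()  # flush any text still buffered by the parser
    return "".join(parser.parts)


if __name__ == "__main__":
    sample = "<p>Hello <b>world</b></p><script>var x = 1;</script><p>Bye</p>"
    print(html_to_text(sample))
```

Unlike the character-by-character scan above, this lets the parser decide where tags begin and end, so a stray "<" or ">" inside text or attribute values is less likely to confuse it. It does not trim or collapse whitespace; that would need a separate pass.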