I am trying to get input values from an HTML doc and want to parse out the values of hidden input fields. For example how can I parse out only the value from the snippet bel
Or with lxml:
import lxml.html
htmlstr = '''
'''
// Parse the string and turn it into a tree of elements
htmltree = lxml.html.fromstring(htmlstr)
// Iterate over each input element in the tree and print the relevant attributes
for input_el in htmltree.xpath('//input'):
name = input_el.attrib['name']
value = input_el.attrib['value']
print "%s : %s" % (name, value)
Gives:
post_form_id : d619a1eb3becdc05a3ebea530396782f fb_dtsg : AQCYsohu