Parsing a nested XML string from a Hive table using PIG
问题 I'm trying to use PIG to extract some XML from a field in a Hive table, rather than from an XML file (which is the assumption of most of the examples I have read). The XML comes from a table arranged as follows: ID, {XML_string} The XML string contains n. number of rows, always containing at least one from up to 10 attributes. We can assume that attribute #1 will always be present and will be unique. <row> <att1></att1> <att2></att2> ... </row> <row> <att1></att1> <att2></att2> ... </row> ...