How to tell lxml.etree.tostring(element) not to write namespaces in python?

后端 未结 3 373
你的背包
你的背包 2020-12-10 15:16

I have a huge xml file (1 Gig). I want to move some of the elements (entrys) to another file with the same header and specifications.

Let\'s say the original file co

3条回答
  •  半阙折子戏
    2020-12-10 16:02

    I often grab a namespace to make an alias for it like this:

    someXML = lxml.etree.XML(someString)
    if ns is None:
          ns = {"m": someXML.tag.split("}")[0][1:]}
    someid = someXML.xpath('.//m:ImportantThing//m:ID', namespaces=ns)
    

    You could do something similar to grab the namespace in order to make a regex that will clean it up after using tostring.

    Or you could clean up the input string. Find the first space, check if it is followed by xmlns, if yes, delete the whole xmlns bit up to the next space, if no delete the space. Repeat until there are no more spaces or xmlns declarations. But don't go past the first >.

提交回复
热议问题