I extracted the text from some long XML files into a list of strings, so each file is represented by one string element in the list.
The problem is that I need to clean