I would like to use python2.7 to remove anything that isn\'t the documents\' text from EDGAR filings (which are available online as .txt files). An example of what the file
The pysec project looks promising. It's a basic Django app that downloads the Edgar index and then allows you to download specific filings and extract financial parameters from the XBRL.