I'm revisiting a stalled project and looking for advice on modernizing thousands of "old" documents and making them available on the web.
Documents exist in various forma
Use Sunspot, RSolr, or a similar Solr client library. Both are Ruby clients for Solr, which is built on Lucene and can index most major document formats.
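To give a rough idea of what indexing a file looks like, here is a minimal sketch that only builds the request URL for Solr's extracting request handler (`/update/extract`, also known as Solr Cell), which parses rich documents via Apache Tika. The helper name `extract_url`, the core name `documents`, the host, and the document id are all made up for illustration, and the handler has to be enabled in your Solr configuration. You would then POST the file to this URL, whether through RSolr or plain `curl`.

```ruby
require "uri"

# Hypothetical helper (not part of RSolr or Sunspot): builds the URL for
# sending one file to Solr's extracting request handler.
# "literal.id" sets the unique id field of the resulting Solr document;
# "commit" asks Solr to make the document searchable right away.
def extract_url(base_url, core, doc_id, commit: true)
  params = { "literal.id" => doc_id, "commit" => commit.to_s }
  "#{base_url}/solr/#{core}/update/extract?#{URI.encode_www_form(params)}"
end

# Assumed local Solr instance and core name, purely for illustration.
puts extract_url("http://localhost:8983", "documents", "report-001")
```

Once the documents are indexed, searching them from a web app is an ordinary Solr query, which is the part Sunspot and RSolr wrap for you.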