Python file indexing and searching

谎友^ 2020-12-16 17:03

I have a large set of files (HDF) that I need to make searchable. In Java I would use Lucene for this, as it's a file and document indexing engine. I don't know what t

5 answers
  •  庸人自扰
    2020-12-16 17:53

    I haven't done indexing before, but the following may be helpful:

    1. pyIndex - http://rgaucher.info/beta/pyIndex/ -- File indexing library for Python
    2. http://www.xml.com/pub/a/ws/2003/05/13/email.html -- That's a script for searching Outlook email using Python and Lucene
    3. http://gadfly.sourceforge.net/ -- Aaron Watters' Gadfly database (I think you can use this one for indexing. I haven't used it myself.)

    As far as using HDF files goes, I have heard of a module called h5py.
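
To make the idea concrete, here is a minimal sketch of how h5py could feed a small hand-rolled inverted index. It is not a Lucene replacement, and it assumes the files are HDF5 and that searching over group/dataset names and attributes is enough. The helper names (`tokenize`, `hdf_terms`, `build_index`, `search`) and the file names are made up for illustration; only `h5py.File`, `visititems` and `attrs` come from h5py itself.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase a string and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def hdf_terms(path):
    """Collect searchable strings from one HDF5 file.

    Gathers the name and attributes of every group and dataset below the
    root. Assumes the third-party h5py package is installed; it is imported
    lazily so the rest of the sketch works without it.
    """
    import h5py

    strings = []

    def visit(name, obj):
        # Called once per group/dataset; returning None continues the walk.
        strings.append(name)
        for key, value in obj.attrs.items():
            strings.append(f"{key} {value}")

    with h5py.File(path, "r") as f:
        f.visititems(visit)
    return strings


def build_index(docs):
    """Build an inverted index: term -> set of file names containing it."""
    index = defaultdict(set)
    for filename, strings in docs.items():
        for s in strings:
            for term in tokenize(s):
                index[term].add(filename)
    return index


def search(index, query):
    """Return the files that contain every term in the query (AND search)."""
    terms = tokenize(query)
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result
```

With real files you would build the input as something like `docs = {p: hdf_terms(p) for p in paths}`, where `paths` is your own list of HDF5 files, then call `search(build_index(docs), "temperature daily")`. Keeping the index as a plain dict of sets makes the sketch easy to follow, but for a large collection a proper search library would scale better.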

    I hope this helps.
