I am just wondering if we could achieve some RDBMS capabilities in lucene.
Example: 1) I have 10,000 project documents (pdf files) which have to be indexed with thei
Sounds like a perfectly good thing to do. The only limitation you'll have (by storing a reference to the project in Lucene rather than the project data itself) is that you won't be able to query both the document text and project metadata at the same time. For example, "documentText:foo OR projectName:bar" . If you have no such requirement, then seems like storing the ID in Lucene which refers to a database row is a fine thing to do.