I am looking for Apache Lucene web crawler written in java if possible or in any other language. The crawler must use lucene and create a valid lucene index and document fil
Take a look at solr search server and nutch (crawler), both are related to the lucene project.