I have installed Nutch and Solr to crawl a website and search it. As you know, we can index the meta tags of web pages into Solr with Nutch's parse-metatags plugin (http:
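For reference, the meta-tag setup mentioned above usually amounts to a few properties in `conf/nutch-site.xml`. This is only a minimal sketch for Nutch 1.x: the tag names `description` and `keywords` are example choices, and the `plugin.includes` value is an assumed baseline that will differ depending on your Nutch version and on which protocol and indexer plugins you already use.

```xml
<!-- conf/nutch-site.xml (sketch; adapt the values to your own install) -->
<configuration>
  <!-- Enable parse-metatags and index-metadata alongside the usual plugins.
       The rest of this regex is an assumed baseline, not a required value. -->
  <property>
    <name>plugin.includes</name>
    <value>protocol-http|urlfilter-regex|parse-(html|tika|metatags)|index-(basic|anchor|metadata)|indexer-solr|scoring-opic|urlnormalizer-(pass|regex|basic)</value>
  </property>
  <!-- Meta tags to extract at parse time (example names). -->
  <property>
    <name>metatags.names</name>
    <value>description,keywords</value>
  </property>
  <!-- Parse metadata keys to copy into the index; parse-metatags
       stores each tag under the "metatag." prefix. -->
  <property>
    <name>index.parse.md</name>
    <value>metatag.description,metatag.keywords</value>
  </property>
</configuration>
```

The Solr schema also needs matching fields (here `metatag.description` and `metatag.keywords`), otherwise indexing may reject the documents.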
You can use one of these custom plugins to parse XML files based on XPath (or CSS selectors):