My current task on hand is to figure out the best approach to load millions of documents in solr. The data file is an export from DB in csv format.
Currently, I am t
Above answers have explained really well the ingestion strategies from single machine.
Few more options if you have big data infrastructure in place and want to implement distributed data ingestion pipeline.
hive- solr connector, spark- solr connector.
PS:
StandardDirectoryFactory.autoCommit and autoSoftCommit configuration in solrconfig.xml file.SolrServerException: No live SolrServers available to handle this request