I would like to get some frequently occurring phrases with Lucene. I am getting some information from TXT files, and I am losing a lot of context for not having information
Julia, It seems what you are looking for is n-grams, specifically Bigrams (also called collocations).
Here's a chapter about finding collocations (PDF) from Manning and Schutze's Foundations of Statistical Natural Language Processing.
In order to do this with Lucene, I suggest using Solr with ShingleFilterFactory. Please see this discussion for details.