Extract a substring between two given words using regex in Java

囚心锁ツ 2020-12-30 09:59

I would like to extract the substring between two given words using Java.

For example:

This is an important example about regex for my work.
         


        
3 answers
  •  滥情空心
    2020-12-30 10:26

    For your first question, make the quantifier lazy. Put a question mark after the quantifier and it will match as little as possible.

    (?<=an).*?(?=for)
    

    I don't know what the additional . at the end of .*. is meant to do; it is unnecessary.
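
    As a minimal sketch of the lazy lookaround pattern above, applied to the example sentence from the question. The class name LazyLookaroundDemo and the helper between are made up for illustration. Note that the match keeps the spaces next to "an" and "for", because the lookarounds exclude only the words themselves.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class LazyLookaroundDemo {
    // Hypothetical helper: returns the leftmost match of the lazy lookaround
    // pattern, or null when the input contains no match.
    static String between(String input) {
        Matcher matcher = Pattern.compile("(?<=an).*?(?=for)").matcher(input);
        return matcher.find() ? matcher.group() : null;
    }

    public static void main(String[] args) {
        String text = "This is an important example about regex for my work.";
        // Square brackets make the surrounding spaces visible.
        // Prints: [ important example about regex ]
        System.out.println("[" + between(text) + "]");
    }
}
```

    Because the quantifier is lazy, the match stops at the first "for" that follows, even if the input contains more than one.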

    For your second question, you have to define what a "word" is. Here I would say it is probably just a sequence of non-whitespace characters followed by a whitespace character, something like this:

    \S+\s
    

    and repeat this 3 times like this

    (?<=an)\s(\S+\s){3}(?=for)
    

    To ensure that the pattern matches only on whole words, use word boundaries:

    (?<=\ban\b)\s(\S+\s){1,5}(?=\bfor\b)
    

    See it online here on Regexr

    {3} matches exactly 3 repetitions; for a minimum of 1 and a maximum of 3, use {1,3}.
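
    The effect of the repetition count can be sketched as follows. The class WordLimitDemo and the helper wordsBetween are hypothetical names; the pattern is the word-boundary version above with the {min,max} bounds passed in as parameters.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class WordLimitDemo {
    // Hypothetical helper: returns the words between "an" and "for" when
    // their count lies between min and max (inclusive), otherwise null.
    static String wordsBetween(String input, int min, int max) {
        String regex = "(?<=\\ban\\b)\\s(\\S+\\s){" + min + "," + max + "}(?=\\bfor\\b)";
        Matcher matcher = Pattern.compile(regex).matcher(input);
        // trim() drops the whitespace that the pattern itself consumes.
        return matcher.find() ? matcher.group().trim() : null;
    }

    public static void main(String[] args) {
        String text = "This is an important example about regex for my work.";
        // Four words lie between "an" and "for", so {1,5} matches.
        System.out.println(wordsBetween(text, 1, 5)); // important example about regex
        // {3,3} behaves like {3}: exactly three words are required, so no match.
        System.out.println(wordsBetween(text, 3, 3)); // null
    }
}
```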

    Alternative:

    As dma_k correctly stated, in your case it is not necessary to use lookbehind and lookahead. See the Matcher documentation about groups.

    You can use a capturing group instead. Just put the part you want to extract in parentheses and it will be stored in a capturing group.

    \ban\b(.*?)\bfor\b
    

    See it online here on Regexr

    You can then access this group like this:

    System.out.println("I found the text: " + matcher.group(1).toString());
                                                            ^
    

    You have only one pair of parentheses, so it's simple: pass 1 to matcher.group(1) to access the first capturing group.
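
    Putting the capturing-group alternative together, a self-contained sketch could look like this. The class CaptureGroupDemo and the helper extract are illustrative names, not part of the original question.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class CaptureGroupDemo {
    // Hypothetical helper: returns the text captured between the whole words
    // "an" and "for", or null when the pattern does not match.
    static String extract(String input) {
        Matcher matcher = Pattern.compile("\\ban\\b(.*?)\\bfor\\b").matcher(input);
        // find() must succeed before group(1) may be read.
        return matcher.find() ? matcher.group(1).trim() : null;
    }

    public static void main(String[] args) {
        String text = "This is an important example about regex for my work.";
        // Prints: I found the text: important example about regex
        System.out.println("I found the text: " + extract(text));
    }
}
```

    matcher.group(1) already returns a String, so the toString() call in the snippet above is optional. Calling group(1) without a successful find() or matches() throws an IllegalStateException, which is why the sketch checks find() first.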
