RegEx to extract text between a HTML tag

我寻月下人不归 2020-12-21 01:44

I'm looking for a regular expression that extracts the text between HTML tags of different types.

For example:

Span 1 - O/p:

3 answers
  •  野趣味 (original poster)
    2020-12-21 02:17

    Your comment shows that you have neglected to escape the backslashes in your regex string.

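    To match lowercase letters as well, add a-z to the character classes, or use Pattern.CASE_INSENSITIVE (or add (?i) to the beginning of the regex). The closing tag is matched with a backreference to the tag name captured in group 1; the text you want is in group 2.

    "<([A-Za-z][A-Za-z0-9]*)\\b[^>]*>(.*?)</\\1>"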
    

    If the tag contents may contain newlines, then use Pattern.DOTALL or add (?s) to the beginning of the regex to turn on dotall/singleline mode.
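
To make the advice above concrete, here is a minimal Java sketch. It assumes the pattern ends with a `</\\1>` backreference to the captured tag name, so the lazy `(.*?)` has something to stop at. The class name `TagTextExtractor`, the method `extract`, and the sample input are hypothetical and are not taken from the question.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class TagTextExtractor {
    // Group 1 captures the tag name, group 2 the text between the tags.
    // Backslashes are doubled because this is a Java string literal.
    // Assumption for this sketch: the pattern closes with "</\\1>",
    // a backreference to the tag name captured in group 1.
    private static final Pattern TAG_TEXT = Pattern.compile(
            "<([A-Za-z][A-Za-z0-9]*)\\b[^>]*>(.*?)</\\1>",
            Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    // Returns the text of every non-overlapping tag pair found in the input.
    static List<String> extract(String html) {
        List<String> texts = new ArrayList<>();
        Matcher m = TAG_TEXT.matcher(html);
        while (m.find()) {
            texts.add(m.group(2));
        }
        return texts;
    }

    public static void main(String[] args) {
        // Hypothetical input: the second element spans two lines,
        // which only matches because DOTALL is enabled.
        String html = "<span>Span 1</span><div class=\"x\">line one\nline two</div>";
        for (String text : extract(html)) {
            System.out.println("[" + text + "]");
        }
    }
}
```

Because `(.*?)` is lazy, the match ends at the first closing tag with the same name, so nested elements of the same type are not handled by this approach.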
