HTML to Plain Text with Ruby?

无人共我 2020-12-15 18:03

Is there anything out there to convert HTML to plain text (maybe a Nokogiri script)? Something that would keep the line breaks, but that's about it.

If I write som

9 Answers
  •  忘掉有多难
    2020-12-15 18:20

    Is simply stripping tags and excess line breaks acceptable?

    html.gsub(/<\/?[^>]*>/, '').gsub(/\n\n+/, "\n").gsub(/^\n|\n$/, '')
    

    The first gsub strips tags, the second collapses runs of consecutive line breaks down to one, and the third removes a line break at the start and end of the string.
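One thing to watch with the one-liner above: a `<br>` or a closing `</p>` is removed like any other tag, so text that was only separated by markup ends up on the same line. If the goal is to keep those line breaks, a rough sketch is to turn such tags into "\n" before stripping the rest. The helper name `html_to_text` and the list of block-level tags are assumptions for illustration, not part of the original answer, and a regex like this will not cope with every HTML document (comments, scripts, entities such as `&amp;` are left untouched).

```ruby
# Hypothetical helper; the name and the chosen tag list are assumptions.
def html_to_text(html)
  text = html.gsub(/<br\s*\/?>/i, "\n")                # <br>, <br/>, <br /> become a line break
  text = text.gsub(/<\/(?:p|div|li|h[1-6])>/i, "\n")    # closing block-level tags become a line break
  text = text.gsub(/<\/?[^>]*>/, '')                    # strip every remaining tag
  text = text.gsub(/\n\n+/, "\n")                       # collapse runs of line breaks to one
  text.gsub(/\A\n|\n\z/, '')                            # drop a line break at the very start or end
end

puts html_to_text("<p>Hello<br>world</p>\n\n<p>Second <b>paragraph</b></p>")
```

The last step uses `\A` and `\z` rather than `^` and `$` because in Ruby `^` and `$` match at every line boundary, while `\A` and `\z` match only at the start and end of the whole string.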
