HTML to Plain Text with Ruby?

无人共我 2020-12-15 18:03

Is there anything out there to convert HTML to plain text (maybe a Nokogiri script)? Something that would keep the line breaks, but that's about it.

If I write som

9 answers
  •  猫巷女王i
    2020-12-15 18:24

    Building slightly on Matchu's answer, this worked for my (very similar) requirements:

    # squish is provided by ActiveSupport (Rails); it collapses all whitespace, newlines included
    html.gsub(/<\/?[^>]*>/, ' ').gsub(/\n\n+/, "\n").gsub(/^\n|\n$/, ' ').squish
    

    Hope it makes someone's life a bit easier :-)
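
For readers without ActiveSupport, the same idea can be sketched in plain Ruby in a way that keeps line breaks, as the question asks. The method name `html_to_text` and the set of tags treated as line breaks are assumptions made for illustration, and regex-based stripping is only a rough approach for simple, well-formed markup.

```ruby
# Hypothetical helper (not from the answer above): strip tags with plain Ruby
# while keeping line breaks. Regex-based, so only suited to simple HTML.
def html_to_text(html)
  html
    .gsub(/<br\s*\/?>|<\/(?:p|div|li|h[1-6])>/i, "\n") # assumed line-breaking tags become newlines
    .gsub(/<\/?[^>]*>/, ' ')                            # every remaining tag becomes a space
    .gsub(/[ \t]+/, ' ')                                # collapse runs of spaces and tabs
    .gsub(/ ?\n ?/, "\n")                               # drop spaces around newlines
    .gsub(/\n{2,}/, "\n")                               # collapse blank lines
    .strip
end

puts html_to_text("<p>Hello <b>world</b></p>\n<p>Second<br>line</p>")
```

The sample call prints three lines: `Hello world`, `Second`, and `line`. HTML entities such as `&amp;` are left untouched by this sketch.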
