How to extract plain text from a DOCX file using the new OOXML support in Apache POI 3.5?

后端 未结 2 1879
时光取名叫无心
时光取名叫无心 2021-01-01 21:08

On September 28, 2009 the Apache POI project released version 3.5 which officially supports the OOXML formats introduced in Office 2007, like DOCX and XLSX.

Please p

2条回答
  •  春和景丽
    2021-01-01 21:45

    This is more generic

    POITextExtractor poitex = ExtractorFactory.createExtractor(in);

    return poitex.getText();

提交回复
热议问题