问题
I feel like this should be easier, but I cannot figure this out.
How do I filter out documents from a corpus based on metadata. To be more specific, I have a corpus of 576 documents, each of which has the tag 'Section'. Section has a number of different values such as, "News", "Editorial" and "Comment".
How do i use tm_filter to filter out documents, say, that have "Editorial" and or "Comment" in this?
I'm sorry I haven't provided reproducible data. I don't really know how to go about producing a sample corpus.
来源:https://stackoverflow.com/questions/33698428/subset-a-corpus-by-meta-data