问题
I am looking for a dataset which contains large quantities of relation tuples. For example, the search of "people" and "location" yields "lives in", "worked in", etc. University of Washington's OpenIE http://OpenIE.cs.washington.edu is a good tool but their dataset is only accessible through web. Where can I download a database or library like this?
回答1:
- OpenIE itself provides large dataset of 11 gb for this purpose. Check this
http://knowitall.cs.washington.edu/paralex/
Although it is an auto answering system, you can consider intermediate relation extraction result for your purpose.
- Another method you could implement is using
syntex parse
. Use syntex parser and write rules to extract subject, object and other entities as per your requirement.
回答2:
I've been collecting all the public datasets containing relationships between named-entities or nominals.
You can find them here:
https://github.com/davidsbatista/Annotated-Semantic-Relationships-Datasets
来源:https://stackoverflow.com/questions/29591485/where-to-find-a-state-of-art-relation-extraction-dataset