Unstructured Text to Structured Data

我寻月下人不归 2021-02-02 03:23

I am looking for references (tutorials, books, academic literature) on structuring unstructured text in a manner similar to the Google Calendar Quick Add button.

4 answers
  •  情深已故
    2021-02-02 03:50

    If you are only handling cases like the example you cited, you are better off using a manual rule-based approach that is 100% predictable and covers 90% of the cases it might encounter in production.

    You could enumerate lists of all possible brands and categories and detect which is which in an input string, because there is usually very little overlap between these two lists.

    The other two could easily be detected and extracted with regular expressions (for example, 1-3 digit numbers are always sizes).
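As a rough illustration of the list lookup plus regular expression idea, here is a minimal Python sketch. The brand and category vocabularies and the function name `parse_item` are made up for this example, and only the size rule mentioned above is implemented, since the other fields depend on your data.

```python
import re

# Hypothetical vocabularies; a real system would load the complete lists.
BRANDS = {"acme", "globex", "initech"}
CATEGORIES = {"shoes", "jeans", "jacket"}

# A run of 1-3 digits that is not adjacent to other digits is treated as a size.
SIZE_RE = re.compile(r"(?<!\d)\d{1,3}(?!\d)")


def parse_item(text):
    """Return a dict with brand, category and size (None when not found)."""
    result = {"brand": None, "category": None, "size": None}

    # List lookup: the first token found in each vocabulary wins.
    for token in text.lower().split():
        word = token.strip(".,;:!?")
        if word in BRANDS and result["brand"] is None:
            result["brand"] = word
        elif word in CATEGORIES and result["category"] is None:
            result["category"] = word

    # Regular expression: the first short number in the string is the size.
    match = SIZE_RE.search(text)
    if match:
        result["size"] = int(match.group())
    return result


print(parse_item("Acme running shoes 42"))
```

Words that appear in neither list are ignored, so the behaviour stays predictable. Multi-word brand names would need phrase matching rather than the simple token split used here.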

    Your problem domain doesn't seem big enough to warrant a more heavy-duty approach such as statistical learning.
