I have a large dataset with two columns: a column of text and a column of ids.
column          id
hello world     1
dinner          1
father          1
hi              1
work/related    2
summ