How to group text data based on document similarity?
问题 Consider the dataframe like below df = pd.DataFrame({'Questions': ['What are you doing?','What are you doing tonight?','What are you doing now?','What is your name?','What is your nick name?','What is your full name?','Shall we meet?', 'How are you doing?' ]}) Questions 0 What are you doing? 1 What are you doing tonight? 2 What are you doing now? 3 What is your name? 4 What is your nick name? 5 What is your full name? 6 Shall we meet? 7 How are you doing? How to group the dataframe with