How to cluster search engine keywords?
问题 From Google Analytics I have a (long) list of keywords that people used in search engines to find my website. I want to find the 'core keywords', hypothetical example: java online training learning java scala training training for java online training java learn scala programming The ideal result would be: 'java', 'online training', 'training', 'scala' and 'learn'. The difficulty seems to be detecting complete phrases, ignoring common words (for) and handling variations (learn-learning). Is