fselector

Feature selection in document-feature matrix by using chi-squared test

送分小仙女□ 提交于 2021-02-06 12:50:43
问题 I am doing texting mining using natural language processing. I used quanteda package to generate a document-feature matrix (dfm). Now I want to do feature selection using a chi-square test. I know there were already a lot of people asked this question. However, I couldn't find the relevant code for that. (The answers just gave a brief concept, like this: https://stats.stackexchange.com/questions/93101/how-can-i-perform-a-chi-square-test-to-do-feature-selection-in-r) I learned that I could use

Feature selection in document-feature matrix by using chi-squared test

三世轮回 提交于 2021-02-06 12:49:32
问题 I am doing texting mining using natural language processing. I used quanteda package to generate a document-feature matrix (dfm). Now I want to do feature selection using a chi-square test. I know there were already a lot of people asked this question. However, I couldn't find the relevant code for that. (The answers just gave a brief concept, like this: https://stats.stackexchange.com/questions/93101/how-can-i-perform-a-chi-square-test-to-do-feature-selection-in-r) I learned that I could use

Error: protect(): protection stack overflow while feature extraction

别说谁变了你拦得住时间么 提交于 2019-12-11 03:59:30
问题 I have a dataframe that has 4755 rows and 27199 columns. It's actually a document term matrix and I'm trying to perform feature selection using the "FSelector" package. Here is some of the code below: library(FSelector) weights <- information.gain(Flag~., dtmmatdf) Each time I do this I get an error Error: protect(): protection stack overflow I have a 24GB RAM and the dataframe is about 500Mb in size. So I don't know what the problem is and how do I fix it? 来源: https://stackoverflow.com