Label or score outliers in R

为君一笑 提交于 2019-12-26 09:25:07

问题


I'm looking for some easy to use algorithms in R to label (outlier or not) or score (say, 7.5) outliers row-wise. Meaning, I have a matrix m that contains several rows and I want to identify rows who represent outliers compared to the other rows.

m <- matrix( data = c(1,1,1,0,0,0,1,0,1), ncol = 3 )

To illustrate some more, I want to compare all the (complete) rows in the matrix with each other to spot outliers.


回答1:


Here's some really simple outlier detection (using either the boxplot statistics or quantiles of the data) that I wrote a few years ago.

Outliers

But, as noted, it would be helpful if you'd describe your problem with greater precision.

Edit:

Also you say you want row-wise outliers. Do you mean to say that you're interested in identifying whole rows vs observations within a variable (as is typically done)? If so, you'll want to use some sort of distance metric, though which metric you choose will depend on your data.



来源:https://stackoverflow.com/questions/32870703/label-or-score-outliers-in-r

标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!