发表新帖

发表新帖

Count number of unique values per row [duplicate]

后端未结

关注

 2  2038

别跟我提以往 2021-01-17 22:09

2条回答

暗喜 (楼主)

2021-01-17 22:59
We can also use a vectorized approach with regex. After pasteing the elements of each row of the dataset (do.call(paste0, ...), match a pattern of any character, capture as a group ((.)), using the positive lookahead, match characters only if it appears again later in the string (\\1 - backreference for the captured group and replace it with blank (""). So, in effect only those characters remain that will be unique. Then, with nchar we count the number of characters in the string.
```
example$count <- nchar(gsub("(.)(?=.*?\\1)", "", do.call(paste0, example), perl = TRUE))
example$count
#[1] 2 1 3 3 2 1
```
0 讨论(0)

查看其它2个回答
发布评论:

提交评论
- 加载中...

热议问题