Convert currency with commas into numeric

僤鯓⒐⒋嵵緔 提交于 2019-11-26 08:24:22

问题


I have a column in a dataframe as follows:

COL1
$54,345
$65,231
$76,234

How do I convert it into this:

COL1
54345
65231
76234

The way I tried it at first was:

df$COL1<-as.numeric(as.character(df$COL1))

That didn\'t work because it said NA\'s were introduced.

Then I tried it like this:

df$COL1<-as.numeric(gsub(\"\\\\$\",\"\",as.character(df$COL1)))

And the same this happened.

Any ideas?


回答1:


The reason why the gsub didn't work was there was , in the column, which is still non-numeric. So when convert to 'numeric' with as.numeric, all the non-numeric elements are converted to NA. So, we need to remove both , and $ to make it work.

df1$COL1 <- as.numeric(gsub('[$,]', '', df1$COL1))

We match the $ and , inside the square brackets ([$,]) so that it will be considered as that character ($ left alone has special meaning i.e. it signifies the end of the string.) and replace it with ''.

Or we can escape (\\) the character ($) to match it and replace by ''.

df1$COL1 <- as.numeric(gsub('\\$|,', '', df1$COL1))



回答2:


We could use parse_number from readr package which removes any non-numeric characters.

library(readr)
parse_number(df$COL1)
#[1] 54345 65231 76234



回答3:


Another option using stringr library to remove '$' and ',' then convert as follows:

df %>% mutate(COL1 = COL1 %>% str_remove_all("\\$,") %>% as.numeric())


来源:https://stackoverflow.com/questions/31944103/convert-currency-with-commas-into-numeric

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!