Using R with tidyquant and massive data

Posted by 你。 on 2019-12-04 15:40:14

Check out the article at:

datascience.la/dplyr-and-a-very-basic-benchmark

It has a table comparing runtimes for several of the data wrangling tasks you are performing. Judging by that table, dplyr backed by a data.table is likely to run much faster than dplyr backed by a data frame.

There’s a link to the benchmarking code used to make the table, too.

In short, try adding a key, and try using a data.table instead of a data frame.

To make x the key of a data.table named dt, use setkey(dt, x).
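As a rough sketch of what that could look like, assuming the data.table package is installed: the table below is made-up example data (the columns x and y and the row count are hypothetical, not taken from the question), and the result name res is only for illustration.

```r
library(data.table)

# Hypothetical example data: a grouping column x and a numeric column y
set.seed(1)
df <- data.frame(
  x = sample(letters, 1e5, replace = TRUE),
  y = rnorm(1e5),
  stringsAsFactors = FALSE
)

# Convert the data frame to a data.table (as.data.table makes a copy)
dt <- as.data.table(df)

# Sort dt by x in place and mark x as its key
setkey(dt, x)

# Confirm which column is the key
key(dt)

# Keyed subset: rows where x == "a", found by binary search on the key
a_rows <- dt[.("a")]

# Grouped aggregation by the key column
res <- dt[, .(mean_y = mean(y)), by = x]
```

Whether the key helps depends on the operations involved; it mainly benefits repeated subsets and joins on the key column, so it is worth timing your own workload with and without it.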

While Pake's answer deals with the problem as described, I found a solution to the underlying problem. For compatibility reasons I had been using R version 3.4.3. I am now using the newer version 3.5.1, which works fine.
