I can give you the highlights for sparklyr:
- Supports dplyr, Spark ML and H2O.
- Distributed on CRAN.
- Easy to install.
- Extensible.
In the current 0.4 version, it does not support arbitrary parallel code execution yet. However, extensions can be easily written in Scala to overcome this limitation, see sparkhello.