Integrating RStudio Workbench with Spark and sparklyr

sparklyr is an R interface for Apache Spark. With it you can install and connect to Spark, filter and aggregate Spark datasets using dplyr syntax, and then bring the results into R for analysis and visualization.
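As a minimal sketch of that workflow, the example below connects to a local Spark instance, copies a small data frame into Spark, filters and aggregates it with dplyr verbs, and collects the result back into R. It assumes the sparklyr and dplyr packages are installed and that a local Spark distribution is available; the `master = "local"` setting, the `mtcars` dataset, and the table name `mtcars_spark` are illustrative choices, not part of this guide. On a real cluster the connection settings will differ.

```r
library(sparklyr)
library(dplyr)

# One-time setup (assumption: no Spark distribution is installed locally yet).
# spark_install()

# Connect to a local Spark instance; a cluster would use a different master.
sc <- spark_connect(master = "local")

# Copy a built-in R data frame into Spark (illustrative table name).
cars_tbl <- copy_to(sc, mtcars, "mtcars_spark", overwrite = TRUE)

# Filter and aggregate inside Spark using dplyr syntax, then
# collect() the small summarized result into an R data frame.
summary_local <- cars_tbl %>%
  filter(mpg > 20) %>%
  group_by(cyl) %>%
  summarise(avg_hp = mean(hp, na.rm = TRUE), n = n()) %>%
  collect()

print(summary_local)

# Close the connection when finished.
spark_disconnect(sc)
```

The dplyr verbs here are translated to Spark SQL and run in Spark; only the summarized rows are transferred to the R session by `collect()`.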

You can install RStudio Workbench, formerly RStudio Server Pro [1], within a Spark/Hadoop cluster and use sparklyr from R sessions.

The following articles describe how to integrate RStudio Workbench with a Spark cluster in different configurations:

Visit for more information.

  1. We have renamed RStudio Server Pro to RStudio Workbench. This change reflects the product's growing support for a wide range of development environments. RStudio Workbench enables R and Python data scientists to use their preferred IDE in a secure, scalable, and collaborative environment, whether that is the RStudio IDE, JupyterLab, Jupyter Notebooks, or VS Code. We want RStudio Workbench to be the best single platform for open source, code-first data science, whether your team uses R or Python. Please see our official announcement and review our FAQ regarding the name change from RStudio Server Pro to RStudio Workbench.