5 R Markdown

5.1 Scheduling

R Markdown reports that are published with source code can be re-executed by RStudio Connect. See section 3.2.2 to learn how to publish with source code.

Re-executing content can either be done manually or on a schedule. After navigating to the “Schedule” pane in the configuration window, you might see “The source code for this content was not published. The output cannot be refreshed.” If this is the case, then you will need to publish source code before it is possible to schedule your content.

Schedule pane with no source code

Similarly, Shiny applications or R Markdown documents with a Shiny runtime (Shiny documents) cannot be scheduled. Shiny assets show the latest data each time they are refreshed.

Schedule pane for a Shiny document

In other cases the Schedule pane provides options to schedule your asset for execution on RStudio Connect.

Schedule pane showing all options

5.1.1 Date and Time

The start date and time defaults to the current user’s date, time, and time zone. The schedule will be built off of this date and time. Take care to keep in mind how daylight savings time (DST) might affect the actual execution time of the report.

If you choose a start date and time in the past, execution will begin at the first future execution that satisfies “Schedule Type,” measured from the time that changes are saved.

5.1.2 Schedule Type

The schedule type and related fields will determine how frequently the R Markdown document is executed by RStudio Connect. Using the “Date and Time” above as an anchor, “Schedule Type” defines the time interval between executions.

The following interval configurations are all supported:

  • defined in minutes
    • e.g.: “every 15 minutes”
    • e.g.: “every 90 minutes”
  • defined in hours
    • e.g.: “every 3 hours”
    • e.g.: “every 25 hours”
  • defined in days
    • e.g.: “every 2 days”
    • e.g.: “every 40 days”
  • every weekday
    • e.g.: “Monday, Tuesday, Wednesday, Thursday, Friday”
  • defined in weeks (on select days of the week)
    • e.g.: “every 2 weeks on Tuesday and Thursday”
    • e.g.: “every 8 weeks on Friday”
  • semi-monthly (1st and 15th)
    • e.g.: “on the 1st and 15th of every month”
  • semi-monthly (14th and last)
    • e.g.: “on the 14th and last day of every month”
  • monthly on a given day of the month
    • e.g.: “every 2 months on the 3rd”
    • e.g.: “every 8 months on the 30th”
  • monthly on a week / day
    • e.g.: “every 3 months on the 3rd Tuesday of the month”
    • e.g.: “every 9 months on the 1st Saturday of the month”
  • defined in years
    • e.g.: “every 2 years”
    • e.g.: “every 8 years”

5.1.3 Publish Output

When content executes, you can decide whether or not the output should be saved and published to RStudio Connect. If you opt not to have output published after it is generated, any emails will be sent and any side-effects such as database writes will occur, but the report output will not be saved on Connect. Further, there will be no output history (See section 5.2).

In order to save and publish output, as well as track the history of output on Connect, keep the “Publish Output” box checked.

5.1.4 Send Email

This section of the Schedule configuration determines if and where emails will be delivered after execution. If checked, the owner of the content will always be notified unless they opt out. Further, content output can be sent to:

  • all collaborators
  • all viewers
  • named additional recipients

By default the rendered content will be attached and RStudio Connect will generate a standard subject line and email body. For more information on the ability to customize this email, see the section on email customization.

If you are having difficulty sending emails, contact your RStudio Connect administrator.

5.2 Report History

You can view past renders of R Markdown content in RStudio Connect using the History tool. This includes both manually triggered and scheduled execution of content (See section 5.1).

To access the history of an R Markdown report, browse to the “More” button (indicated by ellipsis in a circle) and select “History.”

Navigate to R Markdown history

This will open up a dialog to select the output bundles from previous executions of this report. The latest execution is in bold and the currently visible version of the output is highlighted.

R Markdown history selection

Different variants of a parameterized report will have a separate rendering history associated with them.

R Markdown history selection for a report variant

Each collection of output is saved to disk and is available at a unique URL, along with any output files (See section 5.4). Permissions for saved output will be the same as permissions for the “parent document” and the source code that is stored on Connect.

5.3 Output Metadata

We normally think of R Markdown documents as producing a single output artifact, such as an HTML or PDF file. The rmarkdown package allows report authors to emit additional output metadata from their report. RStudio Connect takes advantage of this metadata, allowing output files, custom email subjects, and additional email attachments.

There are two ways to set output metadata: in the YAML header and in R code chunks.

The YAML header is a good place to configure default values for metadata that you always want to emit. All output metadata lives under the rmd_output_metadata section in the YAML:

---
title: "Report Title"
rmd_output_metadata:
  rsc_email_subject: Quarterly Department Metrics
---

You can also use R code to set output metadata. This is useful if you want metadata to vary based on conditions or variables within your code.

changePercent <- 20
subject <- paste("Sales changed by ", changePercent, "%", sep = "")
rmarkdown::output_metadata$set(rsc_email_subject = subject)

Your R code can also read the current state of your output metadata. This can help you gradually alter this information as the report runs.

subject <- rmarkdown::output_metadata$get("rsc_email_subject")
if (changePercent > 10) {
    subject <- paste(subject, "Exceeding goals!")
    rmarkdown::output_metadata$set(rsc_email_subject = subject)
}

The rmd_output_metadata names starting with rsc_ are reserved for use with RStudio Connect.

5.4 Output Files

5.4.1 Introduction to Output Files

Output files are files that live alongside your rendered report. They could be plots, data files, or other artifacts generated from the R code in your report. Output files will be available via HTTP, and they will be versioned in the same way as your report. Output files are also subject to the same access controls as your report.

Connect will not process any output files that exist outside the working directory of the report that is rendering. That means that you cannot use absolute paths (e.g., /root/file.csv), relative paths (e.g., ../file.csv) or subdirectories (e.g., folder/file.csv).

5.4.2 How to Work with Output Files

There are two ways to specify which files should be treated as output files. The first is to list the file names in the R Markdown YAML header’s rmd_output_metadata section under rsc_output_files, like so:

---
title: "Report Title"
rmd_output_metadata:
  rsc_output_files:
    - "data.csv"
---

rsc_output_files takes a list of names of files that should be available after the report has rendered. If you list a file that does not exist after rendering your report, Connect will log a message but continue trying to processing the other files listed. If the output files are not generated during the rendering of your report, then you will also need to list them as resource files when you upload your report to Connect. For more information on resource files, see Section 5.5.

It is also possible to specify the list of output files from R code. For example:

rmarkdown::output_metadata$set(rsc_output_files = list("data1.csv", "data2.csv"))

You can also make a link to share an output file from your report using the standard Markdown links as supported in R Markdown. For example, if you want to share a file named data.csv, you make a link to it in your report like this:

Here is the data used in my report: [data.csv](data.csv)

Because output files are versioned along with the rendering of their report, they also benefit from historical views. In the example above, if you view a historical rendering of the report, when you click on the data.csv link, you will get a download of the file from the same point in time as the report.

5.4.3 Accessing output files over HTTP

Content deployed to http://connect.mycompany.com/content/42/ will have its output files available under that URL path. An output file named daily-summary.csv will be available at the URL http://connect.mycompany.com/content/42/daily-summary.csv.

The URL for your content is the same as its “Open Solo” location and is available in the RStudio Connect dashboard.

5.5 Resource Files

If you want RStudio Connect to host a file that you have in your report’s source directory on your computer, and that file is not generated by the report when you render it, then you will need to mark that file as a resource file. Like an output file, a resource file can be a plot, a data file, or any other artifact that exists as a file. You can use the RStudio IDE to select resource files, or you can list them in the R Markdown header:

---
title: "Report Title"
rmd_output_metadata:
  rsc_output_files:
    - "data.csv"
resource_files:
  - "data.csv"
---

Unlike rsc_output_files, the resource_files key is not nested under rmd_output_metadata. If you do not list your resource files under resource_files, then you will need to add them manually using the “Add More…” button when deploying from the IDE. See Section 3 for more information on publishing additional resource files.

5.6 Email Customization

5.6.1 Email Subject

You can customize the subject line used when an email of a report is generated. RStudio Connect uses the output metadata entry named rsc_email_subject as email subject. A report without an rsc_email_subject entry uses its published document name.

Use the YAML header to specify a simple, static text override of the email subject:

---
title: "Report Title"
rmd_output_metadata:
    rsc_email_subject: "My Email Subject Goes Here"
---

Set the email subject in an R code chunk if you need to dynamically build the subject:

changePercent <- 20
subject <- paste("Sales changed by ", changePercent, "%", sep = "")
rmarkdown::output_metadata$set(rsc_email_subject = subject)

The RSC_EMAIL_SUBJECT environment variable contains the name of your published report, which also acts as the default email subject. This environment variable is helpful if you want to add, but not fully replace the subject.

changePercent <- 20
defaultSubject <- Sys.getenv("RSC_EMAIL_SUBJECT")
subject <- sprintf("%s changed by %d%%", defaultSubject, changePercent)
rmarkdown::output_metadata$set(rsc_email_subject = subject)

You can also read the current subject from the output metadata to incrementally compose a final subject.

changePercent <- 20
subject <- rmarkdown::output_metadata$get("rsc_email_subject")
subject <- sprintf("%s changed by %d%%", subject, changePercent)
rmarkdown::output_metadata$set(rsc_email_subject = subject)

5.6.2 Email Attachments

An attachment is an output file that will be attached to an emailed report. You can specify email attachment files in essentially the same way as output files, but instead of listing them under rsc_output_files, you list them under rsc_email_attachments. For example:

---
title: "Report Title"
rmd_output_metadata:
  rsc_output_files:
    - "data.csv"
  rsc_email_attachments:
    - "attachment_1.csv"
    - "attachment_2.csv"
---

In the example above, we are specifying the file data.csv to be an output file and the files attachment_1.csv and attachment_2.csv to be email attachments.

It is also possible to specify the list of email attachments from R code. For example:

attachments <- list("attachment_1.csv", "attachment_2.csv")
rmarkdown::output_metadata$set(rsc_email_attachments = attachments)

An email attachment will be accessible via HTTP just like an output file, and you can make a link to it in your report in the same way.

Some mail systems have limitations on attachments in email messages. Attachments from your report need to follow the restrictions enforced by your organization. Connect is not aware of those limitations. Please work with your systems administrators / IT organization if you have trouble delivering file attachments.

5.6.3 Suppressing Scheduled Email

Scheduled reports can be configured to automatically send email after generating their output. The author of the report can suppress this option by giving the rsc_email_suppress_scheduled metadata property a logical (Boolean) value.

Use the YAML header to set a default value for rsc_email_suppress_scheduled. If set to true and not altered by downstream code, the report will never permit email after scheduled execution. Any attempt to configure post-update emailing in the RStudio Connect dashboard will have no effect.

---
title: "Report Title"
rmd_output_metadata:
  rsc_email_suppress_scheduled: true
---

You can make this decision dynamically based on data available in your report. This example assumes email is suppressed by default (as in the YAML above) but triggered when some business condition is exceeded.

changePercent <- compute_weekly_sales_change()
if (changePercent < -5 || changePercent > 5) {
    // email on substantial sales changes.
    rmarkdown::output_metadata$set(rsc_email_suppress_scheduled = FALSE)
}

5.6.4 Suppress Attaching Report To Email

By default, Connect adds the generated document as an attachment to email messages for that report. You can prevent this attachment from your R Markdown report by giving the rsc_email_suppress_report_attachment metadata property a logical (Boolean) value.

Use the YAML header to set a default value for rsc_email_suppress_report_attachment. A true value that is not later adjusted indicates that the generated content is never to be attached to email.

---
title: "Report Title"
rmd_output_metadata:
  rsc_email_suppress_report_attachment: true
---

You can also make an “attach or not” decision in R code.

rmarkdown::output_metadata$set(rsc_email_suppress_report_attachment = TRUE)

Attachments configured by the rsc_email_attachments metadata property (5.6.2) are still attached and not affected by the rsc_email_suppress_report_attachment setting.