---
title: "Best Practices for Administering RStudio in Production"
author: "Nathan Stephens"
format:
  pptx:
    reference-doc: template.pptx
---
# Overview
## Solutions engineering
### We help you integrate RStudio products into your systems
### Our team
### Where do we hang out?
* [Community.rstudio.com](http://community.rstudio.com)
* [Github.com/sol-eng](https://github.com/sol-eng)
* [Solutions.rstudio.com](http://solutions.rstudio.com)
* [Support.rstudio.com](http://support.rstudio.com)
* [Docs.rstudio.com](http://docs.rstudio.com)
## Who is this webinar for?
### R Admin -- data scientists who want to do more
### R Evangelists
### IT/Ops
### Anyone who wants to try RStudio professional products
## RStudio
### RStudio empowers individuals to be productive with data science.
* Open source and reproducible research
* APIs and interoperability
* Usability and clear documentation
* Inclusive and collective success
* Creating lasting value for data science
## What we do
### RStudio builds open source and professional software for data science
* Our professional features include things like
  * Security
  * Authentication
  * Load balancing
  * Support
### RStudio professional products
* RStudio Server Pro
* RStudio Connect
* RStudio Package Manager
## What is the relationship between R and RStudio?
### We don’t own R, package R, or distribute R
### R Core team: 20 members -- Zero from RStudio
### RStudio products "sit on top of R"
* You standardize on R first
* Install our products second
### We assume you have chosen to invest in R
## Professional R tooling and integration
### Legitimacy
* Recognize R as an analytic standard
### Competencies
* Understand and manage R tooling
### Adoption
* Rely on integrated R based solutions
## Administering RStudio professional products
### People want to know if they are doing things the best way
* R is relatively unknown in most organizations
* No single place to get all the information you need
* Hard to see the forest for the trees
### We see a lot of trial and error
* Organizational hurdles
* Resource limitations
## Outline
### I want to share some best practices for managing RStudio in production
* Share product requirements
* Some tips
* A path for getting started

The goal is to give you a big-picture view of what success looks like, assuming you are using RStudio professional products.
# 5 Best practices for administering RStudio in production
## 1. Keep your system up to date
### Modern tools
* Operating system
* Browsers
### C++11 compiler
* R packages on Linux must be compiled
### Internet access
* R packages
## 2. Support multiple versions of R
### Why do you want to run multiple versions of R?
* Manage upgrades of R
* Test code on a variety of R versions and distributions
* Support projects that depend on various versions of R
* All products support multiple versions of R
### Upgrade yearly (version 3.1.0+)
### Build R from source
* Running multiple versions of R side by side requires building R from source
* Not hard to do (i.e., configure/make/make install)
* [Instructions](https://support.rstudio.com/hc/en-us/articles/360002242413-Multiple-versions-of-R)
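
The configure, make, make install sequence mentioned above can be sketched as a small shell script. This is a minimal sketch, not the procedure from the linked instructions: the `/opt/R/<version>` install prefix, the `build_r` and `run` helper names, the `DRY_RUN` switch, and the default version number are all assumptions chosen for illustration. By default it only prints the commands so they can be reviewed before anything is run with root privileges.

```shell
#!/bin/sh
# Sketch: build one R version from source into its own prefix.
# Assumptions (not from the slides): the /opt/R/<version> layout,
# the helper names run and build_r, and the DRY_RUN switch.
set -eu

# Print each command when DRY_RUN=1 (the default); execute it otherwise.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# The body runs in a subshell so the cd does not leak to the caller.
build_r() (
  version="$1"                 # e.g. 3.5.1
  major="${version%%.*}"       # "3" selects the R-3 directory on CRAN
  prefix="/opt/R/${version}"   # one directory per R version

  run curl -fsSLO "https://cran.r-project.org/src/base/R-${major}/R-${version}.tar.gz"
  run tar -xzf "R-${version}.tar.gz"
  run cd "R-${version}"
  # --enable-R-shlib builds the shared R library that RStudio Server needs.
  run ./configure --prefix="${prefix}" --enable-R-shlib
  run make
  run make install
)

build_r "${1:-3.5.1}"
```

Saved under a hypothetical name such as `build-r.sh`, the script can be run once per version (for example `sh build-r.sh 3.4.4`, then `sh build-r.sh 3.5.1`) to get side-by-side installs. Set `DRY_RUN=0` to execute the commands instead of printing them.
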
## 3. Organize your R packages
### R packages rule the roost.
* Packages will drive your R version, Linux dependencies, and even your operating system
* Data scientists will want access to their most beloved packages
### Managing packages for a single user is easy.
* Managing packages for an entire platform is hard
### RStudio Package Manager solves several problems
* Disconnected, air-gapped environments
* Curate packages into multiple repositories for security and control
* Share internal packages
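
One common way to point every R session at a curated repository is a site-wide `repos` option. The sketch below only prints the line that would go into R's `etc/Rprofile.site`. The repository URL is a placeholder rather than a real Package Manager endpoint, and `repo_option` is a hypothetical helper name.

```shell
#!/bin/sh
# Sketch: emit a site-wide repository setting for R.
# Assumptions: the URL is a placeholder for your own repository,
# and repo_option is a hypothetical helper name.
set -eu

repo_option() {
  repo_url="$1"
  # R reads Rprofile.site at startup, so this option applies to all users.
  printf 'options(repos = c(CRAN = "%s"))\n' "$repo_url"
}

repo_option "${1:-https://packages.example.com/cran/latest}"
```

Appending the output to the `etc/Rprofile.site` file under each R installation's home directory keeps all installed R versions on the same package source.
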
## 4. Use root privileges
The group in your organization that installs, configures, and manages R and RStudio will need root privileges.
### RStudio products
* Installs require root privileges
* Runs require root privileges
  * RStudio Server Pro runs as the root user in order to create new R sessions on behalf of its users
  * RStudio Connect runs as the root user in order to isolate applications and processes
### R
* System-wide installations of R on Linux often require root privileges as well
## 5. Securely manage your users
### R programmers - RStudio Server Pro
* They will need access to R, file shares, databases, and probably many other sensitive systems.
* R processes run as the user under a local account
### End users - RStudio Connect
* End users consume apps and reports.
* R processes typically run under a service account
## Authentication
### Your organization
* Probably has strong opinions on how to authenticate users
* This space is only getting more fragmented, not less
* LDAP, Active Directory, PAM, OAuth, Okta, Duo, Auth0, etc.
### Proxied authentication
* If we don't support your specific system, then you can use our proxied authentication
* With proxied auth, users do not log in through RStudio but through a proxy that you set up
## Supported Auth Methods
### RStudio Server Pro
* PAM (LDAP and Active Directory)
* OAuth 2.0 using Google Apps
* *Proxied authentication*
### RStudio Connect
* LDAP and Active Directory
* OAuth 2.0 using Google Apps
* PAM
* SAML [Beta]
* *Proxied authentication*
## Recommendations (your happy path)
### 1. Keep your operating systems and browsers up to date
### 2. Plan to support multiple versions of R by building R from source
### 3. Organize your R packages for reliability and consistency
### 4. Use root privileges to install and run RStudio products
### 5. Securely manage your R programmers and end users
# Getting Started
## Tooling
### RStudio makes software tools that are designed to work together
* Our R packages and products work together
* There are many ways to assemble our tools
* But it will be up to you to decide how to do it
* Your configuration depends on what data science means to your organization

Our goal is to make it easy to install and configure all of our products.
## Solutions
### Data science lab
### Application factories (Dev/Test/Prod)
### On premises, cloud, hybrid cloud
### Single server or a multi-departmental deployment
### Crawl, walk, run strategies
## Architecture

## Server Setup

## Recipes
### Overview
* List of ingredients that make up your platform
* Helps you organize and automate your work
* And are unique to your organization
### Structure
* Most of your code will be for Linux, R, and R packages
* A small part of your code will be for installation
* If you've installed R properly, installation is usually easy
* The rest will be configuration
## Infrastructure as code
### Organize your recipes so that you can manage your platform
* [Configuration management tools for the R admin](https://resources.rstudio.com/rstudio-server-pro/configuration-management-tools-for-the-r-admin)
* Ansible, Chef, Puppet, CodeDeploy, SaltStack, etc.
* Sandbox
# What if I've never used these products?
## RStudio Quickstart
### Overview
* A virtual machine that runs on your desktop
* Includes all our professional products
* Includes prebuilt assets for you to explore and demonstrate to others
### Motivation
* Experience RStudio professional products
* Free and easy

[Demo](https://www.rstudio.com/products/quickstart/)
## Summary
### The happy path
* Keep your operating system and browser up to date
* Support multiple versions of R by building R from source
* Make sure you have easy access to R packages
* Install products as root
* Use a supported authentication system
### How to get started
* Recipes and [checklists](https://support.rstudio.com/hc/en-us/articles/360015079054)
* Crawl/Walk/Run strategies
* [RStudio QuickStart](https://www.rstudio.com/products/quickstart/)
## Summary
### Connecting with solutions engineering
* [Community.rstudio.com](https://community.rstudio.com/)
* [Github.com/sol-eng](https://github.com/sol-eng)
* [Solutions.rstudio.com](http://solutions.rstudio.com)
* [Support.rstudio.com](http://support.rstudio.com)
* [Docs.rstudio.com](http://docs.rstudio.com)
## References
[Administration of Pro Products](https://resources.rstudio.com/administration-of-pro-products)

[Professional R Tooling and Integration](https://resources.rstudio.com/webinars/2018-07-11-13-00-professional-r-tooling-and-integration-nathan-stephens-1)

[The R Admin is Rad](https://resources.rstudio.com/rstudio-conf-2018/the-r-admin-is-rad-a-guide-to-professional-r-tooling-and-integration-nathan-stephens)

[R Admin Community](https://community.rstudio.com/c/r-admin)

[RStudio Docs](https://docs.rstudio.com/resources.html)

[RStudio Professional Product Requirements](https://support.rstudio.com/hc/en-us/articles/360015177453-RStudio-professional-product-requirements)

[RStudio Server Pro Example Checklist](https://support.rstudio.com/hc/en-us/articles/360015079054-RStudio-Server-Pro-Installation-and-Configuration-Example-Checklist)

[R for the Enterprise](https://rviews.rstudio.com/categories/r-for-the-enterprise/)

[Configuration Management Tools for the R Admin](https://resources.rstudio.com/rstudio-server-pro/configuration-management-tools-for-the-r-admin)