-
Notifications
You must be signed in to change notification settings - Fork 6
/
demo.Rmd
386 lines (262 loc) · 13.6 KB
/
demo.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
---
title: "An example R Markdown file"
subtitle: "Illustrating use of R, bash, Python, and Julia code chunks"
author: "Christopher Paciorek"
date: "February 2022"
bibliography: refs.bib
---
```{r chunksetup, include=FALSE}
# include any code here you don't want to show up in the document,
# e.g., package and dataset loading
require(ggplot2)
set.seed(1)
# also a good place to set global chunk options
library(knitr) # need this for opts_chunk command
opts_chunk$set(fig.width = 5, fig.height = 5)
# if we wanted chunks by default not to be evaluated, uncomment next line
# opts_chunk$set(eval = FALSE)
myControlVar <- TRUE # R variable used later to control chunk behavior
```
# 1) How to generate a document from this file
From within R, you can run the document through the either the *rmarkdown* or *knitr* package for R to generate an html file, or through the *rmarkdown* package to generate PDF or Word (the latter being useful at times but hopefully avoidable).
```{r, compile-R, eval=FALSE}
library(rmarkdown); render('demo.Rmd', 'pdf_document')
library(rmarkdown); render('demo.Rmd', 'html_document')
library(rmarkdown); render('demo.Rmd', 'word_document')
library(knitr); knit2html('demo.Rmd')
```
Or in RStudio, click on the 'Knit' pull-down menu and choose to knit to HTML, PDF, or Word (for R Markdown) or use the 'Render' button in more recent versions of RStudio.
Alternatively, from the UNIX command line, run one of these:
```{bash, compile-bash, eval=FALSE}
Rscript -e "library(rmarkdown); render('demo.Rmd', 'pdf_document')" # PDF
Rscript -e "library(rmarkdown); render('demo.Rmd', 'html_document')" # HTML
Rscript -e "library(rmarkdown); render('demo.Rmd', 'word_document')" # Word
Rscript -e "library(knitr); knit2html('demo.Rmd')" # HTML alternative
```
# 2) Some basic Markdown formatting
Here's an *introduction* to our **critical** discovery. Here we have some code to display inline but not evaluate: `exp(7)` and we can embed the code in a static code block as follows:
```
a = 7 %% 5
b = exp(a)
```
This document will focus on embedding math and code and not on standard Markdown formatting. There are lots of sources of information on Markdown. [RStudio has good information on R Markdown](https://rmarkdown.rstudio.com) (including Markdown formatting).
For documents whose output format is HTML, you can use HTML formatting within your Markdown-based text.
# 3) Embedding equations using LaTeX
This can be done with the following syntax. Note that you can't have a space after the initial $ for the inline equations.
Here is an inline equation $f(x) = \int f(y, x) dy$.
Here's a displayed equation
$$
f_\theta(x) = \int f_\theta(y, x) dy.
$$
# 4) Embedding R code
Here's an R code chunk
```{r, r-chunk1}
a <- c(7, 3)
mean(a)
b <- a + 3
mean(b)
```
Here's another chunk:
```{r, r-chunk2}
mean(b)
```
When running R code, output is printed interspersed with the code, as one would generally want. Also, later chunks have access to result from earlier chunks (i.e., state is preserved between chunks).
Let's make a plot:
```{r, r-plot, fig.height = 3, fig.alt="example plot"}
hist(rnorm(20))
```
And here's some inline R code: What is 3 plus 5? `r 3+5`.
# 5) Controlling code chunk behavior
You have control over whether code in chunks is echoed into the document and evaluated using the `include`, `echo`, and `eval` tags.
Here we print the code but don't evaluate it by setting `eval` to `false`.
```{r, evalChunk, eval=FALSE}
cat("This code is not evaluated, but the code itself is printed in the document.")
```
Here is the result of running the code in a chunk but not printing the code by setting `eval` to `false`.
```{r, echoChunk, echo=FALSE}
cat("This code is not printed in the document, but results of evaluating the code are printed.")
```
And here is a chunk that is evaluated, but neither the code nor the result of evaluating the code is printed in the rendered document. This is achieved by setting `include` to `false`.
```{r, includeChunk, include=FALSE}
cat("This code is evaluated but nothing is printed in the document.")
library(fields)
myVar <- 7
## fields package should now be loaded for use by later chunks and
## myVar should now be available
```
Results of intensive calculations can be saved using the `cache=TRUE` tag so they don't need to be rerun every time you compile the document.
```{r, slow-step, cache=TRUE}
a <- mean(rnorm(5e7))
a
```
You can use R variables to control the chunk options. Note that the variable `myControlVar` is defined in the first chunk of this document. Here it is used to turn off evaluation of the chunk code.
```{r, use-var-in-chunk-option, echo=myControlVar, eval=!myControlVar}
print("hi")
```
An alternative, nice way to specify chunk options is within the chunk, like this:
```{r}
#| label: put-options-in-chunk
#| eval: FALSE
#| echo: TRUE
cat("This code is printed in the document, but the code is not evaluated.")
```
# 6) Embedding bash and Python code
## 6.1) bash
A bash chunk:
```{bash, bash-chunk1}
ls -l
df -h
cd /tmp
pwd
```
Unfortunately, output from bash chunks occurs after all the code is printed. Also, state is not preserved between chunks.
We can see that state is not preserved here, where the current working directory is NOT the directory that we changed to in the chunk above.
```{bash, bash-chunk2}
pwd # result would be /tmp if state were preserved
```
Inline bash code won't work: `bash wc demo.Rmd`, unlike with R code.
## 6.2) Embedding Python code
You can embed Python code. As with R, state is preserved so later chunks can use objects from earlier chunks.
```{python, py-chunk1}
import numpy as np
x = np.array((3, 5, 7))
print(x.sum())
x.min() # this will print with more recent versions of rmarkdown
```
```{python, py-chunk2}
try:
print(x[0])
except NameError:
print('state is not preserved: x does not exist')
```
There is no facility for inline Python code: `python print(3+5)`
# 6.3) Embedding Julia code
You can embed Julia code. As with R and Python, state is preserved so later chunks can use objects from earlier chunks.
```{julia}
#| label: jl-chunk1
x = [3, 5, 7];
x[2]
```
```{julia}
#| label: jl-chunk2
try
println("state is preserved if we see the value of `x[2]` next")
print(x[2])
catch
print("state is not preserved: x does not exist")
end
```
There is no facility for inline Julia code: `julia print(3+5)`
# 7) Reading code from an external file
It's sometimes nice to draw code in from a separate file. Before invoking a chunk, we need to read the chunks from the source file, which contains the chunks tagged with some special formatting. Note that a good place for reading the source file via `read_chunk()` is in an initial setup chunk at the beginning of the document.
```{r, read-chunk, echo=FALSE}
library(knitr)
read_chunk('demo.R') ## contains external_chunk_1 and external_chunk_1
```
```{r, external_chunk_1}
```
```{r, external_chunk_2}
```
# 8) Formatting of long lines of code and of output
## 8.1) R code
Having long lines be nicely formatted and other aspects of formatting can be a challenge. Also, results can differ depending on your output format (e.g., PDF vs. HTML). In general the code in this section will often overflow the page width in PDF but not in HTML, but even in the HTML the line breaks may be awkwardly positioned.
Here are some examples that overflow in PDF output.
```{r, r-long}
b <- "Statistics at UC Berkeley: We are a community engaged in research and education in probability and statistics. In addition to developing fundamental theory and methodology, we are actively"
## Statistics at UC Berkeley: We are a community engaged in research and education in probability and statistics. In addition to developing fundamental theory and methodology, we are actively
## This should work to give decent formatting in HTML but doesn't in PDF.
cat(b, fill = TRUE)
vecWithALongName = rnorm(100)
a = length(mean(5 * vecWithALongName + vecWithALongName - exp(vecWithALongName) + vecWithALongName * vecWithALongName, na.rm = TRUE))
a = length(mean(5 * vecWithALongName + vecWithALongName)) # this is a comment that goes over the line by a good long ways
a = length(mean(5 * vecWithALongName + vecWithALongName - exp(vecWithALongName) + vecWithALongName, na.rm = TRUE)) # this is a comment that goes over the line by a good long long long long long long long long ways
```
In contrast, long output is usually fine, even in PDF.
```{r, r-long-output}
rnorm(30)
```
Adding the `tidy=TRUE` chunk option and setting the width (as shown in the Rmd version of this document) can help with long comment lines or lines of code, but doesn't help for some of the cases above.
```{r, r-long-tidy, tidy=TRUE, tidy.opts=list(width.cutoff=80)}
## Long strings and long comments:
b <- "Statistics at UC Berkeley: We are a community engaged in research and education in probability and statistics. In addition to developing fundamental theory and methodology, we are actively"
## Statistics at UC Berkeley: We are a community engaged in research and education in probability and statistics. In addition to developing fundamental theory and methodology, we are actively
## This should work to give decent formatting in HTML but doesn't in PDF:
cat(b, fill = TRUE)
## Now consider long lines of code:
vecWithALongName <- rnorm(100)
a <- length(mean(5 * vecWithALongName + vecWithALongName - exp(vecWithALongName) + vecWithALongName * vecWithALongName, na.rm = TRUE))
a <- length(mean(5 * vecWithALongName + vecWithALongName)) # this is a comment that goes over the line by a good long ways
a <- length(mean(5 * vecWithALongName + vecWithALongName - exp(vecWithALongName) + vecWithALongName, na.rm = TRUE)) # this is a comment that goes over the line by a good long long long long long long long long ways
```
To address the problems seen above, sometimes you can format things manually for better results. You may need to tag the chunk with `tidy=FALSE`, but I have not done that here.
```{r, r-long-manual}
## Breaking up a string:
b <- "Statistics at UC Berkeley: We are a community engaged in research
and education in probability and statistics. In addition to developing
fundamental theory and methodology, we are actively"
## Breaking up a comment:
## Statistics at UC Berkeley: We are a community engaged in research and
## education in probability and statistics. In addition to developing
## fundamental theory and methodology, we are actively
## Breaking up code lines:
vecWithALongName = rnorm(100)
a <- length(mean(5 * vecWithALongName + vecWithALongName - exp(vecWithALongName) +
vecWithALongName * vecWithALongName, na.rm = TRUE))
a <- length(mean(5 * vecWithALongName + vecWithALongName)) # this is a comment that
## goes over the line by a good long ways
a <- length(mean(5 * vecWithALongName + vecWithALongName - exp(vecWithALongName) +
vecWithALongName, na.rm = TRUE)) # this is a comment that goes over the line
## by a good long long long long long long long long ways
```
## 8.2) bash code
In bash, we have similar problems with lines overflowing in PDF output, but bash allows us to use a backslash to break lines of code. However that strategy doesn't help with long lines of output.
```{bash, bash1}
echo "Statistics at UC Berkeley: We are a community engaged in research and education in probability and statistics. In addition to developing fundamental theory and methodology, we are actively" > tmp.txt
echo "Second try: Statistics at UC Berkeley: We are a community engaged \
in research and education in probability and statistics. In addition to \
developing fundamental theory and methodology, we are actively" \
>> tmp.txt
cat tmp.txt
```
We also have problems with long comments, so we would need to manually format them.
Here is a long comment line that overflows in PDF:
```{bash, bash2}
# asdl lkjsdf jklsdf kladfj jksfd alkfd klasdf klad kla lakjsdf aljdkfad kljafda kaljdf afdlkja lkajdfsa lajdfa adlfjaf jkladf afdl
```
Instead manually break the comment into multiple lines:
```{bash, bash3}
# asdl lkjsdf jklsdf kladfj jksfd alkfd klasdf klad kla
# lakjsdf aljdkfad kljafda kaljdf afdlkja lkajdfsa lajdfa
# adlfjaf jkladf afdl
```
## 8.3) Python code
In Python, there is similar trouble with lines overflowing in PDF output too.
```{python, test-python1}
# This overflows the page:
b = "asdl lkjsdf jklsdf kladfj jksfd alkfd klasdf klad kla lakjsdf aljdkfad kljafda kaljdf afdlkja lkajdfsa lajdfa adlfjaf jkladf afdl"
print(b)
```
```{python, test-python2}
# This code overflows the page:
zoo = {"lion": "Simba", "panda": None, "whale": "Moby", "numAnimals": 3, "bear": "Yogi", "killer whale": "shamu", "bunny:": "bugs"}
print(zoo)
```
To fix the issue, we can manually break the code into multiple lines, but long
output still overflows.
```{python, test-python3}
zoo = {"lion": "Simba", "panda": None, "whale": "Moby",
"numAnimals": 3, "bear": "Yogi", "killer whale": "shamu",
"bunny:": "bugs"}
print(zoo)
```
Long comments overflow as well, but you can always manually break into multiple lines.
```{python, test-python4}
# asdl lkjsdf jklsdf kladfj jksfd alkfd klasdf klad kla lakjsdf aljdkfad kljafda kaljdf afdlkja lkajdfsa lajdfa adlfjaf jkladf afdl
# asdl lkjsdf jklsdf kladfj jksfd alkfd klasdf klad kla lakjsdf aljdkfad
# kljafda kaljdf afdlkja lkajdfsa lajdfa adlfjaf jkladf afdl
```
# 9) References
We'll just see how you use BibTeX style references. @Bane:etal:2008 proposed a useful method. This was confirmed [@Cres:Joha:2008].
Note the indication of the `refs.bib` file in the initial lines of this document so that the bibliographic information for these citations can be found.
The list of references is placed at the end of the document. You'd presumably want a section header like this:
# Literature cited