README.Rmd

scientist
=========

```{r echo=FALSE}
knitr::opts_chunk$set(
  comment = "#>",
  collapse = TRUE,
  warning = FALSE,
  message = FALSE
)
```

[![Project Status: Concept – Minimal or no implementation has been done yet, or the repository is only intended to be a limited example, demo, or proof-of-concept.](https://www.repostatus.org/badges/latest/concept.svg)](https://www.repostatus.org/#concept)

__Experiment with changes in an R project__

Inspired by the Ruby gem [scientist](https://github.com/github/scientist) -
but instead of targeted at web apps, this project targets researchers/etc.
that want to compare changes in their code in a rigorous fashion.

## How is this different from X tool?

* git/version control: scientist does not play any part in managing or
dealing versions of your project. use git for that.
* unit tests: tests are meant for making sure changes you make to your code
don't change outputs. scientist is a sort of opposite to
unit tests in that it aims to tell you **how** changes in your code
**do change** the outputs.
* benchmarking/profiling: scientist isn't purely a tool for comparing how
fast your code runs, but it does include comparison of run times
as a tool for decision making about what version to use

 
## Use cases (click to expand)

<details> <summary><strong>Code block</strong></summary> <p>

You have some code. You want to make a change to the code,
and you have a few different ideas about what you'd like
to do. For example, you want to pre-allocate the size of the
data.frame to see if that saves time.

`scientist` can help you sort out changes by comparing how long
each version takes, and visually diff results.

Using `scientist` you can compare these two functions like:

```{r eval=FALSE}
a <- Experiment$new(name = "compare_code")
a$control(v1 = {
  out <- data.frame(letter = NA_character_, LETTER = NA_character_,
    both = NA_character_, stringsAsFactors = FALSE)
  for (i in 1:26)
    out[i,] <- c(letters[i], LETTERS[i], paste0(letters[i], LETTERS[i]))
  out
})
a$candidate(v2 = {
  out <- data.frame(letter = rep(NA_character_, times = 26),
    LETTER = rep(NA_character_, times = 26),
    both = rep(NA_character_, times = 26),
    stringsAsFactors = FALSE)
  for (i in 1:26)
    out[i,] <- c(letters[i], LETTERS[i], paste0(sample(letters, 1), LETTERS[i]))
  out
})
a
```

Then we can run the "experiment"

```{r eval=FALSE}
a$run()
```

Then compare results

```{r eval=FALSE}
a$diff()
```

![diff_plot](tools/code_block_diff.png)

</p></details>

<details> <summary><strong>Scripts</strong></summary> <p>

You have an R script, let's call it `code.R`. Just as above with the code
example, you want to make a change to the script. Instead of using
code blocks as input as above, you can use file names. (NOTE: file names
not supported yet, see [issue #7](https://github.com/sckott/scientist/issues/7))

Using `scientist` you can compare these two scripts with:

```{r eval=FALSE}
b <- Experiment$new(name = "compare_scripts")
b$control(file = "code.R")
b$candidate(file = "code_new.R")
```

> note: above code doesn't work yet

</p></details>

<details> <summary><strong>Packages</strong></summary> <p>

You have a package, let's call it `foobar`. You want to change a function
in `foobar` called `stuff()`. You make a new version of that function
called `stuff_new()`. (NOTE: functions not supported yet per se, see 
[issue #8](https://github.com/sckott/scientist/issues/8); although
you can call functions just like code blocks)

Using `scientist` you can compare these two functions with:

```{r eval=FALSE}
res <- Experiment$new(name = "compare_stuff")
res$control(foobar::stuff(x = 5))
res$candidate(foobar::stuff_new(x = 5))
```

> note: above is pseudocode, as foobar is not a real package; though
> you can try functions from a real package

</p></details>


## Install

```{r eval=FALSE}
remotes::install_github("sckott/scientist")
```

```{r}
library(scientist)
```

## Usage

Initialize an experiment

```{r}
res <- Experiment$new(name = "jane")
```

Set your control code block

```{r}
res$control({
  x = 5
  x^2
})
```

Set your candidate code block. You can have 1 or more candidates, which are
compared against the control.

```{r}
res$candidate({
  y = 5
  y^3
})
```

Now you can see some control and candidate details

```{r}
res
```

Run the experiment

```{r}
res$run()
```

Get the results

```{r}
res$control_result
res$candidate_results
```

Get all results plus timing data

```{r}
res$result()
```

Publish results - opens a page in your default browser

```{r eval=FALSE}
res$publish()
```

![img](tools/publish_eg.png)

## Meta

* Please [report any issues or bugs](https://github.com/sckott/scientist/issues)
* License: MIT
* Get citation information for `scientist` in R doing `citation(package = 'scientist')`
* Please note that this project is released with a [Contributor Code of Conduct][coc]. By participating in this project you agree to abide by its terms.


[coc]: https://github.com/sckott/scientist/blob/master/CODE_OF_CONDUCT.md