Skip to content

A simple tool for testing ipython notebooks (.ipynb) files

Notifications You must be signed in to change notification settings

jhprinz/ipynb-test

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

84 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Travis Anaconda-Server Badge Anaconda-Server Badge Anaconda-Server Badge

ipynb-test

A simple tool for testing ipython notebooks (.ipynb) files

Usage output

To get the help use

python ipynbtest.py -h

which outputs

usage: ipynbtest.py [-h] [-t TIMEOUT] [--rerun-if-timeout [RERUN]]
                    [--restart-if-fail [RESTART]] [-l] [-s] [--eval [EVAL]]
                    [--tested-types [TTYPES]] [--pass-if-timeout] [-d]
                    [--abort-if-fail] [--extra-arguments [EXTRA_ARGUMENTS]]
                    [-y] [-v]
                    file.ipynb

Run all cells in an ipython notebook as a test and check whether these
successfully execute and compares their output to the one inside the notebook

positional arguments:
  file.ipynb            the notebook to be checked

optional arguments:
  -h, --help            show this help message and exit
  -t TIMEOUT, --timeout TIMEOUT
                        the default timeout time in seconds for a cell
                        evaluation. Default is 300s (5mins). Note that travis
                        will consider it an error by default if after 600s
                        (10mins) no output is generated. So 600s is the
                        default limit by travis. However, a test cell that
                        takes this long should be split in more than one or
                        simplified.
  --rerun-if-timeout [RERUN]
                        if set then a timeout in a cell will cause to run the.
                        Default is 2 (means make up to 3 attempts)
  --restart-if-fail [RESTART]
                        if set then a fail in a cell will cause to restart the
                        full notebook!. Default is 0 (means NO rerun).Use this
                        with care.
  -l, --lazy            if set to true then the default test is that cell have
                        to match otherwise a diff will not be considered a
                        failed test
  -s, --strict          if set to true then the default test is that cell have
                        to match otherwise a diff will not be considered a
                        failed test
  --eval [EVAL]         the argument will be run before the first cell is
                        executed. This can be used to set specific values
                        without changing the notebook.
  --tested-types [TTYPES]
                        the argument will specify be output types to be
                        checked forequality. Currently the following types
                        "stream.stdout.text/plain, stream.stderr.text/plain,
                        execute_result.data.text/plain,
                        display_data.data.image/png,
                        display_data.data.image/svg,
                        display_data.data.text/plain,
                        execute_result.data.image/png " can be given as acomma
                        `,` separated list. Default setting is
                        "stdout.text/plain, data.text/plain" which will test
                        stdout and test/plain exeution results. No images will
                        be tested.
  --pass-if-timeout     if set then a timeout (after last retry) is considered
                        a passed test
  -d, --show-diff       if set to true differences in the cell are shown in
                        `diff` style
  --abort-if-fail       if set to true then a fail will stop the whole test.
  --extra-arguments [EXTRA_ARGUMENTS]
                        additional arguments passed to the ipython kernel on
                        starting. Examples are `--pylab=inline`.
  -y, --pylab           if set then pylab will be added to the extra
                        arguments.
  -v, --verbose         if set then text output is send to the console.

show differences

--show-diff

This option will output a diff-like comparion of both cells to show what is different in the output. This will only be enabled for cell with text-like output, (e.g. text, html). It is automatically disabled for pictures and SVG.

cell specific commands

You can start a cell with a hashbang #! and add some commands to it like

#! skip              : will not even execute a cell and just skip it
#! ignore            : will run the cell, but not fail if anything happens and just continue
#! timeout:[seconds] : will set the timeout for this cell to the given value
#! lazy              : will accept a cell with diffs, even in strict mode
#! strict            : will fail the cell if it has a diff
#! verbose           : will send the output (text) to the console
#! quiet             : will not send the output to the console even in verbose mode

strict mode

  --strict

The strict mode only causes cell with differing output to fail. Default setting is that a diff is okay.

Note that UUIDs and hex adresses (usually memory adresses) are always replaced by a unique address so different memory addresses will not cause a diff

Time out and rerun

A timeout is caused if the evaluation of a cell takes too long. The default timeout happens after 300s or 5minutes. Keep in mind that usually notebooks are used also for illustrative purposes and therefore are similar to an integration test. This means that for once we want to keep the run time per cell short to make it a reasonable example that executes in acceptable time. Second purpose is to show that a combination of several cells in a typical test run should give expected results. So keep the evaluation of each cell short and focussed on a single thing to happen at a time.

Also, remember that travis has an internal timeout of 10 minutes (if not manually changed) and will stop a build if no results are received. Make sure that either your cell will send at least some results within 10 minutes if you extend the timeout beyond 600s (10mins).

Lastly, try to avoid that timeouts happen. This is an indication of a poor test or example design.

Cause a fail and restart

The option

--restart-if-fail [max-number-of-restarts, default:0]

will cause to restart the whole notebook in a fresh kernel, if a cell executed with fail. Here fail means whatever you declared to be a fail. In strict mode also a difference in output will cause a restart.

Be careful using this option. It is again usually a sign of poor example design should it be possible to fail, if there is no error, but some random results involved that are not what is "hoped" for and thus cause a fail. Make sure that given the correct conditions (previous cells, etc...) a cell passes.

About

A simple tool for testing ipython notebooks (.ipynb) files

Resources

Stars

Watchers

Forks

Packages

No packages published