{% if project_description %} {{ project_description }} {% endif %}

This is a template for a renku data science project using python. It has predifined folders and a installed tool for documentation. Both are further explained in the sections below.

Project Setup

Before using the template you should do the following

[] change the folder name of the folder "name" to the name of your project
[] edit the setup.py file by replacing "name" with your projects name

Doing both will enable installing you project with pip install -e .. This will let you import your python code within the src folder but also from other directories.

Imports can then be of the form.

from projectname.data.Dataloader import Dataloader

Workflow File and Renku

One of the most important ideas behind Renku is the concept of capturing the provenance of the analysis process. For that renku provides workflows. In short a renku workflow is an execution of your code where you explicitly define inputs and ouputs. This way renku can keep track of the dependencies.

A workflow is best defined in a workflow file. An example is given in this template. Some words in the yaml structure are key words. They include "name", "step", "command", "inputs", and "outputs". Make sure not to rename them.

The workflow can be executed with renku run workflow.yml An overview of the dependencies can be displayed with renku workflow visualize -i

More information about workflow files can be found in the workflow.yml template and here https://renku.readthedocs.io/en/latest/topic-guides/workflows/workflow-file.html

Documentation

Documentation is set up for this template with Sphinx. The Sphinx tool translates a set of plain text source files into various output formats. Generating the documentation can be done in two ways:

The Gitlab CI is set up to generate a PDF documentation. It can be downloaded from the CI pipelies artifacts.
It can be created from the console within the renku environment.

To create the documentation within the renku environment use the follwing command:

sphinx-apidoc -f -o docs/source/api src
cd docs/source
sphinx-build -b latex . ./_latex
cd _latex
make
cp documentation.pdf ../../documentation.pdf
cd ../../../
rm -r docs/source/_latex

The documentation.pdf file will then be in the docs folder. If you want to add the docstrings and funcion signatures of your code to the documentation you just need to add the chapter (api/modules) to the 'index.rst' file. The automatic apidoc from sphinx is already set up.

Folder Structure

├───.renku                  <- renku internal folder 
├───data                        
│   ├───external            <- data from third party sources
│   ├───processed           <- the final data sets for modeling
│   └───raw                 <- the original immutable data dump
│ 
├───docs                    <- a default Sphinx project; see sphinx-doc.org for details
│   └───source
│ 
├───models                  <- trained and serialized models
│ 
├───notebooks               <- jupyter notebooks
│ 
├───references              <- data dictonaries, manuals, and other explanatory materials
│ 
├───reports                 <- generated analysis
│   └───figures
│ 
├───setup.py                <- make this project pip installable with `pip install -e`          
│ 
├───src                     <- source code for use in this project
│   └───package_name        <- The name that you give your package
│       ├───data            <- scripts to download and generate data
│       ├───features        <- scripts to turn raw data into features for modeling
│       ├───model           <- scripts to train models and use them for predictions
│       └───visualization   <- scripts to create exploratory and results oriented visualizations
│ 
├───tests                   <- unit tests for pytest
│ 
└───workflow.yml            <- renku workflow file

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
data		data
docs/source		docs/source
models		models
notebooks		notebooks
references		references
reports/figures		reports/figures
src/name		src/name
tests		tests
.gitignore		.gitignore
Dockerfile		Dockerfile
Draft_Renku_Checklist_V01.pptx		Draft_Renku_Checklist_V01.pptx
README.md		README.md
environment.yml		environment.yml
manifest.yaml		manifest.yaml
python.png		python.png
requirements.txt		requirements.txt
setup.py		setup.py
template.yaml		template.yaml
workflow.yml		workflow.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

{{ name }}

Project Setup

Workflow File and Renku

Documentation

Folder Structure

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

{{ name }}

Project Setup

Workflow File and Renku

Documentation

Folder Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages