Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CZI EOSS 5 Notebook v7 maintenance and documentation #140

Closed
wants to merge 8 commits into from
75 changes: 75 additions & 0 deletions grants/czi-eoss5-notebook-v7-loi.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,75 @@
## Proposal Title: Auto-filled; Maximum of 60 characters, including spaces.

Jupyter Notebook maintenance and community support for v4 migration.

## Amount Requested: Total budget amount requested in USD, including indirect costs; this number should be between $100,000 USD and $400,000 USD total costs over a two-year period.

$400,000 USD

## Proposal Summary/Scope of Work: Provide a short summary of the work being proposed (maximum of 500 words).

JupyterLab and the Classic Notebook are the two major user interfaces to manipulate the Jupyter notebook documents. That format is really popular within the scientific community thanks to its ability to bring code, data analysis and visualization in a single document.
fcollonval marked this conversation as resolved.
Show resolved Hide resolved

This proposal follows the major the upcoming major version of the Classical Notebook will be built on top of JupyterLab plugins (that use newer web technologies). The three key ideas approved in the related migration enhancement proposal are: document-centric user experience, extensions are critical to the community and existing educational content relies on the current Notebook experience. The user interface changes try to be limited (although new features will require adaptation). But the extension API will be totally different.
fcollonval marked this conversation as resolved.
Show resolved Hide resolved

As maintenance is mainly done by a couple of volunteers during their free time. The backlog of issues is ever growing due to the lack of manpower. Addressing that, supporting extensions migration and the new user issues for the new version will require an important effort.
fcollonval marked this conversation as resolved.
Show resolved Hide resolved

To mitigate those points, the actions in the grant frame can be grouped into two categories: documentation and code base. Here are the lists of envisaged actions in priority order.

- Documentation
- Generate screencasts automatically.
- Today the screencasts are recorded manually and published on YouTube. This complicates their updates, their translation and their availability for educators.
- To produce screencasts we will create a set of helpers to emulate keyboard and mouse interactions (to be published as a stand-alone package to allow its usage in other projects). Then the video scenarii will need to be defined. And finally an automatic workflow to generate the videos will be set up.
fcollonval marked this conversation as resolved.
Show resolved Hide resolved
- Refactor the existing documentation focusing on
fcollonval marked this conversation as resolved.
Show resolved Hide resolved
- Reduce fragmentation of the documentation spread it multiple websites
fcollonval marked this conversation as resolved.
Show resolved Hide resolved
- Focus on improving the documentation for new contributors and plugin developers starting from scratch or migrating from the old classical notebook extension paradigm.
- Reorganizing the documentation following the four pillars approaches: tutorials, how-to guides, technical reference and explanation.
- Set up an internationalized documentation website to gather the community around a common documentation source reducing educators' effort to produce their own contents.
- The current tools do support internationalization. The goal will be to set up automation scripts for the internationalized documentation website publication and the documentation translation through a third-party service (like the one used for the interface translation).
- Add interface guided tours
- Today an extension to add interface tours exists. The goal will be to integrate it as part of the core plugin, to create meaningful tours and to document how to create new tours without coding.
- Code base
- Support the community extensions authors to migrate their code
- Address issues related to backward inconsistency in the new major classical notebook version
- Reduce the issues backlog focusing on bug fixes and features requested by the biomedical community based on the 2021 survey; at the time of this proposal writing, the document generation features and easier extension creation are the candidates.

## Value to Biomedical Users: Describe the expected value of the proposed work to the biomedical research community (maximum of 250 words).

JupyterLab and Notebook are primary tools used for biomedical science education, research, and manufacturing. Their interface provides a set of tools allowing users to work with code and data in a common document. Jupyter’s growth in schools, universities and laboratories speak to the value of these technologies in science.
Moving to newer technologies for the classical notebook is required to ensure the maintenance cost stays manageable and the tool will keep up with new browser features and requirements.
Keeping improving the tool focusing on research code documentation will improve spreading research results and understanding of the new discoveries.
fcollonval marked this conversation as resolved.
Show resolved Hide resolved

## Open Source Software Projects: Indicate the number of software projects involved in your proposal (up to five).

There are 5 projects involved: JupyterLab, Notebook, Lumino, JupyterLab server and Jupyter server. JupyterLab is the main target. It builds on Lumino, JupyterLab server and Jupyter server. The upcoming major version of Notebook will be based on JupyterLab.

| Software project name | Main code repository | Homepage |
| --- | --- | --- |
| JupyterLab | https://github.com/jupyterlab/jupyterlab | https://jupyterlab.readthedocs.io/ |
| Notebook | https://github.com/jupyter/notebook/ | https://jupyter-notebook.readthedocs.io/ |
| Lumino | https://github.com/jupyterlab/lumino/ | https://lumino.readthedocs.io/ |
| JupyterLab server | https://github.com/jupyterlab/jupyterlab_server | https://jupyterlab-server.readthedocs.io/ |
| Jupyter server | https://github.com/jupyter-server/jupyter_server/ | https://jupyter-server.readthedocs.io/ |


## Landscape Analysis: Briefly describe the other software tools (either proprietary or open source) that the audience for this proposal primarily uses. How do the software project(s) in this proposal compare to these other tools in terms of user base size, usage, and maturity? How do existing tools and the project(s) in this proposal interact? (maximum of 250 words)

Other known desktop tools used by the scientific community to document code and visualized results are RStudio, JetBrains PyCharm/DataSpell, VSCode or nteract. Some tools are also available from online services like Google Colab(oratory) or CoCalc. Out of those three, RStudio is the most used tool in the scientific biomedical community as it supports R and is well established.
Lately VSCode has invested lots of time integrating them within the nowadays most popular open-source Integrated Development Platform. So their potential user base is bigger than Jupyter. But having a broader audience the extensibility of the tool is more constrained than the more dedicated Jupyter tools.
Except nteract and VSCode, the other tools are being developed by companies that provide paid versions. That financial source helps address bug fixes and reduce the backlog compared to the fully open source community-driven Jupyter tools.

## Category: Choose the two categories that best describe the software project(s) audience:

> TBC

- Bioinformatics
- Machine learning and data analysis

## Previous CZI funding:

- Did you previously apply for funding for this or a related proposal under the CZI EOSS program? Select Yes or No.
- No - but there have been other proposal funded by CZI EOSS for Jupyter tools ([JupyterHub Fellow Program - EOSS-0000000142](https://chanzuckerberg.com/eoss/proposals/jupyterhub-contributor-in-residence-program/), [Real-Time collaboration - EOSS2-0000000084](https://chanzuckerberg.com/eoss/proposals/real-time-collaboration-in-jupyter/), [Accessibility - EOSS4](https://chanzuckerberg.com/eoss/proposals/inclusive-and-accessible-scientific-computing-in-the-jupyter-ecosystem/)
)

- Have you previously received funding for this proposal under the CZI EOSS program? Select Yes or No.
- No