-
Notifications
You must be signed in to change notification settings - Fork 181
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Temporarily use JupyterLite for high-traffic links to mybinder.org #513
Comments
I actually have some days off coming up, so if we have to spike this, I can support it. I think everything's working basically as well as it ever has at this point, even though we're still in alpha, nominally. It's really just a matter of what we want:
If we need some more bespoke labextensions, that's also very doable. At present, there is no google analytics or anything: probably doable. |
@bollwyvl what do you think is the biggest bottleneck that we would need to worry about, if there were a large spike in people clicking a link that served a JupyterLite? |
I will see if we can help with that. |
There aren't any, really. Once deployed, provided the host (whatever it is) has a CDN in front of it, it can be entirely self-sufficient and cached (just |
I would am definitely in favor of this, provided that the JupyterLite team is confident that this wouldn't generate a lot of negative attention their way once a lot more people were clicking those links. For generic and straightforward Jupyter experiences, like the ones that try.jupyter.org is meant to serve, I think JupyterLite is a much better solution than a full-blown image setup like mybinder.org . It would save cloud spend for the Binder team, and I think would provide a nice more lightweight solution to complement repo2docker. |
Yeah, there are a number of known issues that are going to be surprising to people, but yeah, in this case, it's a demo. Some gotchas:
|
Just to put some back-of-the-hand numbers to this, I grabbed the latest 30 days of activity on mybinder.org from @betatim's binderlytics analyzer. For the top 20 repositories (that make up about 75% of Binder traffic). If we split those repositories into ones that are served by try.jupyter.org, and all the rest, we get something like this: About 600,000 are from try.jupyter.org, and about 120,000 are from other repositories. This means that if we were serving try.jupyter.org via jupyterlite, it would reduce mybinder.org's launch load by more than 50%, which would save a lot of resources. That said, from a CDN standpoint, could JupyterLite be realistically served to users |
From a cold start, the jupyterlite launcher page is (gzipped) 3.1mb vs 1.8mb for the jupyter.org home page, all fronted by cloudflare. At that point, it's already got, a lot of features like markdown rendering, but doesn't yet have mathjax or any kernels. By the time the baseline pyodide notebook loads with a pyolite kernel, it's up to 31.5mb, with almost all of that coming from pyodide on jsdelivr... but at that point has already loaded matplotlib, numpy and a lot of other stuff. Meanwhile, the So a "bounced" lite load is ~2x home page loads, "full" interactive lite load is ~10x home page loads, and ~2x widget page loads, which seems like it's not going to break anything too badly. I've also been toying with shifting some of the heavy to p2p tech... we could likely fit the relevant parts of lab, retro, and the demo wheels within the free tier of many of the IPFS pinning services, and then cloudflare would also host that. |
Just read your response and want to note how cool jupyterlite is lol |
Note, of course, of try.jupyter.org's hero badges, we'll only be able to support lab and python directly, and retro as a stand-in for classic. There are a few other kernels ready to go (p5, lua, wren), a few on deck (robotframework) but r, julia, kotlin, scheme, ruby, voila, are not happening by next week. |
I don't think that'd be a huge issue for now. by far the most common repositories are: ipython in depth has no non-Python code, and is the most heavily used |
ProposalSince we are not sure exactly what will happen once we turn on the floodgates of people clicking Jupyter Lite links instead of mybinder.org links, what if we did the following:
This would slightly reduce the load on Binder, and might be a way for us to identify any issues that popped up in Jupyter Lite as a result. All the old Binder links would still be there, just wouldn't be the first two options on the page. Then after a few weeks if all is well, we could swap the other link ( Reasoning: Those |
I made a short PR to JupyterLite to add what I think are some minimal "getting started" notebooks to the JupyterLite documentation, so that we can feel more comfortable sending first-time users there if we want to use try.jupyter.org for this: jupyterlite/jupyterlite#432 If folks are generally OK with this plan, how about we try adding the following link as the second button on try.jupyter.org, and bumping the current JupyterLab button below the fold.
If this works out, we can do a similar thing with the IPython link as well: |
Mentioned over on the PR: while I'll stand behind Also, basically all the cache goes right out the door when you go to different domains. So basically, I'm imagining whatever repo is driving |
Totally agreed that's a better path forward - this was mostly an attempt at mitigating the immediate fire for binder, but we have a little bit breathing room there now anyway. I think we should take whatever path you recommend 👍 |
Yeah, can dig. So is this just a GitHub pages with Jekyll? Can we put python stuff in the build? Or would it make more sense to have a |
Yep just a plain old Jekyll site, though I think it builds with netlify |
Or just create a dedicated |
Cool. Looks like submodules would work.
Right. Either way. Some additional considerations:
So we'd want to figure out just how much we'd want to check in to this future repo, whatever it's called, and how it would be coordinated to update to a new version as links into it need to be updated. There are some pointy entropy edges around upgrades than can require significant cache-refreshing. |
This all sounds great to me. A dedicated try repo makes sense as well if the repo will require some bespoke configuration. The only thing I'd urge is that we stick to a minimal set of content and examples at first, so that we don't make this problem harder than it needs to be 🙂 The first two content links at try.jupyter.org is already too complex IMO, and I think our lives will be easier if we think in terms of "a 2 minute experience that gives people an idea of what jupyter is and where to learn more" rather than an in depth treatment or a kitchen sink. |
Is this because of some libraries/packages? Can we cut something to reduce it? Users in some countries won't be able to afford to visit a website with such requirements. |
These are compiled stand-ins for At present, the baseline Part of not making pyodide special: there would also be the potential for having an even lighter-weight python kernel, as we did on jyve, based on brython or similar, but these have even more quirks than |
I would also recommend setting up a dedicated repo / deployment for the There is also the demo repo showing how to deploy to GitHub Pages, available here for reference: https://github.com/jupyterlite/demo. |
That sounds great to me - I am happy to review work making that happen, or to update our docs to point to a new jupyterlite-backed |
Hopefully the docs will be improved soon: jupyterlite/jupyterlite#393 One way to think about |
FYI I opened an early draft PR for visibility and feedback: #682 |
The first couple of links on https://jupyter.org/try drive a huge amount of the overall traffic on mybinder.org. We're currently dealing with a bit of a funding crisis, so we might try shifting the highest traffic links to JupyterLite demos, instead, to reduce the gap funding we need to cover while we figure this out.
Maybe someday JupyterLite will be in a place where this is the right thing to do in the long term, but probably now wouldn't be the time if we don't need it. This isn't ideal, but the alternative would be for the links to just stop working, which seems worse. Plus, an exciting demo for Jupyter Lite!
The text was updated successfully, but these errors were encountered: