Update 2024 Short Term Updates Oct Nov.md

add Jeaye, Oleksandr
clojurists-together · Dec 9, 2024 · b42d708 · b42d708
1 parent 58729f8
commit b42d708
Showing 1 changed file with 273 additions and 1 deletion.
diff --git a/content/news/2024/2024 Short Term Updates Oct Nov.md b/content/news/2024/2024 Short Term Updates Oct Nov.md
@@ -48,7 +48,7 @@ https://github.com/oakmac/standard-clojure-style-js/issues/1
 
 
 ## clj-nix:José-Luis-Lafuente  
-Q3 2024 Report No. 2, Published Nov. 29, 2024  
+Q3 2024 Report No. 2 (final), Published Nov. 29, 2024  
 
 ### Nix babashka builder  
 
@@ -93,3 +93,275 @@ main branch, making it easier for people to test it out.  <br>
 
 ---
 
+## Clojure Goes Fast: Oleksandr Yakushev  
+Q3 2024 Reports No. 2 and 3 (final), Published Oct. 31 and Nov. 30, 2024   
+
+**Report 2:**  
+During the second month of my Clojurists Together project I published a major update of clj-async-profiler. The new release contains a redesigned and improved profile list page with better UI, convenient sorting, and easy access to diffing functionality. The rest of the time I've spent working on the flamegraph sharing webapp that I plan to release into a public betatest at the beginning of the third month. As a bullet list:  
+
+- Published the release of clj-async-profiler 1.4.0. Big changes:  
+  - New index page design.  
+  - Diffgraph generation functionality accessible from UI.  
+  - Flamegraph rendering fixes - prevent horizontal scrollbar from appearing and the tooltip from clipping near screen borders.  
+  - Stability fix for JDK23.  
+- Wrote a blogpost about 1.4.0 release [highlighting the main changes:](https://clojure-goes-fast.com/blog/clj-async-profiler-140/).   
+- Set up infrastructure, deployment, and proof-of-concept implementation of the flamegraph sharing site. Got very close to the initial public release.  
+
+**Report 3:**  
+I've spent the third month of my Clojureists Together project working primarily on Flamebin — a new web application that allows uploading and sharing flamegraphs. At the beginning of the month I launched it as public beta, and by the end of the November I finally published it as a first stable release. A more detailed report below:  
+- Made a public release of [Flamebin](https://flamebin.dev)  
+  - Set up a [public repository](https://github.com/clojure-goes-fast/flamebin)  
+  - Created and configured a VPS instance to host the web application  
+  - Configured CI for testing, buildign the application and deploying it to two environemnts (QA and prod)  
+  - Flamebin allows uploading profiles to later render them as flamegraphs:  
+    - Supports two formats of uploads — collapsed stackframes (compatible with perf, async-profiler, and other profiling tools) and dense profiles.  
+    - Uploading is possible using a web page or through clj-async-profiler integration.  
+    - An uploaded profile can either be publicly visible or private. Private uploads are encrypted by a secret read-token that is required for viewing the profile and is not stored on the server.  
+    - Accidentally uploaded profiles can be deleted by providing a secret edit-token.  
+  - Uploaded flamegraphs have a stable URL that can be viewed by anyone (for public flamegraphs) or only by those with a read-token (for private flamegraphs). The flamegraph renderer is reused from clj-async-profiler. User can explore the flamegraph interactively, change the viewing preferences, and also save those preferences so that other users can open the pre-configured flamegraph later.  
+
+I also made two releases of clj-async-profiler — 1.5.0 and 1.5.1 follow-up. These releases unified the approach to flamegraph configuration, enabling different ways to change the configuration:  
+- Specify the configuration before generate a flamegraph, using code.  
+- Save the current configuration (set in UI) to the server for the subsequently generated flamegraphs to inherit.  
+- Copy and paste configuration between different flamegraphs in the UI.  
+- Provide configuration in the URL parameter.  
+All of the configuration improvements were ported to Flamebin as well.  
+
+Finally, I've been updating the [Clojure Goes Fast knowledge base](https://clojure-goes-fast.com/kb/profiling/clj-async-profiler/) to cover the new clj-async-profiler and Flamebin features.  <br>  
+
+---
+
+## Jank: Jeaye Wilkerson  
+Q3 2024 Report No. 2, Published Nov. 30, 2024   
+Hi everyone! It's been a very busy couple of months as I've been developing
+jank's LLVM IR generation, improving jank's semantic analysis, and furthering
+jank's module loading system. Thank you to all of my Github sponsors and to
+Clojurists Together, who help me pay the bills. As of January 2025, I'll be
+working on jank full-time and every new sponsor means that much more.
+Without further ado, let's dive into the details of the past couple of months.
+
+### LLVM IR
+The main focus of the past couple of months has been filling out jank's LLVM IR
+generation. This has required further improving some of its semantic analysis
+since I was previously able to cut some corners when generating C++ code.  
+
+At this point, all AST nodes in jank have working and tested IR generation
+except for `try`, since doing so requires hooking into the C++ runtime's unwind
+mechanism and I've been saving that rabbit hole for last.  
+
+IR generation has caused so many fun bugs the past couple of months that I had
+to look into a better way of organizing my notes for this quarter. There was
+just too much. When developing a language, especially in a pre-alpha stage like
+jank, when something crashes or is otherwise completely wrong, the issue could
+be anywhere from lexing to parsing to semantic analysis to code generation to
+object modeling to core function algorithms to data structures. I think there
+were some of each discovered in the past couple of months, but the majority of
+them were in the new IR generation.  
+
+### Named recursion  
+One of the fun bugs I ran into was with how Clojure handles recursion through
+the function's own name. This doesn't go through the var; it just references the
+function object directly. The same applies with just a self-reference. For
+example:  
+
+```clojure
+(fn foo []
+  foo)
+
+(defn foo []
+  foo)
+```
+
+In Clojure, and in jank with C++ generation, each function is an object (struct
+or class). A self-reference can literally just mean `this`. But with the LLVM IR
+generation, each function, and more specifically each function arity, is
+compiled to a dedicated C function. Aside from closures, which get an implicit
+first argument which is a generated struct of the captured values, the function
+itself has no other identifying state or object. This means a self-reference
+actually needs to build a new object. Previously, I would always just generate a
+local into each function like so:  
+
+```cpp
+auto const foo{ this };
+```
+
+Then I would register that local automatically during semantic analysis for
+functions. A self-reference would then just be a `local_reference` AST node.
+This can't carry over well to IR generation, so I've added two new AST nodes:  
+
+1. `recursion_reference`, which is created when we analyze that we're referring
+   to the current function by name somewhere  
+2. `named_recursion`, which is created when we're analyzing a `call` and find
+   that the source expression is a `recursion_reference`  
+
+This does mean that a self-reference within a function wouldn't be `identical?`
+to the invoking object, but that's something we can document and otherwise
+really not care about.  
+
+### Startup performance  
+Ultimately, the main reason for generating LLVM IR is that C++ is too slow to
+compile. If compiling `clojure.core` means generating 100K lines of C++ to
+compile, we end up waiting far too long for jank to start up. On my machine, it
+took around **12 seconds**.  
+
+Now, with IR generation, and a lot more functionality baked into jank itself,
+rather than JIT compiled, compiling `clojure.core` from source takes only **2 seconds**.
+This is fast enough to where we can easily include it as part of jank's
+build system. We can then pre-compile the sources to binaries and load those
+instead.  
+
+When I tried this with C++ generation, it took **4 minutes** to compile all of
+the C++ generated for `clojure.core` into a C++20 module with a backing shared
+library. It then took **300ms** to load at runtime, which dropped the start time
+from **12 seconds** to **300ms**. That AOT compilation cost was huge, but the gain was
+also big.  
+
+With IR generation, we can also generate object files. Amazingly, it can be done
+within the same **2 seconds** used to compile `clojure.core` in the first place.
+When loading that object file at runtime, jank can now start up in **150ms**. So, we
+spend a fraction of the time actually compiling the code and even less time
+loading it. Overall, for startup performance, LLVM IR has been a huge win. This
+is exactly what I wanted and I'm very pleased with the results.  
+
+<figure>
+  <object type="image/svg+xml" data="https://jank-lang.org/img/blog/2024-11-29-llvm-ir/startup-time.plot.svg" width="100%">
+    <img src="https://jank-lang.org/img/blog/2024-11-29-llvm-ir/startup-time.plot.svg" width="100%"></img>
+  </object>
+</figure>
+
+Note, when all of this is baked into the executable AOT, startup time is around
+**50ms**. jank doesn't support AOT compilation of full programs yet, but I've
+manually added the object files to jank's CMake build in order to test this out.
+Once we do have AOT compilation to binaries, we can also add direct linking,
+link-time optimizations (LTO), etc. and drop these numbers down even further.  
+
+### Runtime performance
+Runtime performance will be negatively impacted by IR generation, at least to
+start. The C++ code jank used to generate was quite optimized. I was taking
+advantage of various C++ features, like function overloading, type inference,
+and easy (yet ambiguous) unboxing of numerical values. With IR gen, we need to
+do all of those manually, rather than rely on a C++ compiler to help. This will
+take more work, but it also allows us to tailor the optimizations to best fit jank.  
+
+I'm not ready to report any benchmark results for runtime performance
+differences yet, since I don't think measuring the initial IR generation against
+the previous C++ generation is a good usage of time. Optimizing IR can happen as
+we go, without breaking any ABI compatibility. I'm more focused on getting
+jank released right now.  
+
+## Build system and portability improvements
+Apart from working on LLVM IR generation the past couple of months, I've put a
+fair amount of time into improving jank's builds system and dependency
+management. In particular, vcpkg has been removed entirely. I was using vcpkg to
+bring in some C and C++ source dependencies, but some of them regularly fail to
+build from source on very normal setups and vcpkg on its own causes issues with
+build systems such as Nix. Altogether, it's well known that the build system and
+dependency tooling for C and C++ is terrible. While I aim for jank to improve
+that, in its own way, we still need to suffer through it for the compiler
+itself.  
+
+In order to remove vcpkg, I had to address all of the dependencies it was
+pulling in.  
+
+* bdwgc (Boehm GC) requires compilation
+  * Added a submodule and hooked into CMake
+  * [Required a PR for CMake compatibility](https://github.com/ivmai/bdwgc/pull/675)
+* fmt requires compilation
+  * Added a submodule and hooked into CMake
+* libzipp requires compilation
+  * Added a submodule and hooked into CMake
+* immer is header-only
+  * Added a submodule
+* magic_enum is header-only
+  * Added a submodule
+* cli11 is header-only
+  * Added a submodule
+* doctest is in all major package repos
+* boost is in all major package repos
+  * `boost::preprocessor` isn't found by CMake on Ubuntu, but it's there
+    * Doesn't exist in brew's package, though
+    * Had to add as a submodule
+  * Causes Clang to crash while building `incremental.pch`
+
+All in all, this required **seven new git submodules**. Even something as
+commonplace as boost led to dependency issues across various platforms. I did
+all of my testing on Ubuntu, Arch, NixOS, and macOS. Once I had all of those
+submodules, I still ran into some issues with Clang hard crashing while trying
+to compile an incremental pre-compiled header (PCH) with boost. This was the
+final straw with PCHs for me.  
+
+### Pre-compiled headers  
+jank started out with PCHs from the beginning. I was concerned about
+compile-times and I knew that many source files would need access to the whole
+object model in order to compile. While this remains true, I didn't expect
+that PCHs would be such a headache when JIT compiling C++. There have been a
+handful of Cling bugs and then Clang bugs related to loading PCHs into the
+incremental C++ environment. I've spent entire days compiling Clang/LLVM while
+bisecting in order to find root causes.
+[Haruki](https://github.com/jank-lang/jank/pull/94) has been so close to building
+jank on Nix but has been running into issues when compiling the incremental PCH.
+This has been a long time coming.  
+
+After removing the PCHs, fixing all source files to include what they need,
+refactoring some heavy headers so they can be used less often, and running some
+tooling to further clean things up, we can wipe our hands of all of that. jank
+does still need to JIT compile C++ code, but it can do so using both a C API and
+a C++ API. Devs using jank can include what they need.  
+
+Previously, just loading the incremental PCH with all of jank's headers took a
+whopping 2.5 seconds every start up and we always paid that cost. Now, better
+following the (intended) nature of C++, we can pay for what we use.  
+
+### Community update  
+I have not been the only one working on jank. The past couple of months, my
+newest mentee through the [SciCloj mentorship program](https://scicloj.github.io/docs/community/groups/open-source-mentoring/),
+[Monty Bichouna](https://github.com/stmonty), has wrapped up Unicode lexing support. This
+allows jank the important ability to properly represent Unicode symbols and
+keywords. Monty's work builds on Saket's recent work to add Unicode support for
+character objects.  
+
+To further jank's interop story, [Saket Patel](https://github.com/Samy-33) also
+recently merged some changes which allow jank to accept include paths, linker
+paths, and linked shared libraries. This means that you can now include other C
+and C++ libraries from your JIT compiled bridge code and have the JIT linker
+resolve those symbols in your libraries. In other words,
+**it's now possible to use jank to wrap arbitrary C and C++ libraries**.
+Saket took this further by adding an opaque pointer wrapper object which can
+store any non-owned native pointer to be passed through any jank function and
+stored in any jank data structure. Each of these is a small step toward a much
+richer interop story. Saket also added persistent history to jank's CLI REPL and
+improved its usability by hooking into LLVM's line editing capabilities.  
+
+[Jianling Zhong](https://github.com/jianlingzhong) added support for ratio objects in
+jank, including the whole polymorphic math treatment necessary for them. He also
+implemented jank's delay object and corresponding `clojure.core/delay` macro as
+well as jank's repeat object, which backs the `clojure.core/repeat` function.  
+
+Finally, [Paula Gearon](https://github.com/quoll) has been making excellent
+progress on a sister project to jank,
+[clojure.core-test](https://github.com/jank-lang/clojure.core-test), which is a
+cross-dialect test suite for all of `clojure.core`. Ultimately, my goal with
+this is to aid Clojure dialect developers by providing a thorough test suite for
+Clojure's core functions. By being able to run and pass this suite, my
+confidence in jank will be strong. I'm sure other dialect developers will feel
+similarly.  
+
+### What's next?  
+I need to fix a couple remaining bugs with the LLVM IR generation and then
+implement IR generation for `try` nodes, in the next couple of weeks. After
+that, the next big goal is error reporting. This is an exciting task to tackle,
+since the impact of it is going to feel so rewarding. I have been suffering
+jank's terrible error messages for years. Even worse, we as an industry have
+been suffering terrible error messages for decades. There's been some exciting
+progress in [Elm](https://elm-lang.org/news/compiler-errors-for-humans),
+[Rust](https://blog.rust-lang.org/2016/08/10/Shape-of-errors-to-come.html), etc
+for improving the way errors are reported, providing
+actionable feedback, and including sufficient context to make errors less
+cryptic. I don't think that Clojure does well in this area, currently, and I aim
+to raise the bar.  
+
+If that sounds interesting, stay tuned for my next update!  <br>
+
+---
+
+