Skip to content

Commit

Permalink
Add proposal for archiving Canary (#134)
Browse files Browse the repository at this point in the history
* Add proposal for archiving Canary

Signed-off-by: Lukas Kral <lukywill16@gmail.com>

* Jakub's comment

Signed-off-by: Lukas Kral <lukywill16@gmail.com>

* Apply suggestions from Paul

Co-authored-by: PaulRMellor <47596553+PaulRMellor@users.noreply.github.com>
Signed-off-by: Lukáš Král <53821852+im-konge@users.noreply.github.com>

* Jakub's comments

Signed-off-by: Lukas Kral <lukywill16@gmail.com>

* add canary proposal to README

Signed-off-by: Lukas Kral <lukywill16@gmail.com>

---------

Signed-off-by: Lukas Kral <lukywill16@gmail.com>
Signed-off-by: Lukáš Král <53821852+im-konge@users.noreply.github.com>
Co-authored-by: PaulRMellor <47596553+PaulRMellor@users.noreply.github.com>
  • Loading branch information
im-konge and PaulRMellor authored Oct 18, 2024
1 parent 0ebbd70 commit 458049c
Show file tree
Hide file tree
Showing 2 changed files with 180 additions and 87 deletions.
92 changes: 92 additions & 0 deletions 086-archive-canary.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,92 @@
# Archive Canary project

This proposal is about archiving the [Canary project](https://github.com/strimzi/strimzi-canary).

## Current situation

The Canary project, written in Go programming language, is a component for monitoring a Kafka cluster.
It provides functionality to periodically check the availability of the Kafka cluster during normal run, upgrades, downgrades, and rolling updates.
That is done by connection check to the cluster, but also producer and consumer, that periodically do the message transmission between Kafka cluster and Canary.
The Canary then provides Prometheus metrics and alerts for users or cluster admins to react on the Kafka cluster issues.

## Motivation

The Canary, as mentioned, is written in Go, however the Strimzi organization (and the engineers working on the projects inside this organization) is focused on using the Java programming language.
This means the team is missing the necessary expertise in Go in order to solve various issues in Canary.
Based on the previous proposal from PR [#58](https://github.com/strimzi/proposals/pull/58), the Sarama Kafka client library lacks features that are available in the official
Java Kafka client library - and the Sarama library is not the official Kafka library in comparison to the Java one.
Additionally, there are multiple [issues created in the Canary project](https://github.com/strimzi/strimzi-canary/issues) that have remained with comment or unresolved for a significant amount of time.
The project's dependencies haven't been updated for two years now, meaning that there can be a lot of CVEs, unresolved issues with newer Kafka versions, and so on.
The inactivity of the project shows that we do not have the time to continue developing Canary, which is also the main reason for creating this proposal.

Because of these issues, I'm proposing to archive the Canary project.

## Canary in Java

After few releases of Canary we realized that there is not enough expertise for the Go language and Sarama Kafka client doesn't provide functionality we need,
so the [PR proposing to move Canary to Java](https://github.com/strimzi/proposals/pull/58) was created.
The proposal contained all issues with the current implementation together with proposed implementation and changes for the Canary in Java.
In parallel with the proposal, the POC was written in Java and is available in the [im-konge/canary-java](https://github.com/im-konge/canary-java) repository.
However, during the implementation process, we found out that few things are not possible using Java Kafka clients (for example the connection check that was one of the main features of Canary) and that
metrics related to Sarama client are not relevant anymore.
Changes like this would break the backwards compatibility, meaning that it would not be 1:1 copy of the Canary in Go.

Other than that, we thought about changing the metrics provided by Canary to be more insightful, but we were unable to reach a consensus on the specific metrics to include.
We had discussions with community users about how the Canary can be more useful to them, but the users ended up writing their own Canary-like tool
with functionality useful to them.

Because we didn't move forward with the proposal and agreement on the future of the Canary, after more than one year from the moment the proposal was created, we decided to close the
proposal, and we agreed to archive the project on [community meeting on May 30th 2024](https://youtu.be/UpStul__uCw?si=GTA5edXJEnGgxP1a).
If the community finds Canary valuable, provides feedback on how to improve its metrics and functionality, and we have sufficient capacity and resources, we will consider proposing a new project to rewrite Canary in Java.

## Proposal

After this proposal is approved, we should:
- archive the Canary project
- remove the Canary install files from Strimzi operators repository
- now from the `/packaging/install` folder
- as part of the next Strimzi release, the installation files will be removed from the `/install` folder
- delete the mentions about the Canary project from the documentation and automation files (Makefiles)
- inform users about archiving the project

If the community:

- finds Canary useful
- provides additional information about how to improve its metrics or functionality
- and we have the capacity to maintain the project

we can consider proposing a new project to rewrite Canary in Java.

## Affected/not affected projects

The only affected project is the [Canary](https://github.com/strimzi/strimzi-canary) that should be archived.
In the Strimzi operators repository, the only affected parts are the [installation files](https://github.com/strimzi/strimzi-kafka-operator/tree/main/install/canary) and
[in development installation files](https://github.com/strimzi/strimzi-kafka-operator/tree/main/packaging/install/canary) that should be deleted by this proposal.

## Compatibility

The backwards compatibility is not relevant in this case, as the project will be archived and there is no other solution currently that would replace it.

## Rejected alternatives

### Maintaining Canary in Go and providing additional functionality

One of the rejected alternative was to keep updating the Canary in Go and add more functionality to it.
This was discussed and rejected because of:
- lack of Go experts in the Strimzi organization
- resources - it would take a lot of time to learn Go and properly testing every new change without knowing how it will work
- missing functionality in the Sarama Kafka client

### Maintaining Canary for dependency updates

Another alternative was to keep Canary project and updating the dependencies.
Some of the dependency updates can be without breaking changes, but from time to time there are changes in the dependencies that requires additional
changes to the code, which brings us to the same situation as the previous alternative - someone would need to do the changes to the code,
test it properly and then release it.
Because of these issues, we decided to reject this alternative.

### Rewrite Canary in Java

A final alternative was to rewrite Canary in Java, but as was mentioned in the [Canary in Java](#canary-in-java), it would not be the same Canary as in Go,
and we didn't agree on how the new metrics of the Canary in Java (and the overall implementation) should look like.
Because of this, we decided to reject this alternative.
Loading

0 comments on commit 458049c

Please sign in to comment.