Section 2 of this paper details three techniques that could apply here:
Simple text-based matching of exception types and stack traces (NOT exception messages, which can contain irrelevant details such as IP addresses that are unique to each run)
Failure Log Classifier trained on existing known flaky failures.
Term Frequency-Inverse Document Frequency (TF-IDF) of tokenised failure logs
Distance metric between embedding vectors (OpenAI API)
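As a rough illustration of the text-based matching option, the sketch below reduces a failure log to a signature made of exception types and the top stack frames, and discards the exception message entirely. It assumes Java-style stack traces; the function name `failure_signature`, the two regular expressions, and the `max_frames` cut-off are hypothetical choices for this example, not something taken from the paper.

```python
import re

# Assumed log shape (hypothetical), e.g.:
#   java.net.ConnectException: Connection refused: 10.0.0.7:8080
#       at com.example.Client.connect(Client.java:42)
EXCEPTION_RE = re.compile(r"^\s*(?:Caused by:\s*)?([\w.$]+(?:Exception|Error))\b")
FRAME_RE = re.compile(r"^\s*at\s+([\w.$<>]+)\(")


def failure_signature(log: str, max_frames: int = 5) -> tuple:
    """Return (exception types, top frames), ignoring exception messages."""
    exceptions = []
    frames = []
    for line in log.splitlines():
        exc = EXCEPTION_RE.match(line)
        if exc:
            # Keep only the type name; the message after ':' is dropped.
            exceptions.append(exc.group(1))
            continue
        frame = FRAME_RE.match(line)
        if frame and len(frames) < max_frames:
            # Keep the method name only; file names and line numbers are dropped.
            frames.append(frame.group(1))
    return tuple(exceptions), tuple(frames)
```

Two failures would then be treated as the same known flake when their signatures are equal, even if details such as IP addresses in the message differ between runs.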
We could also calculate embeddings of failure logs and measure similarity that way.
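To make the similarity idea concrete, here is a minimal sketch of the TF-IDF variant using only the Python standard library. The tokeniser (lower-cased alphabetic tokens, so digits such as IP addresses are dropped), the smoothed IDF formula, and all function names are assumptions made for this example; a real implementation might use a library vectoriser instead.

```python
import math
import re
from collections import Counter


def tokenise(log: str) -> list:
    """Lower-case the log and keep alphabetic tokens only (digits are dropped)."""
    return re.findall(r"[a-z_]+", log.lower())


def tfidf_vectors(logs: list) -> list:
    """Return one {token: weight} dict per log, using a smoothed IDF."""
    docs = [Counter(tokenise(log)) for log in logs]
    n_docs = len(docs)
    # Number of logs in which each token appears at least once.
    doc_freq = Counter(token for doc in docs for token in doc)
    vectors = []
    for doc in docs:
        total = sum(doc.values()) or 1
        vectors.append({
            token: (count / total)
            * (math.log((1 + n_docs) / (1 + doc_freq[token])) + 1)
            for token, count in doc.items()
        })
    return vectors


def cosine_similarity(a: dict, b: dict) -> float:
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(token, 0.0) for token, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)
```

A new failure log could be vectorised together with the known flaky logs and flagged as a likely flake when its highest cosine similarity exceeds some threshold; that threshold would need to be tuned on real data.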