Skip to content

Failure Similarity: The stack trace or error message resembles patterns of past flaky failures. #18

@jmafoster1

Description

@jmafoster1

Based on the techniques detailed in Section 2 of this paper, there are three possible techniques here:

  • Simple text-based matching of exception types and stack traces (NOT exception messages, which can contain irrelevant details such as IP addresses that are unique to each run)
  • Failure Log Classifier trained on existing known flaky failures.
  • Term Frequency-Inverse Document Frequency of tokenised failure logs
  • Distance metric between embedding vectors (OpenAI API)

We could also calculate embeddings of failure logs and measure similarity that way.

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions