-
Notifications
You must be signed in to change notification settings - Fork 361
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Production - [Alerting] Android emulator failure rate alert #11605
Comments
💔 Metric state changed to alerting
|
3 similar comments
💔 Metric state changed to alerting
|
💔 Metric state changed to alerting
|
💔 Metric state changed to alerting
|
These are the sources of the problematic work items
The run: https://dev.azure.com/dnceng-public/public/_build/results?buildId=81161&view=results The query: Metrics
| where EventType == "_MobileDeviceOperation" and MetricName == "ExitCode" and Timestamp > now()-3d
| extend Dimensions = parse_json(Dimensions)
| extend
ExitCode = MetricValue,
Command = tostring(Dimensions.command),
Platform = tostring(Dimensions.platform),
Target = tostring(Dimensions.target)
| where Platform == "android" and ExitCode == 80 and Target != ""
| join kind=inner Jobs on JobId
| extend Props = parse_json(Properties)
| extend
Stage=tostring(Props['System.PhaseName']),
Pipeline=tostring(Props.DefinitionName)
| summarize count() by Source, JobName, Pipeline, Stage
| order by count_ desc |
Doesn't seem like an infra issue. The exit code is
|
@akoeplinger I might have found something possibly interesting to look into. There was a spike on Friday->Saturday night, Redmond evening, when the This alert never fires and there are no infra failures by the 3rd attempt for Android emulators (like ever). Literally, this line graph is not a line graph as there are no data points except for Saturday: All work items end with |
I checked a few of these failures and they all crash with this in the adb log:
|
This is almost certainly because of dotnet/runtime#78018 |
@premun I've opened a PR to revert the problematic change, thanks for catching it! |
Thanks for investigating this @premun and taking action @akoeplinger. Also, happy to see our mobile telemetry helping to identify a product issue. |
💔 Metric state changed to alerting
Go to rule
@dotnet/dnceng, please investigate
Automation information below, do not change
Grafana-Automated-Alert-Id-e38f14fe3367451d8de43da6e2453fdd
The text was updated successfully, but these errors were encountered: