November 27, 2025
Generated: 2025-11-27 04:29 UTC
Total Duration: 10h 30m 9s
Iterations: 1
Judge (classifier) model: gpt-4.1
About this Benchmark
HolmesGPT is continuously evaluated against real-world Kubernetes and cloud troubleshooting scenarios.
If you find scenarios that HolmesGPT does not perform well on, please consider adding them as evals to the benchmark.
Model Accuracy Comparison
| Model | Pass | Fail | Skip/Error | Total | Success Rate |
|---|---|---|---|---|---|
| deepseek-3.1 | 62 | 32 | 22 | 116 | 🟡 66% (62/94) |
| gpt-5 | 47 | 49 | 20 | 116 | 🟡 49% (47/96) |
| gpt-5.1 | 64 | 31 | 21 | 116 | 🟡 67% (64/95) |
| haiku-4.5 | 66 | 29 | 21 | 116 | 🟡 69% (66/95) |
| sonnet-4.5 | 77 | 17 | 22 | 116 | 🟡 82% (77/94) |
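The success rates in the table are consistent with dividing passes by the number of runs that reached a verdict (pass plus fail), leaving skipped and errored runs out of the denominator. A minimal Python sketch of that arithmetic follows; the function name `success_rate` and the use of Python's built-in rounding are assumptions for illustration, not taken from the HolmesGPT benchmark code.

```python
def success_rate(passed: int, failed: int) -> int:
    """Return the pass percentage among runs that produced a verdict.

    Skipped and errored runs are excluded from the denominator
    (assumption inferred from the table, e.g. 62/94 with 62 + 32 = 94).
    """
    attempted = passed + failed
    if attempted == 0:
        raise ValueError("no completed runs to score")
    return round(100 * passed / attempted)


# (pass, fail) pairs copied from the accuracy table above
results = {
    "deepseek-3.1": (62, 32),
    "gpt-5": (47, 49),
    "gpt-5.1": (64, 31),
    "haiku-4.5": (66, 29),
    "sonnet-4.5": (77, 17),
}

for model, (passed, failed) in results.items():
    rate = success_rate(passed, failed)
    print(f"{model}: {rate}% ({passed}/{passed + failed})")
```

Because skips and errors are left out, two models with the same pass count can show different rates when their fail counts differ.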
Model Cost Comparison
| Model | Tests | Avg Cost | Min Cost | Max Cost | Total Cost |
|---|---|---|---|---|---|
| gpt-5 | 87 | $0.05 | $0.01 | $0.18 | $3.97 |
| gpt-5.1 | 86 | $0.10 | $0.01 | $0.31 | $8.61 |
| haiku-4.5 | 88 | $0.05 | $0.02 | $0.13 | $4.24 |
| sonnet-4.5 | 87 | $0.15 | $0.05 | $0.62 | $13.22 |
Model Latency Comparison
| Model | Avg (s) | Min (s) | Max (s) | P50 (s) | P95 (s) |
|---|---|---|---|---|---|
| deepseek-3.1 | 56.7 | 6.6 | 183.1 | 46.6 | 140.8 |
| gpt-5 | 28.4 | 3.8 | 613.5 | 17.5 | 46.8 |
| gpt-5.1 | 127.7 | 7.2 | 868.9 | 100.2 | 301.1 |
| haiku-4.5 | 31.3 | 0.0 | 207.0 | 26.2 | 77.4 |
| sonnet-4.5 | 41.9 | 0.0 | 211.9 | 38.2 | 80.0 |
Performance by Tag
Success rate by test category and model:
| Tag | deepseek-3.1 | gpt-5 | gpt-5.1 | haiku-4.5 | sonnet-4.5 | Warnings |
|---|---|---|---|---|---|---|
| chain-of-causation | 🔴 0% (0/1) | 🔴 0% (0/1) | 🔴 0% (0/1) | 🔴 0% (0/1) | 🔴 0% (0/1) | ⚠️ 45 skipped |
| compaction | 🟡 71% (5/7) | 🟢 100% (7/7) | 🔴 0% (0/7) | 🟡 29% (2/7) | 🟡 43% (3/7) | |
| context_window | 🟡 50% (3/6) | 🟡 50% (3/6) | 🟡 50% (3/6) | 🟡 50% (3/6) | 🟡 67% (4/6) | ⚠️ 5 skipped |
| counting | 🟡 50% (2/4) | 🟢 100% (4/4) | 🟢 100% (4/4) | 🟡 50% (2/4) | 🟢 100% (4/4) | |
| database | 🟢 100% (1/1) | 🔴 0% (0/1) | 🟢 100% (1/1) | 🟢 100% (1/1) | 🟢 100% (1/1) | ⚠️ 15 skipped |
| datadog | 🟡 50% (2/4) | 🟡 75% (3/4) | 🟡 75% (3/4) | 🟡 75% (3/4) | 🟡 75% (3/4) | |
| datetime | 🟡 75% (3/4) | 🟡 75% (3/4) | 🟡 75% (3/4) | 🟡 75% (3/4) | 🟢 100% (4/4) | ⚠️ 10 skipped |
| easy | 🟡 78% (32/41) | 🟡 56% (23/41) | 🟡 68% (28/41) | 🟡 76% (31/41) | 🟡 90% (37/41) | ⚠️ 5 skipped |
| hard | 🟡 40% (4/10) | 🟡 20% (2/10) | 🟡 40% (4/10) | 🟡 40% (4/10) | 🟡 70% (7/10) | ⚠️ 65 skipped |
| kafka | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚠️ 10 skipped |
| kubernetes | 🟡 65% (28/43) | 🟡 40% (17/43) | 🟡 67% (29/43) | 🟡 72% (31/43) | 🟡 81% (35/43) | ⚠️ 50 skipped |
| logs | 🟡 58% (15/26) | 🟡 50% (13/26) | 🟡 77% (20/26) | 🟡 69% (18/26) | 🟡 77% (20/26) | ⚠️ 35 skipped |
| medium | 🟡 60% (26/43) | 🟡 49% (22/45) | 🟡 73% (32/44) | 🟡 70% (31/44) | 🟡 77% (33/43) | ⚠️ 36 skipped |
| network | 🟡 50% (2/4) | 🟡 25% (1/4) | 🟡 50% (2/4) | 🟡 75% (3/4) | 🟡 75% (3/4) | |
| no-cicd | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚠️ 5 skipped |
| numerical | 🟢 100% (1/1) | 🔴 0% (0/1) | 🟢 100% (1/1) | 🟢 100% (1/1) | 🟢 100% (1/1) | |
| one-test | 🟢 100% (1/1) | 🔴 0% (0/1) | 🟢 100% (1/1) | 🟢 100% (1/1) | 🟢 100% (1/1) | |
| port-forward | 🔴 0% (0/5) | 🔴 0% (0/5) | 🟡 20% (1/5) | 🔴 0% (0/5) | 🟡 20% (1/5) | ⚠️ 35 skipped |
| prometheus | 🔴 0% (0/2) | 🔴 0% (0/2) | 🔴 0% (0/2) | 🔴 0% (0/2) | 🔴 0% (0/2) | ⚠️ 25 skipped |
| question-answer | 🟡 75% (3/4) | 🟡 50% (2/4) | 🟢 100% (4/4) | 🟢 100% (4/4) | 🟢 100% (4/4) | |
| runbooks | 🟡 67% (4/6) | 🟡 50% (3/6) | 🟡 67% (4/6) | 🟢 100% (6/6) | 🟢 100% (6/6) | ⚠️ 5 skipped |
| slackbot | ⚪️ - | 🔴 0% (0/1) | ⚪️ - | ⚪️ - | ⚪️ - | ⚠️ 4 skipped |
| traces | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚠️ 25 skipped |
| transparency | 🟡 79% (11/14) | 🟡 71% (10/14) | 🟡 86% (12/14) | 🟡 79% (11/14) | 🟡 93% (13/14) | ⚠️ 5 skipped |
| Overall | 🟡 66% (62/94) | 🟡 49% (47/96) | 🟡 67% (64/95) | 🟡 69% (66/95) | 🟡 82% (77/94) | ⚠️ 106 skipped |
Raw Results
Status of all evaluations across models. Color coding:
- 🟢 Passing 100% (stable)
- 🟡 Passing 1-99%
- 🔴 Passing 0% (failing)
- 🔧 Mock data failure (missing or invalid test data)
- ⚠️ Setup failure (environment/infrastructure issue)
- ⏱️ Timeout or rate limit error
- ⏭️ Test skipped (e.g., known issue or precondition not met)
Detailed Raw Results
| Eval ID | deepseek-3.1 | gpt-5 | gpt-5.1 | haiku-4.5 | sonnet-4.5 |
|---|---|---|---|---|---|
| 001_compaction | 🟢 100% (1/1) / ⏱️ 94.1s | 🟢 100% (1/1) / ⏱️ 72.4s | 🔴 0% (0/1) / ⏱️ 181.2s | 🟢 100% (1/1) / ⏱️ 55.1s | 🟢 100% (1/1) / ⏱️ 78.4s |
| 002_buried_exception | 🟢 100% (1/1) / ⏱️ 30.3s | 🟢 100% (1/1) / ⏱️ 28.3s | 🔴 0% (0/1) / ⏱️ 156.4s | 🟢 100% (1/1) / ⏱️ 29.6s | 🟢 100% (1/1) / ⏱️ 47.3s |
| 003_cascading_failure | 🔴 0% (0/1) / ⏱️ 53.1s | 🟢 100% (1/1) / ⏱️ 37.2s | 🔴 0% (0/1) / ⏱️ 172.2s | 🔴 0% (0/1) / ⏱️ 0.0s | 🔴 0% (0/1) / ⏱️ 0.1s |
| 004_multiple_root_causes | 🔴 0% (0/1) / ⏱️ 23.1s | 🟢 100% (1/1) / ⏱️ 232.3s | 🔴 0% (0/1) / ⏱️ 234.9s | 🔴 0% (0/1) / ⏱️ 0.1s | 🔴 0% (0/1) / ⏱️ 0.0s |
| 005_configuration_change | 🟢 100% (1/1) / ⏱️ 34.4s | 🟢 100% (1/1) / ⏱️ 27.4s | 🔴 0% (0/1) / ⏱️ 159.1s | 🔴 0% (0/1) / ⏱️ 0.0s | 🔴 0% (0/1) / ⏱️ 0.1s |
| 007_negative_findings | 🟢 100% (1/1) / ⏱️ 20.9s | 🟢 100% (1/1) / ⏱️ 31.0s | 🔴 0% (0/1) / ⏱️ 193.5s | 🔴 0% (0/1) / ⏱️ 0.0s | 🔴 0% (0/1) / ⏱️ 0.1s |
| 008_very_long_conversation | 🟢 100% (1/1) / ⏱️ 33.1s | 🟢 100% (1/1) / ⏱️ 32.6s | 🔴 0% (0/1) / ⏱️ 163.6s | 🔴 0% (0/1) / ⏱️ 55.2s | 🟢 100% (1/1) / ⏱️ 75.9s |
| 01_how_many_pods | 🔴 0% (0/1) / ⏱️ 16.6s | 🟢 100% (1/1) / ⏱️ 15.8s / 💰 $0.03 | 🟢 100% (1/1) / ⏱️ 27.3s / 💰 $0.03 | 🟢 100% (1/1) / ⏱️ 15.5s / 💰 $0.03 | 🟢 100% (1/1) / ⏱️ 17.3s / 💰 $0.08 |
| 02_what_is_wrong_with_pod | 🟢 100% (1/1) / ⏱️ 50.6s | 🟢 100% (1/1) / ⏱️ 28.1s / 💰 $0.08 | 🟢 100% (1/1) / ⏱️ 88.4s / 💰 $0.08 | 🟢 100% (1/1) / ⏱️ 21.9s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 32.5s / 💰 $0.13 |
| 03_what_is_the_command_to_port_forward | 🟢 100% (1/1) / ⏱️ 19.1s | 🔴 0% (0/1) / ⏱️ 5.5s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 33.3s / 💰 $0.03 | 🟢 100% (1/1) / ⏱️ 13.0s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 24.7s / 💰 $0.07 |
| 04_related_k8s_events | 🟢 100% (1/1) / ⏱️ 39.0s | 🔴 0% (0/1) / ⏱️ 5.0s | 🟢 100% (1/1) / ⏱️ 45.0s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 19.1s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 25.0s / 💰 $0.10 |
| 05_image_version | 🟢 100% (1/1) / ⏱️ 33.1s | 🟢 100% (1/1) / ⏱️ 24.0s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 32.9s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 20.3s / 💰 $0.03 | 🟢 100% (1/1) / ⏱️ 25.9s / 💰 $0.10 |
| 08_sock_shop_frontend | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 09_crashpod | 🟢 100% (1/1) / ⏱️ 107.4s | 🔴 0% (0/1) / ⏱️ 4.1s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 94.8s / 💰 $0.09 | 🟢 100% (1/1) / ⏱️ 44.2s / 💰 $0.07 | 🟢 100% (1/1) / ⏱️ 43.9s / 💰 $0.17 |
| 100a_historical_logs | 🔴 0% (0/1) / ⏱️ 53.2s | 🔴 0% (0/1) / ⏱️ 14.9s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 170.5s / 💰 $0.12 | 🔴 0% (0/1) / ⏱️ 207.0s / 💰 $0.13 | 🟢 100% (1/1) / ⏱️ 54.2s / 💰 $0.15 |
| 100b_historical_logs_nonstandard_label | 🔴 0% (0/1) / ⏱️ 46.6s | 🔴 0% (0/1) / ⏱️ 17.5s / 💰 $0.03 | 🔴 0% (0/1) / ⏱️ 301.1s / 💰 $0.21 | 🔴 0% (0/1) / ⏱️ 79.4s / 💰 $0.10 | 🔴 0% (0/1) / ⏱️ 52.8s / 💰 $0.20 |
| 101_historical_logs_pod_deleted | 🔴 0% (0/1) / ⏱️ 70.4s | 🔴 0% (0/1) / ⏱️ 9.7s / 💰 $0.01 | 🔴 0% (0/1) / ⏱️ 149.5s / 💰 $0.12 | 🔴 0% (0/1) / ⏱️ 42.9s / 💰 $0.04 | 🔴 0% (0/1) / ⏱️ 57.8s / 💰 $0.12 |
| 103_logs_transparency_default_limit | 🔴 0% (0/1) / ⏱️ 25.0s | 🟢 100% (1/1) / ⏱️ 28.0s / 💰 $0.07 | 🟢 100% (1/1) / ⏱️ 100.2s / 💰 $0.11 | 🟢 100% (1/1) / ⏱️ 36.4s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 37.1s / 💰 $0.12 |
| 104a_postgres_root_issue | 🟢 100% (1/1) / ⏱️ 95.6s | 🔴 0% (0/1) / ⏱️ 41.2s / 💰 $0.11 | 🟢 100% (1/1) / ⏱️ 136.3s / 💰 $0.13 | 🟢 100% (1/1) / ⏱️ 44.8s / 💰 $0.07 | 🟢 100% (1/1) / ⏱️ 66.5s / 💰 $0.18 |
| 104b_postgres_missing_index_pgstat | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 104c_postgres_minimal_missing_index | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 105_redis_wrong_data_structure | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 107_log_filter_http_status_code | 🔴 0% (0/1) / ⏱️ 118.1s | 🟢 100% (1/1) / ⏱️ 35.7s / 💰 $0.09 | 🟢 100% (1/1) / ⏱️ 251.9s / 💰 $0.20 | 🟢 100% (1/1) / ⏱️ 55.4s / 💰 $0.08 | 🟢 100% (1/1) / ⏱️ 59.6s / 💰 $0.21 |
| 108_logs_nearby_lines | 🔴 0% (0/1) / ⏱️ 75.2s | 🔴 0% (0/1) / ⏱️ 28.3s / 💰 $0.07 | 🔴 0% (0/1) / ⏱️ 133.0s | 🔴 0% (0/1) / ⏱️ 77.4s / 💰 $0.11 | 🔴 0% (0/1) / ⏱️ 76.6s / 💰 $0.24 |
| 109_logs_transparency_not_found | 🟢 100% (1/1) / ⏱️ 49.0s | 🟢 100% (1/1) / ⏱️ 25.7s / 💰 $0.10 | 🟢 100% (1/1) / ⏱️ 70.5s / 💰 $0.07 | 🟢 100% (1/1) / ⏱️ 31.0s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 23.1s / 💰 $0.10 |
| 10_image_pull_backoff | 🟢 100% (1/1) / ⏱️ 140.8s | 🔴 0% (0/1) / ⏱️ 5.0s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 80.5s / 💰 $0.10 | 🟢 100% (1/1) / ⏱️ 36.5s / 💰 $0.06 | 🟢 100% (1/1) / ⏱️ 41.0s / 💰 $0.15 |
| 110_k8s_events_image_pull | 🟢 100% (1/1) / ⏱️ 127.1s | 🟢 100% (1/1) / ⏱️ 21.3s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 103.8s / 💰 $0.11 | 🟢 100% (1/1) / ⏱️ 22.1s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 25.9s / 💰 $0.10 |
| 111_disabled_datadog_traces | 🔴 0% (0/1) / ⏱️ 13.8s | 🟢 100% (1/1) / ⏱️ 7.6s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 49.8s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 10.0s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 12.2s / 💰 $0.05 |
| 111_pod_names_contain_service | 🔴 0% (0/1) / ⏱️ 7.0s | 🔴 0% (0/1) / ⏱️ 5.0s / 💰 $0.01 | 🔴 0% (0/1) / ⏱️ 7.2s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 47.6s / 💰 $0.06 | 🟢 100% (1/1) / ⏱️ 45.5s / 💰 $0.28 |
| 112_find_pvcs_by_uuid | 🟢 100% (1/1) / ⏱️ 91.5s | 🔴 0% (0/1) / ⏱️ 8.9s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 52.9s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 23.6s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 35.5s / 💰 $0.15 |
| 114_checkout_latency_tracing_rebuild[0] | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 115_checkout_errors_tracing[0] | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 11_init_containers | 🟢 100% (1/1) / ⏱️ 89.4s | 🔴 0% (0/1) / ⏱️ 4.3s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 84.4s / 💰 $0.11 | 🟢 100% (1/1) / ⏱️ 36.1s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 37.6s / 💰 $0.15 |
| 121_new_relic_checkout_errors_tracing[0] | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 122_new_relic_checkout_latency_tracing_rebuild[0] | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 123_new_relic_checkout_errors_tracing[0] | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 124_checkout_latency_prometheus[0] | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 12_job_crashing | 🔴 0% (0/1) / ⏱️ 78.3s | 🔴 0% (0/1) / ⏱️ 37.5s / 💰 $0.10 | 🔴 0% (0/1) / ⏱️ 63.5s / 💰 $0.07 | 🟢 100% (1/1) / ⏱️ 31.8s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 56.5s / 💰 $0.18 |
| 13a_pending_node_selector_basic | 🟢 100% (1/1) / ⏱️ 45.1s | 🔴 0% (0/1) / ⏱️ 5.2s / 💰 $0.01 | 🔴 0% (0/1) / ⏱️ 15.7s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 32.3s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 39.2s / 💰 $0.16 |
| 13b_pending_node_selector_detailed | 🟢 100% (1/1) / ⏱️ 66.4s | 🔴 0% (0/1) / ⏱️ 5.0s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 211.1s / 💰 $0.19 | 🟢 100% (1/1) / ⏱️ 33.4s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 52.8s / 💰 $0.17 |
| 14_pending_resources | 🟢 100% (1/1) / ⏱️ 46.6s | 🔴 0% (0/1) / ⏱️ 4.7s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 100.2s / 💰 $0.09 | 🔴 0% (0/1) / ⏱️ 8.9s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 47.1s / 💰 $0.17 |
| 156_kafka_opensearch_latency | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 159_prometheus_high_cardinality_cpu[0] | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 159_prometheus_high_cardinality_cpu[1] | 🔴 0% (0/1) / ⏱️ 32.2s | 🔴 0% (0/1) / ⏱️ 16.1s / 💰 $0.03 | 🔴 0% (0/1) / ⏱️ 165.9s / 💰 $0.10 | 🔴 0% (0/1) / ⏱️ 13.9s / 💰 $0.03 | 🔴 0% (0/1) / ⏱️ 20.6s / 💰 $0.09 |
| 159_prometheus_high_cardinality_cpu[2] | 🔴 0% (0/1) / ⏱️ 32.3s | 🔴 0% (0/1) / ⏱️ 16.6s / 💰 $0.05 | 🔴 0% (0/1) / ⏱️ 115.0s / 💰 $0.09 | 🔴 0% (0/1) / ⏱️ 15.9s / 💰 $0.04 | 🔴 0% (0/1) / ⏱️ 23.6s / 💰 $0.10 |
| 15_failed_readiness_probe | 🔴 0% (0/1) / ⏱️ 7.3s | 🔴 0% (0/1) / ⏱️ 4.3s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 58.7s / 💰 $0.08 | 🟢 100% (1/1) / ⏱️ 33.6s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 47.9s / 💰 $0.22 |
| 160_electricity_market_bidding_bug[0] | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 161_bidding_version_performance[0] | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 16_failed_no_toolset_found | 🔴 0% (0/1) / ⏱️ 8.7s | 🔴 0% (0/1) / ⏱️ 9.1s / 💰 $0.01 | 🔴 0% (0/1) / ⏱️ 10.9s / 💰 $0.01 | 🔴 0% (0/1) / ⏱️ 10.9s / 💰 $0.02 | 🔴 0% (0/1) / ⏱️ 15.5s / 💰 $0.07 |
| 17_oom_kill | 🔴 0% (0/1) / ⏱️ 73.6s | 🔴 0% (0/1) / ⏱️ 5.4s / 💰 $0.01 | 🔴 0% (0/1) / ⏱️ 113.9s / 💰 $0.12 | 🔴 0% (0/1) / ⏱️ 39.6s / 💰 $0.06 | 🔴 0% (0/1) / ⏱️ 55.0s / 💰 $0.20 |
| 18_oom_kill_from_issues_history | 🔴 0% (0/1) / ⏱️ 37.5s | 🟢 100% (1/1) / ⏱️ 27.8s / 💰 $0.06 | 🔴 0% (0/1) / ⏱️ 298.2s / 💰 $0.26 | 🟢 100% (1/1) / ⏱️ 32.7s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 60.1s / 💰 $0.17 |
| 19_detect_missing_app_details | 🟢 100% (1/1) / ⏱️ 57.0s | 🔴 0% (0/1) / ⏱️ 24.4s / 💰 $0.07 | 🟢 100% (1/1) / ⏱️ 359.0s / 💰 $0.27 | 🟢 100% (1/1) / ⏱️ 34.6s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 80.0s / 💰 $0.31 |
| 20_long_log_file_search | 🟢 100% (1/1) / ⏱️ 91.6s | 🟢 100% (1/1) / ⏱️ 28.1s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 105.8s / 💰 $0.08 | 🔴 0% (0/1) / ⏱️ 26.2s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 39.3s / 💰 $0.12 |
| 21_job_fail_curl_no_svc_account | 🔴 0% (0/1) / ⏱️ 25.2s | 🔴 0% (0/1) / ⏱️ 613.5s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 868.9s / 💰 $0.23 | 🔴 0% (0/1) / ⏱️ 7.8s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 47.8s / 💰 $0.15 |
| 22_high_latency_dbi_down | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 23_app_error_in_current_logs | 🔴 0% (0/1) / ⏱️ 73.0s | 🔴 0% (0/1) / ⏱️ 35.0s / 💰 $0.09 | 🔴 0% (0/1) / ⏱️ 125.0s / 💰 $0.14 | 🔴 0% (0/1) / ⏱️ 51.4s / 💰 $0.07 | 🔴 0% (0/1) / ⏱️ 66.0s / 💰 $0.21 |
| 24_misconfigured_pvc | 🟢 100% (1/1) / ⏱️ 65.1s | 🔴 0% (0/1) / ⏱️ 4.2s / 💰 $0.01 | 🔴 0% (0/1) / ⏱️ 16.8s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 48.4s / 💰 $0.07 | 🟢 100% (1/1) / ⏱️ 41.5s / 💰 $0.17 |
| 24a_misconfigured_pvc_basic | 🔴 0% (0/1) / ⏱️ 7.0s | 🔴 0% (0/1) / ⏱️ 3.8s / 💰 $0.01 | 🔴 0% (0/1) / ⏱️ 14.3s / 💰 $0.01 | 🔴 0% (0/1) / ⏱️ 8.5s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 52.4s / 💰 $0.18 |
| 24b_misconfigured_pvc_detailed | 🔴 0% (0/1) / ⏱️ 111.9s | 🔴 0% (0/1) / ⏱️ 4.7s / 💰 $0.01 | 🔴 0% (0/1) / ⏱️ 14.5s / 💰 $0.01 | 🔴 0% (0/1) / ⏱️ 7.4s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 73.2s / 💰 $0.24 |
| 25_misconfigured_ingress_class | 🔴 0% (0/1) / ⏱️ 73.2s | 🔴 0% (0/1) / ⏱️ 12.1s / 💰 $0.02 | 🔴 0% (0/1) / ⏱️ 33.3s / 💰 $0.03 | 🔴 0% (0/1) / ⏱️ 11.4s / 💰 $0.02 | 🔴 0% (0/1) / ⏱️ 13.4s / 💰 $0.05 |
| 26_page_render_times | 🟢 100% (1/1) / ⏱️ 57.7s | 🔴 0% (0/1) / ⏱️ 6.4s | 🟢 100% (1/1) / ⏱️ 271.9s / 💰 $0.23 | 🟢 100% (1/1) / ⏱️ 23.9s / 💰 $0.06 | 🟢 100% (1/1) / ⏱️ 33.2s / 💰 $0.13 |
| 27a_multi_container_logs | 🟢 100% (1/1) / ⏱️ 46.6s | 🟢 100% (1/1) / ⏱️ 39.0s / 💰 $0.10 | 🟢 100% (1/1) / ⏱️ 116.7s / 💰 $0.13 | 🟢 100% (1/1) / ⏱️ 23.3s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 44.7s / 💰 $0.17 |
| 27b_multi_container_logs | 🟢 100% (1/1) / ⏱️ 18.5s | 🟢 100% (1/1) / ⏱️ 39.3s / 💰 $0.10 | 🟢 100% (1/1) / ⏱️ 160.0s / 💰 $0.17 | 🟢 100% (1/1) / ⏱️ 20.1s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 29.9s / 💰 $0.13 |
| 28_permissions_error | 🟢 100% (1/1) / ⏱️ 23.8s | 🔴 0% (0/1) / ⏱️ 19.8s / 💰 $0.05 | 🔴 0% (0/1) / ⏱️ 96.6s / 💰 $0.09 | 🔴 0% (0/1) / ⏱️ 11.2s / 💰 $0.03 | 🟢 100% (1/1) / ⏱️ 19.7s / 💰 $0.09 |
| 33_cpu_metrics_discovery | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 39_failed_toolset | 🟢 100% (1/1) / ⏱️ 50.2s | 🔴 0% (0/1) / ⏱️ 8.4s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 84.5s / 💰 $0.06 | 🟢 100% (1/1) / ⏱️ 27.7s / 💰 $0.03 | 🟢 100% (1/1) / ⏱️ 33.5s / 💰 $0.09 |
| 41_setup_argo | 🟢 100% (1/1) / ⏱️ 6.6s | 🟢 100% (1/1) / ⏱️ 10.1s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 33.7s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 7.0s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 10.6s / 💰 $0.05 |
| 42_dns_issues_result_new_tools_no_runbook | 🔴 0% (0/1) / ⏱️ 114.4s | 🔴 0% (0/1) / ⏱️ 7.1s / 💰 $0.02 | 🔴 0% (0/1) / ⏱️ 239.7s / 💰 $0.12 | 🟢 100% (1/1) / ⏱️ 30.3s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 92.4s / 💰 $0.11 |
| 42_dns_issues_steps_new_tools | 🟢 100% (1/1) / ⏱️ 182.9s | 🔴 0% (0/1) / ⏱️ 11.8s / 💰 $0.03 | 🟢 100% (1/1) / ⏱️ 212.1s / 💰 $0.18 | 🟢 100% (1/1) / ⏱️ 135.3s / 💰 $0.08 | 🟢 100% (1/1) / ⏱️ 211.9s / 💰 $0.25 |
| 43_current_datetime_from_prompt | 🟢 100% (1/1) / ⏱️ 6.6s | 🔴 0% (0/1) / ⏱️ 9.6s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 34.7s / 💰 $0.03 | 🟢 100% (1/1) / ⏱️ 5.8s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 7.0s / 💰 $0.06 |
| 43_slack_deployment_logs | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 44_slack_statefulset_logs | ⚪️ - | 🔴 0% (0/1) / ⏱️ 4.7s / 💰 $0.01 | ⚪️ - | ⚪️ - | ⚪️ - |
| 45_fetch_deployment_logs_simple | 🟢 100% (1/1) / ⏱️ 55.0s | 🟢 100% (1/1) / ⏱️ 28.9s / 💰 $0.10 | 🔴 0% (0/1) / ⏱️ 15.2s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 24.7s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 28.6s / 💰 $0.12 |
| 48_logs_since_thursday | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 50_logs_since_specific_date | 🟢 100% (1/1) / ⏱️ 25.4s | 🔴 0% (0/1) / ⏱️ 9.1s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 55.5s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 11.0s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 20.4s / 💰 $0.06 |
| 50a_logs_since_last_specific_month | 🔴 0% (0/1) / ⏱️ 8.7s | 🔴 0% (0/1) / ⏱️ 12.2s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 89.8s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 16.4s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 18.7s / 💰 $0.07 |
| 51_logs_summarize_errors | 🔴 0% (0/1) / ⏱️ 17.9s | 🔴 0% (0/1) / ⏱️ 5.6s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 90.1s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 15.9s / 💰 $0.03 | 🟢 100% (1/1) / ⏱️ 26.8s / 💰 $0.08 |
| 52_logs_login_issues | 🟢 100% (1/1) / ⏱️ 30.3s | 🔴 0% (0/1) / ⏱️ 4.0s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 93.9s / 💰 $0.09 | 🟢 100% (1/1) / ⏱️ 24.6s / 💰 $0.03 | 🟢 100% (1/1) / ⏱️ 29.6s / 💰 $0.10 |
| 53_logs_find_term | 🟢 100% (1/1) / ⏱️ 20.8s | 🔴 0% (0/1) / ⏱️ 4.5s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 62.0s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 15.2s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 22.0s / 💰 $0.11 |
| 54_not_truncated_when_getting_pods | 🟢 100% (1/1) / ⏱️ 33.9s | 🔴 0% (0/1) / ⏱️ 6.3s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 70.6s / 💰 $0.06 | 🟢 100% (1/1) / ⏱️ 20.1s / 💰 $0.03 | 🟢 100% (1/1) / ⏱️ 23.9s / 💰 $0.08 |
| 55_kafka_runbook | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 57_wrong_namespace | 🟢 100% (1/1) / ⏱️ 20.5s | 🔴 0% (0/1) / ⏱️ 6.8s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 65.0s / 💰 $0.06 | 🟢 100% (1/1) / ⏱️ 24.8s / 💰 $0.03 | 🔴 0% (0/1) / ⏱️ 29.6s / 💰 $0.10 |
| 59_label_based_counting | 🟢 100% (1/1) / ⏱️ 20.9s | 🟢 100% (1/1) / ⏱️ 19.4s / 💰 $0.03 | 🟢 100% (1/1) / ⏱️ 38.6s / 💰 $0.03 | 🔴 0% (0/1) / ⏱️ 15.6s / 💰 $0.03 | 🟢 100% (1/1) / ⏱️ 16.4s / 💰 $0.09 |
| 60_count_less_than | 🔴 0% (0/1) / ⏱️ 23.1s | 🟢 100% (1/1) / ⏱️ 15.0s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 29.0s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 20.2s / 💰 $0.03 | 🟢 100% (1/1) / ⏱️ 23.4s / 💰 $0.10 |
| 61_exact_match_counting | 🟢 100% (1/1) / ⏱️ 19.2s | 🟢 100% (1/1) / ⏱️ 15.4s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 27.2s / 💰 $0.03 | 🔴 0% (0/1) / ⏱️ 16.4s / 💰 $0.03 | 🟢 100% (1/1) / ⏱️ 17.5s / 💰 $0.09 |
| 62_fetch_error_logs_with_errors | 🟢 100% (1/1) / ⏱️ 25.9s | 🟢 100% (1/1) / ⏱️ 18.4s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 38.1s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 20.9s / 💰 $0.03 | 🟢 100% (1/1) / ⏱️ 24.3s / 💰 $0.10 |
| 63_fetch_error_logs_no_errors | 🟢 100% (1/1) / ⏱️ 30.5s | 🟢 100% (1/1) / ⏱️ 23.6s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 102.8s / 💰 $0.07 | 🟢 100% (1/1) / ⏱️ 33.8s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 32.5s / 💰 $0.12 |
| 64_keda_vs_hpa_confusion | 🟢 100% (1/1) / ⏱️ 183.1s | 🔴 0% (0/1) / ⏱️ 6.3s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 287.8s / 💰 $0.22 | 🔴 0% (0/1) / ⏱️ 38.5s / 💰 $0.06 | 🔴 0% (0/1) / ⏱️ 37.2s / 💰 $0.14 |
| 65_health_check_followup | 🟢 100% (1/1) / ⏱️ 69.4s | 🟢 100% (1/1) / ⏱️ 41.6s / 💰 $0.12 | 🟢 100% (1/1) / ⏱️ 136.5s / 💰 $0.13 | 🔴 0% (0/1) / ⏱️ 43.5s / 💰 $0.07 | 🟢 100% (1/1) / ⏱️ 52.7s / 💰 $0.19 |
| 71_connection_pool_starvation | 🟢 100% (1/1) / ⏱️ 95.2s | 🟢 100% (1/1) / ⏱️ 29.7s / 💰 $0.07 | 🟢 100% (1/1) / ⏱️ 206.3s / 💰 $0.22 | 🟢 100% (1/1) / ⏱️ 39.5s / 💰 $0.07 | 🟢 100% (1/1) / ⏱️ 39.4s / 💰 $0.19 |
| 73a_time_window_anomaly | 🟢 100% (1/1) / ⏱️ 68.7s | 🟢 100% (1/1) / ⏱️ 36.0s / 💰 $0.09 | 🟢 100% (1/1) / ⏱️ 90.7s / 💰 $0.09 | 🟢 100% (1/1) / ⏱️ 42.9s / 💰 $0.07 | 🟢 100% (1/1) / ⏱️ 54.0s / 💰 $0.19 |
| 73b_time_window_anomaly | 🔴 0% (0/1) / ⏱️ 107.0s | 🟢 100% (1/1) / ⏱️ 29.2s / 💰 $0.08 | 🔴 0% (0/1) / ⏱️ 115.4s | 🔴 0% (0/1) / ⏱️ 55.9s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 70.5s / 💰 $0.34 |
| 76_service_discovery_issue | 🟢 100% (1/1) / ⏱️ 127.0s | 🟢 100% (1/1) / ⏱️ 29.5s / 💰 $0.07 | 🟢 100% (1/1) / ⏱️ 82.8s / 💰 $0.13 | 🟢 100% (1/1) / ⏱️ 45.9s / 💰 $0.07 | 🟢 100% (1/1) / ⏱️ 52.1s / 💰 $0.17 |
| 77_liveness_probe_misconfiguration | 🟢 100% (1/1) / ⏱️ 46.5s | 🟢 100% (1/1) / ⏱️ 33.2s / 💰 $0.08 | 🟢 100% (1/1) / ⏱️ 90.4s / 💰 $0.11 | 🟢 100% (1/1) / ⏱️ 31.7s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 57.0s / 💰 $0.19 |
| 78a_missing_cpu_limits | 🟢 100% (1/1) / ⏱️ 126.7s | 🟢 100% (1/1) / ⏱️ 29.6s / 💰 $0.08 | 🟢 100% (1/1) / ⏱️ 288.9s / 💰 $0.24 | 🟢 100% (1/1) / ⏱️ 39.0s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 54.8s / 💰 $0.16 |
| 78b_cpu_quota_exceeded | 🔴 0% (0/1) / ⏱️ 95.0s | 🔴 0% (0/1) / ⏱️ 30.8s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 124.5s / 💰 $0.14 | 🟢 100% (1/1) / ⏱️ 38.1s / 💰 $0.06 | 🟢 100% (1/1) / ⏱️ 59.4s / 💰 $0.17 |
| 79_configmap_mount_issue | 🟢 100% (1/1) / ⏱️ 40.1s | 🟢 100% (1/1) / ⏱️ 33.5s / 💰 $0.08 | 🟢 100% (1/1) / ⏱️ 161.3s / 💰 $0.12 | 🟢 100% (1/1) / ⏱️ 28.8s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 38.2s / 💰 $0.13 |
| 80_pvc_storage_class_mismatch | 🟢 100% (1/1) / ⏱️ 53.9s | 🔴 0% (0/1) / ⏱️ 35.2s / 💰 $0.09 | 🟢 100% (1/1) / ⏱️ 105.2s / 💰 $0.11 | 🟢 100% (1/1) / ⏱️ 38.0s / 💰 $0.06 | 🟢 100% (1/1) / ⏱️ 49.8s / 💰 $0.18 |
| 81_service_account_permission_denied | 🟢 100% (1/1) / ⏱️ 62.0s | 🟢 100% (1/1) / ⏱️ 32.6s / 💰 $0.08 | 🟢 100% (1/1) / ⏱️ 100.1s / 💰 $0.12 | 🟢 100% (1/1) / ⏱️ 41.1s / 💰 $0.09 | 🟢 100% (1/1) / ⏱️ 71.4s / 💰 $0.26 |
| 82_pod_anti_affinity_conflict | 🟢 100% (1/1) / ⏱️ 139.8s | 🔴 0% (0/1) / ⏱️ 39.0s / 💰 $0.10 | 🟢 100% (1/1) / ⏱️ 167.7s / 💰 $0.16 | 🔴 0% (0/1) / ⏱️ 46.9s / 💰 $0.08 | 🔴 0% (0/1) / ⏱️ 42.0s / 💰 $0.17 |
| 83_secret_not_found | 🟢 100% (1/1) / ⏱️ 48.9s | 🟢 100% (1/1) / ⏱️ 29.8s / 💰 $0.08 | 🟢 100% (1/1) / ⏱️ 173.5s / 💰 $0.13 | 🟢 100% (1/1) / ⏱️ 29.6s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 47.6s / 💰 $0.17 |
| 84_network_policy_blocking_traffic | 🟢 100% (1/1) / ⏱️ 178.7s | 🟢 100% (1/1) / ⏱️ 42.6s / 💰 $0.09 | 🟢 100% (1/1) / ⏱️ 199.8s / 💰 $0.17 | 🟢 100% (1/1) / ⏱️ 54.1s / 💰 $0.08 | 🟢 100% (1/1) / ⏱️ 74.4s / 💰 $0.26 |
| 85_hpa_not_scaling | 🟢 100% (1/1) / ⏱️ 56.4s | 🟢 100% (1/1) / ⏱️ 46.8s / 💰 $0.08 | 🟢 100% (1/1) / ⏱️ 126.6s / 💰 $0.13 | 🟢 100% (1/1) / ⏱️ 33.2s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 39.9s / 💰 $0.15 |
| 86_configmap_like_but_secret | 🟢 100% (1/1) / ⏱️ 66.2s | 🟢 100% (1/1) / ⏱️ 37.9s / 💰 $0.11 | 🔴 0% (0/1) / ⏱️ 14.0s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 43.6s / 💰 $0.06 | 🟢 100% (1/1) / ⏱️ 46.6s / 💰 $0.15 |
| 89_runbook_missing_cloudwatch | 🟢 100% (1/1) / ⏱️ 22.5s | 🟢 100% (1/1) / ⏱️ 14.0s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 60.0s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 17.7s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 39.8s / 💰 $0.10 |
| 90_runbook_basic_selection | 🟢 100% (1/1) / ⏱️ 115.2s | 🟢 100% (1/1) / ⏱️ 29.5s / 💰 $0.06 | 🟢 100% (1/1) / ⏱️ 292.6s / 💰 $0.26 | 🟢 100% (1/1) / ⏱️ 84.4s / 💰 $0.11 | 🟢 100% (1/1) / ⏱️ 147.5s / 💰 $0.62 |
| 91f_datadog_logs_historical_pod | 🔴 0% (0/1) / ⏱️ 19.8s | 🔴 0% (0/1) / ⏱️ 7.8s / 💰 $0.01 | 🔴 0% (0/1) / ⏱️ 202.7s / 💰 $0.13 | 🔴 0% (0/1) / ⏱️ 11.1s / 💰 $0.02 | 🔴 0% (0/1) / ⏱️ 12.3s / 💰 $0.05 |
| 93_calling_datadog[0] | 🟢 100% (1/1) / ⏱️ 20.3s | 🟢 100% (1/1) / ⏱️ 9.9s / 💰 $0.06 | 🟢 100% (1/1) / ⏱️ 36.8s / 💰 $0.06 | 🟢 100% (1/1) / ⏱️ 9.0s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 19.2s / 💰 $0.13 |
| 93_calling_datadog[1] | 🔴 0% (0/1) / ⏱️ 10.9s | 🟢 100% (1/1) / ⏱️ 17.5s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 56.1s / 💰 $0.08 | 🟢 100% (1/1) / ⏱️ 8.8s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 14.9s / 💰 $0.13 |
| 93_calling_datadog[2] | 🟢 100% (1/1) / ⏱️ 16.0s | 🟢 100% (1/1) / ⏱️ 10.8s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 29.1s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 9.5s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 15.7s / 💰 $0.13 |
| 93_events_since_specific_date | ⚪️ - | 🔴 0% (0/1) / ⏱️ 4.4s / 💰 $0.01 | 🔴 0% (0/1) / ⏱️ 14.5s / 💰 $0.01 | 🔴 0% (0/1) / ⏱️ 7.5s / 💰 $0.02 | ⚪️ - |
| 94_runbook_transparency | 🟢 100% (1/1) / ⏱️ 52.3s | 🟢 100% (1/1) / ⏱️ 33.7s / 💰 $0.07 | 🟢 100% (1/1) / ⏱️ 348.9s / 💰 $0.10 | 🟢 100% (1/1) / ⏱️ 67.5s / 💰 $0.10 | 🟢 100% (1/1) / ⏱️ 123.3s / 💰 $0.30 |
| 96_no_matching_runbook | 🔴 0% (0/1) / ⏱️ 165.1s | 🔴 0% (0/1) / ⏱️ 51.4s / 💰 $0.18 | 🔴 0% (0/1) / ⏱️ 485.3s / 💰 $0.31 | 🟢 100% (1/1) / ⏱️ 52.8s / 💰 $0.07 | 🟢 100% (1/1) / ⏱️ 70.7s / 💰 $0.26 |
| 97_logs_clarification_needed | 🟢 100% (1/1) / ⏱️ 8.0s | 🟢 100% (1/1) / ⏱️ 5.5s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 7.2s / 💰 $0.01 | 🟢 100% (1/1) / ⏱️ 6.4s / 💰 $0.02 | 🟢 100% (1/1) / ⏱️ 9.0s / 💰 $0.06 |
| 98_logs_transparency_default_time | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - | ⚪️ - |
| 99_logs_transparency_custom_time | 🟢 100% (1/1) / ⏱️ 74.5s | 🟢 100% (1/1) / ⏱️ 20.2s / 💰 $0.05 | 🟢 100% (1/1) / ⏱️ 99.7s / 💰 $0.10 | 🟢 100% (1/1) / ⏱️ 24.4s / 💰 $0.04 | 🟢 100% (1/1) / ⏱️ 35.0s / 💰 $0.12 |
Results are automatically generated and updated weekly. View full traces and detailed analysis in the Braintrust experiment local-benchmark-20251126-175926.