Fix log injection smoke test flakiness under CI load #10999
Merged
gh-worker-dd-mergequeue-cf854d[bot] merged 1 commit into master on Mar 30, 2026
Benchmarks
Startup
Summary: Found 0 performance improvements and 0 performance regressions. Performance is the same for 63 metrics, 8 unstable metrics.
[Gantt charts: startup time reports for petclinic and insecure-bank (global startup overhead and breakdown per module); candidate=1.61.0-SNAPSHOT~1c9c252b82, baseline=1.61.0-SNAPSHOT~92b8b827eb]
Load
Summary: Found 1 performance improvement and 3 performance regressions. Performance is the same for 16 metrics, 16 unstable metrics.
[Gantt charts: request duration reports [CI 0.99] for insecure-bank and petclinic; candidate=1.61.0-SNAPSHOT~1c9c252b82, baseline=1.61.0-SNAPSHOT~92b8b827eb]
Dacapo
Summary: Found 1 performance improvement and 0 performance regressions. Performance is the same for 11 metrics, 0 unstable metrics.
[Gantt charts: execution time [CI 0.99] for biojava and tomcat; candidate=1.61.0-SNAPSHOT~1c9c252b82, baseline=1.61.0-SNAPSHOT~92b8b827eb]
Branch updated: 454b292 to d07fb98
Branch updated: 485cafd to 5249f8d
Increase BaseApplication.TIMEOUT_IN_NANOS from 10s to 30s to align with the test's PollingConditions timeout (also 30s).

This is the only smoke test with a bidirectional handshake: the app blocks waiting for remote config changes, and the test blocks waiting for traces. Under CI load, the trace writer's first flush (including feature discovery) can exceed 10s. When the app's timeout fires first, it crashes the process, killing the daemon trace-writer thread before it delivers traces, so traceCount stays at 0.

Aligning both timeouts to 30s ensures the test's polling condition is always the binding constraint, preventing the app from independently crashing.

Observed across all JDK versions (8, 11, 17, 21, 25): 17 failures in the last 30 days.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
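For readers unfamiliar with the pattern, here is a minimal sketch of a crash-on-timeout polling loop of the kind the commit message describes. It is not the actual `BaseApplication` code: the class name `ConditionWaiter`, the method signature, the 10 ms poll interval, and the exception type are all assumptions made for illustration. Only the constant name and the 30s value come from this PR.

```java
import java.util.concurrent.TimeUnit;
import java.util.function.BooleanSupplier;

final class ConditionWaiter {
    // Assumed shape of the constant; the PR raises the value from 10s to 30s.
    static final long TIMEOUT_IN_NANOS = TimeUnit.SECONDS.toNanos(30);

    /**
     * Polls until the condition holds or the timeout elapses.
     * Throws on timeout; if nothing catches this on the main thread,
     * the main thread ends and any daemon threads stop with the JVM.
     */
    static void waitForCondition(BooleanSupplier condition, long timeoutNanos) {
        final long start = System.nanoTime();
        while (!condition.getAsBoolean()) {
            // Compare elapsed time, not absolute values, so nanoTime() overflow is harmless.
            if (System.nanoTime() - start >= timeoutNanos) {
                throw new IllegalStateException(
                    "Condition not met within "
                        + TimeUnit.NANOSECONDS.toMillis(timeoutNanos) + " ms");
            }
            try {
                Thread.sleep(10); // hypothetical poll interval
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new IllegalStateException("Interrupted while waiting", e);
            }
        }
    }

    public static void main(String[] args) {
        final long begin = System.nanoTime();
        // The condition becomes true after roughly 100 ms, well inside the timeout.
        waitForCondition(
            () -> System.nanoTime() - begin > TimeUnit.MILLISECONDS.toNanos(100),
            TIMEOUT_IN_NANOS);
        System.out.println("condition met");
    }
}
```

The point of the sketch is that the timeout is a hard failure rather than a retry, so its value has to be at least as long as anything the other side of the handshake may legitimately wait for.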
Branch updated: 5249f8d to 1c9c252
mhlidd (Contributor) approved these changes on Mar 30, 2026
/merge
What Does This Do
Increases `BaseApplication.TIMEOUT_IN_NANOS` from 10s to 30s to align with the test's `PollingConditions` timeout.

Motivation
This is the only smoke test with a bidirectional handshake: the app blocks waiting for remote config changes from the test, and the test blocks waiting for traces from the app. Every other smoke test is one-way. The app's `waitForCondition` crash-on-timeout exists nowhere else in the smoke test suite.

The two sides had misaligned timeouts: the test waits up to 30s for traces, but the app only waits 10s for config. Under CI load, the trace writer's first flush (including feature discovery) can exceed 10s. When the app's timeout fires first, it crashes the process, killing the daemon trace-writer thread before it delivers traces, so `traceCount` stays at 0.

Aligning both timeouts to 30s ensures the test's polling condition is always the binding constraint. The app never independently crashes, the daemon writer stays alive, and traces are always delivered.
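The race between the two deadlines can be reasoned about with a small toy model. The sketch below is an illustration only, not code from the test suite: `HandshakeModel`, `simulate`, and the `Outcome` names are hypothetical. It collapses the whole handshake into three durations (the app's timeout, the test's timeout, and the time until the first trace flush arrives) and assumes that when both timeouts are equal, the test's polling condition is the one that reports the failure.

```java
import java.time.Duration;

final class HandshakeModel {
    enum Outcome { TRACES_DELIVERED, APP_TIMED_OUT_FIRST, TEST_TIMED_OUT_FIRST }

    /** Simplified model: which side gives up first, if the flush is too slow for either. */
    static Outcome simulate(Duration appTimeout, Duration testTimeout, Duration firstFlush) {
        boolean flushBeatsApp = firstFlush.compareTo(appTimeout) < 0;
        boolean flushBeatsTest = firstFlush.compareTo(testTimeout) < 0;
        if (flushBeatsApp && flushBeatsTest) {
            return Outcome.TRACES_DELIVERED;
        }
        // The shorter deadline fires first; a tie is attributed to the test (assumption).
        return appTimeout.compareTo(testTimeout) < 0
            ? Outcome.APP_TIMED_OUT_FIRST
            : Outcome.TEST_TIMED_OUT_FIRST;
    }

    public static void main(String[] args) {
        Duration test = Duration.ofSeconds(30);
        // Old app timeout (10s) with a flush just under and just over it.
        System.out.println(simulate(Duration.ofSeconds(10), test, Duration.ofMillis(9_500)));
        System.out.println(simulate(Duration.ofSeconds(10), test, Duration.ofMillis(10_500)));
        // New app timeout (30s) with the same slow flush.
        System.out.println(simulate(Duration.ofSeconds(30), test, Duration.ofMillis(10_500)));
    }
}
```

In this model, the only way to reach `APP_TIMED_OUT_FIRST` is for the app's timeout to be strictly shorter than the test's, which is the misalignment this PR removes.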
Observed across all JDK versions (8, 11, 17, 21, 25): 17 failures in the last 30 days. The `@Flaky` annotation only covered IBM 8 and Oracle JDK 8.

Reproduction
Simulated slow trace delivery locally by increasing `dd.trace.flush.interval`:
- flush=9.5s
- flush=10.5s

Additional Notes
`@Flaky` annotation is kept as a safety net until CI confirms the fix.
`pup cicd tests search --query 'test_level:test @git.branch:master @test.name:"check raw file injection" @test.final_status:fail' --from 30d`
Jira ticket: N/A (test infrastructure fix)
🤖 Generated with Claude Code