detect secrets embedded inside larger tokens by dgageot · Pull Request #2582 · docker/docker-agent · GitHub

dgageot · 2026-04-28T17:09:50Z

secretsscan.Redact / ContainsSecrets used to require a word boundary
around every secret: detection only fired when the recognisable token
sat next to whitespace, punctuation, or the start/end of the input.
That meant values pasted into a larger token leaked through, even when
the prefix and length were a perfect match:

input shape	before	after
`KEY= ghp_…` (with space)	✅	✅
`KEY=ghp_…` (no space)	✅	✅
`or ghp_…`	✅	✅
`BEFOREghp_…AFTER`	❌	✅
`…AKIA…EXAMPLEAFTER`	❌	✅

The fix drops the leading [^0-9a-zA-Z]|^ anchor (withoutWordPrefix
/ startWord) and the trailing [.,]?(\s+|$) anchor (endSecret)
from the rule expressions, plus the equivalent inline anchors on the
alibaba-access-key-id rule. Each rule's payload is already tightly
constrained (fixed-length character classes, explicit token shapes),
so removing the boundary check doesn't broaden the regex enough to
trigger false positives in practice.

Performance is unchanged in shape: the keyword pre-filter still skips
the regex hot path for typical inputs, and Go's RE2-based engine
keeps detection at O(len(text) · len(rules)). The clean-input and
with-secret benchmarks show the same allocation profile (1 / 4
allocs per op) as before.

Two new tests pin the behaviour:

TestRedactDetectsSecretsAcrossWordBoundaries covers GitHub PAT,
AWS access key, and Docker PAT in 12 boundary shapes (alone,
leading alphanumerics, trailing alphanumerics, fully embedded,
mid-KEY=…, …).
TestRedactScalesLinearly is a guard-rail that fails if a future
change reintroduces super-linear behaviour: doubling the input ~16×
must not balloon wall time by more than 128×, well below the ~256×
a true O(n²) regression would produce.

detect secrets embedded inside larger tokens

38bfdc6

dgageot requested a review from a team as a code owner April 28, 2026 17:09

aheritier approved these changes Apr 28, 2026

View reviewed changes

dgageot merged commit 291c33b into docker:main Apr 28, 2026
9 checks passed

BrewTestBot mentioned this pull request Apr 29, 2026

docker-agent 1.54.0 Homebrew/homebrew-core#280008

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

detect secrets embedded inside larger tokens#2582

detect secrets embedded inside larger tokens#2582
dgageot merged 1 commit intodocker:mainfrom
dgageot:board/improving-redact-secrets-for-non-word-bo-98e8d72c

dgageot commented Apr 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dgageot commented Apr 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants