feat(authz): introduce conditional access control via CEL by matheuscscp · Pull Request #4040 · project-zot/zot

matheuscscp · 2026-05-06T14:56:32Z

Closes: #4039

Supersedes: #4036

codecov · 2026-05-06T21:33:01Z

Codecov Report

❌ Patch coverage is 96.11307% with 11 lines in your changes missing coverage. Please review.
✅ Project coverage is 91.68%. Comparing base (ddb6279) to head (bd53acb).

Files with missing lines	Patch %	Lines
pkg/api/controller.go	37.50%	2 Missing and 3 partials ⚠️
pkg/api/authz.go	99.14%	1 Missing and 1 partial ⚠️
pkg/api/config/config.go	83.33%	1 Missing and 1 partial ⚠️
pkg/common/http_server.go	71.42%	1 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #4040      +/-   ##
==========================================
+ Coverage   91.66%   91.68%   +0.02%     
==========================================
  Files         199      199              
  Lines       28602    28828     +226     
==========================================
+ Hits        26217    26432     +215     
- Misses       1535     1540       +5     
- Partials      850      856       +6

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

andaaron

This is a very interesting proposal.

Copilot

Pull request overview

This PR adds conditional access control to Zot’s accessControl policies by introducing CEL-based conditions that must evaluate to true for a policy entry to grant access, with request/user/TLS/network/claims context exposed via a req input object.

Changes:

Extend access-control policy model to include conditions (CEL expressions + operator message) and compile them at config-load/startup/hot-reload time.
Evaluate policy conditions during authorization and surface an operator-provided deny reason in 403 responses when a condition explicitly evaluates to false.
Expose authn-time attributes (OIDC token claims) to authz-time conditions via req.claims, and document the new feature with examples.

Reviewed changes

Copilot reviewed 14 out of 14 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
pkg/requestcontext/user_access_control.go	Adds claim storage on request context and makes glob-pattern storage more robust to setter ordering.
pkg/common/http_server.go	Adds `AuthzFailWithReason` to include a deny reason in DENIED error details.
pkg/cli/server/root.go	Validates config by compiling access-control policy conditions at load time.
pkg/cli/server/root_test.go	Adds config-load tests ensuring policy conditions decode and invalid CEL fails load.
pkg/cel/expression.go	Adds option to declare `map<string, dyn>` CEL variables for richer runtime typing.
pkg/cel/claim_processor.go	Carries raw OIDC claim set through claim processing for later authz use.
pkg/api/controller.go	Stores compiled condition programs in the controller and refreshes them on hot reload.
pkg/api/config/config.go	Extends access-control policy schema with `Conditions` and documents the `req.*` inputs.
pkg/api/authz.go	Implements condition compilation, lookup, evaluation, and deny-reason propagation; wires middleware to use it.
pkg/api/authz_internal_test.go	Adds focused unit tests covering condition semantics, request-field exposure, and reload recompilation.
pkg/api/authn.go	Plumbs OIDC bearer auth results (including claims) into request context and updates AccessController construction.
examples/README.md	Documents conditional policy configuration, available `req.*` fields, and deny-message behavior.
examples/config-policy.json	Adds an example `conditions` configuration under a repository policy.
errors/errors.go	Adds `ErrPolicyConditionNotCompiled` for fail-closed lookups.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

matheuscscp · 2026-05-08T14:32:06Z

@andaaron Fixed your comments here, thanks! Can you pls trigger the CI? 🙏

rchincha · 2026-05-08T16:10:43Z

#4045
^ for the binary size increase failure

andaaron · 2026-05-08T16:38:07Z

@andaaron Fixed your comments here, thanks! Can you pls trigger the CI? 🙏

Can you please rebase? Last commit on main should be fixing the binary size increase.

matheuscscp · 2026-05-08T16:47:58Z

Done 🙏

matheuscscp · 2026-05-08T17:10:48Z

Just pushed a commit to attempt fixing the binary size increase 🙏

matheuscscp · 2026-05-08T17:40:17Z

Looks like the binary size increase issue is now fixed, only the bats flake failed this time, can you pls retrigger it? 🙏

https://github.com/project-zot/zot/actions/runs/25569525149/job/75061677582?pr=4040

Copilot

Pull request overview

Copilot reviewed 15 out of 15 changed files in this pull request and generated 3 comments.

rchincha · 2026-05-08T20:34:13Z

Just a note that we are really betting that CEL is safe and not subject to CEL-injection attacks etc.

"expression": "req.time < timestamp("2099-12-31T23:59:59Z")",

rchincha · 2026-05-08T20:35:10Z

@matheuscscp pls also take a look at copilot reviews.

matheuscscp · 2026-05-08T21:06:05Z

Just a note that we are really betting that CEL is safe and not subject to CEL-injection attacks etc.

Yeah it's pretty hermetic, we use it in Flux in at least 3 different features and controllers, they run on very privileged Flux controller pods in Kubernetes. The Kubernetes API Server also runs CEL for several built-in features, including CRD custom validation and the ValidatingAdmissionPolicy API. These were probably influenced from Google Cloud IAM security features. CEL is often used for security features, you can see by the influence chain.

@matheuscscp pls also take a look at copilot reviews.

👍 Looking right now

rchincha · 2026-05-08T21:38:18Z

@matheuscscp pls also take a look at copilot reviews.

👍 Looking right now

A few outstanding and they look relevant. Will wait for those to be addressed. Otherwise lgtm.

matheuscscp · 2026-05-08T21:41:38Z

@matheuscscp pls also take a look at copilot reviews.

👍 Looking right now

A few outstanding and they look relevant. Will wait for those to be addressed. Otherwise lgtm.

👍 I'm fixing one and replying to the other. I also replied to the first one earlier.

Copilot

Pull request overview

Copilot reviewed 15 out of 15 changed files in this pull request and generated 2 comments.

Signed-off-by: Matheus Pimenta <matheuscscp@gmail.com>

matheuscscp · 2026-05-08T22:03:59Z

Hey @andaaron sorry, I think I may have mislclicked something here:

@rchincha @andaaron All copilot comments addressed. Among the last two, one was repeated from the first three (the one I implemented). For the other two comments from the first batch and the other one from the second batch I just replied, they are not applicable.

Pls approve CI 🙏

rchincha · 2026-05-09T03:21:52Z

Potential risks I see in this change:

/v2/_catalog may over-list repositories
Conditional policies are intentionally included during glob-time filtering without evaluating conditions, so users may see repos in _catalog that they still cannot access later. That can confuse clients/automation that interpret catalog visibility as effective access.

403 responses now expose operator-authored deny reasons
Conditional denies surface conditions[].message in the response body under detail.reason. This is useful, but it can leak internal policy logic, naming conventions, trusted proxy assumptions, or other sensitive details if operators write overly specific messages.

Authz now depends on runtime CEL evaluation
Access decisions now rely on compiled-condition lookup plus runtime evaluation against request data. The code fails closed, which is good for security, but increases the chance of operational regressions where config/cache/type mismatches result in unexpected denies.

req.claims increases authn/authz coupling
Policies can now depend on OIDC claims. That’s powerful, but claim shape and typing can vary across providers/environments, making policies more fragile and less portable.

Dynamic typing may cause production-only failures
Because CEL input is dyn-typed, some mistakes won’t be caught at compile time and may only appear at runtime when a field is missing or differently typed.

Admin policy behavior changes
Admin access can now be conditionally denied. Misconfigured admin conditions could unintentionally block intended admin flows.

Hot reload/cache correctness is important
Since compiled conditions are cached and refreshed on reload, any sequencing/staleness issue could lead to stale or missing programs and fail-closed authorization behavior.

Overall I’d call this medium risk: strong feature addition, but with meaningful behavior changes around authz, visibility semantics, and denial surfacing.

rchincha · 2026-05-09T03:23:52Z

Security-focused review:

Overall this is a positive hardening feature — conditional authz via CEL, load-time compilation, and fail-closed behavior are all good directions. My main security concerns are around disclosure and operator foot-guns:

Conditional deny messages are now client-visible.
conditions[].message is surfaced in 403 response details. That is useful for UX, but it also creates a disclosure channel for internal policy logic, naming conventions, network trust assumptions, or identity expectations if operators write detailed messages. I’d recommend treating message as public-facing text in docs, or making client exposure optional.

/v2/_catalog may now leak repo existence.
Conditional policies are intentionally included optimistically during glob-time filtering, so _catalog may list repos that the caller still cannot actually access later. Even if content access is denied, repo enumeration/naming can itself be sensitive. This should be documented clearly as a security/privacy tradeoff.

req.client.forwardedFor is an operator foot-gun.
The docs correctly say it is untrusted, but exposing it directly in policy conditions makes it easy to write unsafe rules unless req.client.ip is also checked as a trusted proxy. If possible, I’d add stronger guidance/examples for safe vs unsafe usage, or even validation/linting for conditions that reference forwarded-for without a trusted-peer check.

req.claims increases authz power, but also trust/type ambiguity.
Claim-based authorization depends heavily on issuer semantics and claim shape stability across environments/providers. Missing or differently typed claims appear to fail closed, which is good, but this still deserves explicit documentation so operators don’t over-trust claim contents.

Admin policy conditions are high impact.
Allowing admin grants to be conditionally denied is useful, but misconfiguration here can lock out intended recovery/admin paths. I’d suggest calling that out explicitly in docs/release notes.

From a pure security standpoint I like the overall fail-closed model and the fact that internal CEL lookup/eval failures are not leaked to clients. My main ask is stronger guardrails/documentation around what is now intentionally exposed and what operators should treat as untrusted input.

rchincha · 2026-05-09T03:25:20Z

| /v2/_catalog may over-list repositories

This is worrisome. Also, does this happen if there is no CEL in the config?

andaaron · 2026-05-09T06:02:39Z

Note on /v2/_catalog with conditional policies:

Pattern/user/group “read” semantics are unchanged — _catalog is still filtered by the same repo pattern + action checks as before.
CEL conditions act as an additional per-request gate, but they generally cannot be enforced meaningfully at _catalog time because the evaluation inputs for _catalog differ from real repo operations: there is no tag/digest reference (so req.reference* is empty) and many conditions are only well-defined when authorizing a concrete HEAD/GET/PUT against a specific repo+reference.

Result: _catalog may over-list repositories that will later be denied by conditions (i.e., potential repo-name disclosure depending on how conditions are used).

WRT Authz depending on runtime CEL evaluation - this is true, but it is a decision of the server admin, if he wants to use this feature, he needs to properly test the expressions he configures.

matheuscscp · 2026-05-09T08:23:51Z

| /v2/_catalog may over-list repositories

This is worrisome.

I had long discussions about this with Opus 4.7, and Copilot posted two comments about it. I think the option I chose is the best trade off considering UX, performance, implementation complexity and security. See the summary from Opus 4.7 below. Essentially, users will only see repos but will not have access to them. This is much better than not seeing repos they do have access to. (Exactly what Andrei posted above!)

Also, does this happen if there is no CEL in the config?

Not at all. This is only a thing when using conditions. The current behavior is 100% unchanged and conditions are 100% optional!

Again, no need to worry about CEL, they are safe and designed for security features like this by Google! Google, Kubernetes and Flux deeply trust CEL for features like this.

Summary from Opus 4.7 about the discussion.

● Setup: alice's policy is users:[alice], actions:[read],
conditions:[req.repository.startsWith('prod/')] matching pattern **. The registry has prod/api,
prod/db, staging/web. Alice should only see/pull prod/*.

When alice hits /v2/_catalog, the server has to filter the listing. Three ways to do it:

A — evaluate per-repo at listing time. For each repo, build a fresh evalRequest with that repo
name, evaluate alice's conditions, include if permitted.

alice sees exactly prod/api, prod/db. Never staging/web.

Cost: O(repos × policies × conditions) per /v2/_catalog call. For a registry with thousands of
repos this becomes a real latency problem.

B — skip conditions at listing time, include optimistically (what zot does). Alice's policy matches
on user+action so all three repos are candidate-included; conditions are not evaluated.

alice sees prod/api, prod/db, staging/web in the catalog.

When she runs pull staging/web → 403 with the operator's deny message ("only prod allowed" or
whatever).

Real enforcement happens in ac.can per request, against the real repo/reference. The catalog is
just a hint.

C — evaluate at listing time with empty placeholders. Set req.repository = "", evaluate
''.startsWith('prod/') → false. All alice's repos hidden.

alice sees an empty catalog even though pull prod/api would succeed.

Silent under-listing. No 403, no message — alice probably concludes she has no access at all and
gives up.

Trade-off:

A is correct but expensive. Acceptable for small deploys, painful at scale.

B is the chosen approach: over-list at worst, never under-list. Every actual operation is
correctly authorized; the catalog is approximate.

C is broken — it actively hides accessible repos with zero feedback.

The two copilot comments worry about A→B (you lost precision) and B→C (you're underspecifying
conditions). The answer to both is the same: per-request enforcement is the source of truth; the
catalog is an optimization, and the failure mode that's been chosen is "alice sometimes sees a repo
she can't actually pull, and gets a clear 403 when she tries" rather than "alice silently can't
see things she can pull".

matheuscscp requested review from andaaron and rchincha as code owners May 6, 2026 14:56

matheuscscp mentioned this pull request May 6, 2026

Introduce expiration field for accessConfig policies #4036

Closed

matheuscscp force-pushed the conditional-access-control branch from 98d31e3 to 8e79bdb Compare May 7, 2026 08:20

matheuscscp changed the title ~~Introduce conditional access control~~ feat(authz): introduce conditional access control via CEL May 7, 2026

rchincha added this to the v2.1.17 milestone May 8, 2026

andaaron reviewed May 8, 2026

View reviewed changes

Comment thread pkg/api/controller.go Outdated

Comment thread pkg/api/authz.go Outdated

andaaron requested a review from Copilot May 8, 2026 07:58

Copilot started reviewing on behalf of andaaron May 8, 2026 07:59 View session

Copilot AI reviewed May 8, 2026

View reviewed changes

Comment thread pkg/api/authz.go

Comment thread pkg/api/authz.go

matheuscscp force-pushed the conditional-access-control branch 3 times, most recently from 1a92bbe to 92ecdc0 Compare May 8, 2026 14:31

matheuscscp force-pushed the conditional-access-control branch from 92ecdc0 to 5ee9f1f Compare May 8, 2026 16:47

matheuscscp force-pushed the conditional-access-control branch from 7984dbd to cc4d996 Compare May 8, 2026 17:24

andaaron requested a review from Copilot May 8, 2026 19:59

andaaron previously approved these changes May 8, 2026

View reviewed changes

Copilot started reviewing on behalf of andaaron May 8, 2026 20:00 View session

Copilot AI reviewed May 8, 2026

View reviewed changes

Comment thread pkg/api/authz.go

Comment thread pkg/api/authz.go Outdated

Comment thread pkg/api/config/config.go

rchincha requested a review from Copilot May 8, 2026 21:38

Copilot started reviewing on behalf of rchincha May 8, 2026 21:38 View session

Copilot AI reviewed May 8, 2026

View reviewed changes

Comment thread pkg/api/authz.go

Comment thread pkg/api/authz_internal_test.go

feat(authz): introduce conditional access control via CEL

bd53acb

Signed-off-by: Matheus Pimenta <matheuscscp@gmail.com>

matheuscscp dismissed andaaron’s stale review via bd53acb May 8, 2026 22:00

matheuscscp force-pushed the conditional-access-control branch from cc4d996 to bd53acb Compare May 8, 2026 22:00

rchincha requested a review from andaaron May 8, 2026 22:41

andaaron approved these changes May 9, 2026

View reviewed changes

Conversation

matheuscscp commented May 6, 2026

Uh oh!

codecov Bot commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

andaaron left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

matheuscscp commented May 8, 2026

Uh oh!

rchincha commented May 8, 2026

Uh oh!

andaaron commented May 8, 2026

Uh oh!

matheuscscp commented May 8, 2026

Uh oh!

matheuscscp commented May 8, 2026

Uh oh!

matheuscscp commented May 8, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rchincha commented May 8, 2026

Uh oh!

rchincha commented May 8, 2026

Uh oh!

matheuscscp commented May 8, 2026

Uh oh!

rchincha commented May 8, 2026

Uh oh!

matheuscscp commented May 8, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

matheuscscp commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rchincha commented May 9, 2026

Uh oh!

rchincha commented May 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rchincha commented May 9, 2026

Uh oh!

andaaron commented May 9, 2026

Uh oh!

matheuscscp commented May 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

codecov Bot commented May 6, 2026 •

edited

Loading

matheuscscp commented May 8, 2026 •

edited

Loading

rchincha commented May 9, 2026 •

edited

Loading

matheuscscp commented May 9, 2026 •

edited

Loading