Fix `find_publisher_by_issuer` environment filter #13566

di · 2023-05-03T04:41:57Z

Previously, we weren't adequately filtering on environment in `find_publisher_by_issuer`, so the function raised a `MultipleResultsFound` exception whenever more than one configured publisher matched the claim set.

Instead, the behavior should be as follows:

- If no environment claim is present:
  - If there's a publisher that doesn't restrict on environment, return it
  - Otherwise, return None
- If an environment claim is present:
  - If there's a publisher that restricts on this environment, return it
  - If there's a publisher that doesn't restrict on environment, return it
  - Otherwise, return None
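
As a rough illustration of the rules above, here is a minimal in-memory sketch. It is not the Warehouse implementation: the `Publisher` dataclass and `find_publisher` helper are hypothetical names, and the list of publishers is assumed to be already filtered on every other claim (repository, owner, workflow).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Publisher:
    # Hypothetical stand-in for a trusted publisher record.
    workflow: str
    environment: Optional[str] = None  # None means "no environment restriction"


def find_publisher(publishers, claims):
    """Return the best-matching publisher for the claims, or None."""
    env = claims.get("environment")
    if env is not None:
        # Prefer a publisher that restricts on exactly this environment.
        for publisher in publishers:
            if publisher.environment == env:
                return publisher
    # Otherwise fall back to a publisher with no environment restriction.
    for publisher in publishers:
        if publisher.environment is None:
            return publisher
    return None
```

With one publisher restricted to `"prod"` and one unrestricted publisher, a claim set carrying `environment="prod"` selects the restricted one, while a claim set with no environment (or a different one) selects the unrestricted one.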

Fixes https://python-software-foundation.sentry.io/issues/4150013748/.

tests/unit/oidc/test_utils.py

miketheman · 2023-05-03T13:40:06Z

warehouse/oidc/utils.py

+                    repository_name=repository_name,
+                    repository_owner=repository_owner,
+                    repository_owner_id=signed_claims["repository_owner_id"],
+                    environment=signed_claims.get("environment"),


question: It looks like signed_claims could come in without an environment key.

By my reading, the first query in the try block would run a `filter_by(..., environment=None)` and try to get `one()`, and if nothing comes back, issue a second identical query with `one_or_none()`.

My question is:

If there's 2 claim records, one with an environment, and one without, the first query will get the env-specific one. Yay, that's the intent.

If there's 1 claim record, with no environment, why would the second query get run? Wouldn't the filter_by condition apply just the same?

> question: It looks like signed_claims could come in without an environment key.

Yep, that's correct: GitHub's OIDC JWTs don't contain an environment claim if one isn't explicitly configured.

I think the confusing bit here is the possible states:

1. OIDC token with an environment
2. OIDC token without an environment
3. Trusted publisher with an environment configured
4. Trusted publisher without an environment configured

State (3) must only match state (1), while state (4) can match either (1) or (2). So we need to explicitly carve out an environment=None case.
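
The compatibility rule between these states can be written as a small predicate. This is only a sketch: `publisher_accepts` is a hypothetical helper, `None` is assumed to stand for "no environment", and a restricted publisher is assumed to require an exact environment match.

```python
from typing import Optional


def publisher_accepts(publisher_env: Optional[str], token_env: Optional[str]) -> bool:
    """True if a publisher with publisher_env may match a token with token_env."""
    if publisher_env is None:
        # State (4): an unrestricted publisher matches tokens in state (1) or (2).
        return True
    # State (3): a restricted publisher only matches a token in state (1)
    # that carries the same environment.
    return token_env == publisher_env
```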

(This would have been nicer if we'd added the environment claim without broadening the uniqueness constraint to include it, but that would have made it impossible to register separate trusted publishers for different environments under the same workflow...)

Oh, but I see what you mean -- I think the queries are slightly off here: the first should ensure that signed_claims["environment"] is actually present, while the second should remain explicitly environment=None. That would make the states clearer.

> If there's 1 claim record, with no environment, why would the second query get run? Wouldn't the filter_by condition apply just the same?

The second query is to capture the case where the signed claims have an environment key, but a publisher is configured with environment=None -- essentially, we can always fall back on this "wildcard" publisher no matter what, if it's present, if there isn't a publisher configured with a matching environment.

I think we need two queries regardless: one to check if there's a publisher that matches the environment, and one to check for a "wildcard" publisher. Using signed_claims.get("environment") in the first query here allows us to satisfy the first case and also short-circuit and only run one query when there is no environment in the signed claims at all. If we didn't do that, we'd have to add some more branching here, like:

I agree this is a little confusing though, let me see if I can make this more clear with some conditionals instead.

Co-authored-by: Mike Fiedler <[email protected]>

woodruffw

LGTM! I think this is more comprehensible, even if we miss a short-circuiting optimization 🙂

miketheman

Thanks for making the logic simpler, based on the environment existing.
Now we should only ever make a single DB call for the publisher.

I'm betting this query could be refactored a little more to construct the query body, and conditionally add the specifics to switch the environment flag, but that's fine to defer to another time.
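
One possible shape for that refactor, sketched with the standard-library `sqlite3` module instead of the ORM calls discussed above so that it stays self-contained. The `publishers` table, its columns, and `find_publisher_row` are all hypothetical; the point is only that the shared query body is built once and the environment condition is appended conditionally, so a single query is issued either way.

```python
import sqlite3


def find_publisher_row(conn, repository, claims):
    """Look up a publisher with one query, preferring an environment match."""
    # Shared query body; a real lookup would filter on more claims than this.
    sql = "SELECT id, environment FROM publishers WHERE repository = ?"
    params = [repository]
    env = claims.get("environment")
    if env is None:
        # No environment claim: only unrestricted publishers may match.
        sql += " AND environment IS NULL"
    else:
        # Environment claim: accept an exact match or an unrestricted
        # publisher, sorting the exact (non-NULL) match first.
        sql += " AND (environment = ? OR environment IS NULL)"
        sql += " ORDER BY environment IS NULL"
        params.append(env)
    sql += " LIMIT 1"
    return conn.execute(sql, params).fetchone()
```

The `ORDER BY environment IS NULL` clause relies on the comparison evaluating to 0 for the environment-specific row and 1 for the unrestricted row, so the specific publisher wins when both exist.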

di added 3 commits May 3, 2023 04:14

Make the tests less mocked

2af3e01

Add some failing tests

8a12190

Conditionally filter publisher search on environment

8133f9c

di requested a review from a team as a code owner May 3, 2023 04:41

Fix additional tests

49e5454

miketheman reviewed May 3, 2023

di and others added 3 commits May 3, 2023 11:37

Apply suggestions from code review

bf09d6c

Co-authored-by: Mike Fiedler <[email protected]>

Clarify with conditionals

9cf3bad

Linting

36125d6

woodruffw approved these changes May 3, 2023

woodruffw mentioned this pull request May 3, 2023

oidc: Google JWKS/services scaffolding #13569

Merged

miketheman approved these changes May 3, 2023

di merged commit 4e91963 into pypi:main May 3, 2023

di deleted the fix-oidc-environment-filter branch May 3, 2023 17:11

di mentioned this pull request May 4, 2023

[oidc] Normalize GitHub environment claim #13576

Merged

woodruffw added the trusted-publishing label May 12, 2023
