python: enable summaries from model #12581

yoff · 2023-03-20T07:56:44Z

This requires a change to the shared interface: making `getNodeFromPath` public.

This is because Python is doing its own thing when identifying call-backs.

I am unsure if this constitutes a feature yet, or if we should add a CSV parser first?

RasmusWL

Nice 💪

Although a bit of a nitpick, can we please change Foo in the package name to foo? -- it just stands out as very non-standard.

I think it's a shame that the data-flow tests and taint-tracking tests are almost identical.

For taint-tracking we've mostly used InlineTaintTest.qll, so maybe we can use that alongside the data-flow test, and only have a single TestSummaries.qll file?

RasmusWL · 2023-06-20T11:58:52Z

I looked a bit at where the existing NormalTaintTest was used, and the only case is https://github.com/github/codeql/blob/11c89adbe3b238e3a142256175f0003e19d7972b/python/ql/test/experimental/dataflow/summaries/summaries.py -- I think that could also benefit from being rewritten to have BOTH dataflow and taint-tracking tests, instead of only having taint tests.

yoff · 2023-06-20T14:14:46Z

I tried putting it in one file now. I agree we should consolidate all our summary tests at some point.

RasmusWL

Besides the two code comments, I also have a stylistic recommendation for your Python code.

- Put a trailing comma after the last argument (this makes the diff for adding a new argument in the future a bit cleaner).
- Do not indent the closing parenthesis.

Both follow the black formatter (although it would try to put both arguments to `ensure_tainted` on one line 😮‍💨).

Specifically, it would change your code like this:

 ensure_tainted(
     tainted_list_el,  # $ tainted
-    tainted_list_el[0]  # $ tainted
-    )
+    tainted_list_el[0],  # $ tainted
+)
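As a minimal sketch of the style suggested above, the snippet below writes a multi-line call with a trailing comma after the last argument and an unindented closing parenthesis. The `ensure_tainted` defined here is a hypothetical stand-in so the block runs on its own (the real helper belongs to the CodeQL test utilities and is not reproduced), and `tainted_list_el` is given a dummy value.

```python
def ensure_tainted(*args):
    """Hypothetical stand-in for the test helper: returns how many values it received."""
    return len(args)


# Dummy value; in the real test this would come from a modelled call.
tainted_list_el = ["user input"]

# Trailing comma after the last argument, closing parenthesis at the
# indentation of the line that opened the call.
checked = ensure_tainted(
    tainted_list_el,  # $ tainted
    tainted_list_el[0],  # $ tainted
)
```

With this layout, adding a third argument later only touches one new line in the diff.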

RasmusWL · 2023-06-21T08:35:14Z

python/ql/test/experimental/dataflow/model-summaries/model_summaries.py

+
+tainted = MS_identity(TAINTED_STRING)
+ensure_tainted(tainted) # $ tainted


Since you have just shown that we have data-flow, and we know all dataflow steps are also taint-flow steps, I think it's fine to only check taint-flow for the cases where there is NOT dataflow.

I think that will make the test-file a bit easier to read as well.
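The reasoning can be sketched with a toy model. This is plain Python, not CodeQL, and the flow pairs and the name `first_element` are made up for illustration: if every value-preserving flow is also a taint flow, a separate taint check only adds information where no value flow exists.

```python
# Toy model, not CodeQL: each flow is a (source, sink) pair of illustrative names.
VALUE_FLOWS = {("TAINTED_STRING", "identity_result")}
TAINT_ONLY_FLOWS = {("TAINTED_LIST", "first_element")}

# Every value-preserving flow is also a taint flow, so taint is the union.
TAINT_FLOWS = VALUE_FLOWS | TAINT_ONLY_FLOWS


def needs_taint_check(flow):
    """Return True only when taint is the strongest result we can assert."""
    return flow in TAINT_FLOWS and flow not in VALUE_FLOWS


# Map each known flow to whether a dedicated taint assertion is worthwhile.
checks = {flow: needs_taint_check(flow) for flow in sorted(TAINT_FLOWS)}
```

Under these assumptions, the identity case needs only the data-flow assertion, while the list case keeps its taint assertion.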

Good point. I tried to make it more consistent now by not checking taint when dataflow is already established and when we do check taint, check both the collection and the expected element.

Thinking some more about this, I think these tests should be about whether you can write flow summaries in CSV files that do the right thing.

Having all these extra taint steps is closer to what our current taint-tracking does in the end, but from my point of view, it doesn't help us achieve the goal of the tests.

I removed the TAINTED_LIST parts locally, but thought it was a bit too controversial to just commit directly to your PR -- instead I've put the commits here: yoff#79 (if you agree, we can add these to main later on 👍)
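To make "flow summaries in CSV files" concrete, here is a hedged sketch of what reading one such row could look like. It assumes a five-column, semicolon-separated layout (type, path, input, output, kind); that layout, the example row, and the names `SummaryRow` and `parse_summary_row` are assumptions for illustration, not code or data taken from this PR.

```python
from typing import NamedTuple


class SummaryRow(NamedTuple):
    """One flow-summary row, under the assumed five-column layout."""

    type_name: str
    path: str
    input_spec: str
    output_spec: str
    kind: str


def parse_summary_row(row: str) -> SummaryRow:
    """Split a semicolon-separated row and check that it has five columns."""
    fields = [field.strip() for field in row.split(";")]
    if len(fields) != 5:
        raise ValueError(f"expected 5 columns, got {len(fields)}: {row!r}")
    return SummaryRow(*fields)


# Illustrative row: the first argument flows to the return value, value-preserving.
row = parse_summary_row("foo;Member[MS_identity];Argument[0];ReturnValue;value")
```

A test focused on the CSV mechanism would then assert only the flow that such a row describes, rather than any extra taint steps the analysis adds on its own.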

asgerf

Minor comment, otherwise LGTM.

javascript/ql/lib/semmle/javascript/frameworks/data/internal/ApiGraphModels.qll

ruby/ql/lib/codeql/ruby/frameworks/data/internal/ApiGraphModels.qll

python/ql/lib/semmle/python/frameworks/data/internal/ApiGraphModels.qll

asgerf

LGTM once the tests pass.

There seem to be some test expectations that need updating, probably due to some of the recent changes to inline test expectations.

RasmusWL

Approved for now, tests can be adjusted later 👍

github-actions bot added Python JS Ruby labels Mar 20, 2023

yoff force-pushed the python/enable-summaries-from-models branch from c3e6819 to efa36d2 Compare March 24, 2023 08:53

yoff added 4 commits June 18, 2023 21:52

python: enable summaries from model

18f4b75

This requires a change to the shared interface: Making `getNodeFromPath` public. This because Python is doing its own thing and identifying call-backs.

Py/js/ruby: sync files

3cf9e3e

python: add test for model summaries

6554e80

(but no summaries yet)

python: rename summaries

2296410

yoff force-pushed the python/enable-summaries-from-models branch from efa36d2 to 2296410 Compare June 18, 2023 20:02

yoff added 2 commits June 19, 2023 11:41

python: remove erronous getACall()

eb3c33d

`base` is already the `CallNode` we want.

python: split tests into taint and value

e111a19

and add summaries

yoff marked this pull request as ready for review June 20, 2023 08:52

yoff requested review from a team as code owners June 20, 2023 08:52

python: add changenote

5ceac5a

github-actions bot added the documentation label Jun 20, 2023

RasmusWL requested changes Jun 20, 2023

python: consolidate tests

cb2de69

also change `Foo` -> `foo`

yoff requested a review from RasmusWL June 20, 2023 14:14

calumgrant requested a review from asgerf June 21, 2023 08:36

RasmusWL requested changes Jun 21, 2023

yoff and others added 2 commits June 22, 2023 11:31

Update python/ql/test/experimental/dataflow/model-summaries/model_sum…

0f8ebd1

…maries.py Co-authored-by: Rasmus Wriedt Larsen <[email protected]>

python: more consistent tests

2264b11

- do not test taint flow when dataflow is established
- test taint of both the collection and the expected element

yoff requested a review from RasmusWL June 22, 2023 09:54

python: format

86dfc7b

asgerf reviewed Jun 23, 2023

javascript/ql/lib/semmle/javascript/frameworks/data/internal/ApiGraphModels.qll

ruby/ql/lib/codeql/ruby/frameworks/data/internal/ApiGraphModels.qll

yoff commented Jun 23, 2023

python/ql/lib/semmle/python/frameworks/data/internal/ApiGraphModels.qll

Apply suggestions from code review

26856a8

Co-authored-by: Asger F <[email protected]>

yoff requested a review from asgerf June 23, 2023 08:15

asgerf reviewed Jun 26, 2023

RasmusWL added 3 commits June 26, 2023 11:34

Merge branch 'main' into python/enable-summaries-from-models

0121263

Python: Updates from inline test being parameterized

6cb0319

Python: Remove one more unnecessary taint test

257f991

asgerf approved these changes Jun 26, 2023

RasmusWL approved these changes Jun 26, 2023

RasmusWL merged commit 9c5aff3 into github:main Jun 26, 2023
