ATM: Remove redundant code #11321

tiferet · 2022-11-17T19:33:40Z

This PR is the code cleanup made possible by the previous few PRs. As such, it touches a lot of lines (mostly code deletion 🥳), but the changes are conceptually small. I tried to make the commits as clear as possible, tackling one piece of complexity reduction per commit and explaining it in the commit message. Commit-by-commit review warmly recommended 😄

Note that this PR is based on the branch tiferet/endpoint-filters, because that branch has not yet been merged and I don't want to see the commits from #11281 in this PR as well.

main code deletions / simplifications

`isOtherModeledArgument` and `isArgumentToBuiltinFunction` contained the old logic for selecting negative endpoints for training. These can now be deleted, and replaced by a single base class that collects all `EndpointCharacteristics` that are currently used to indicate negative training samples: `OtherModeledArgumentCharacteristic`. This in turn lets us delete code from `StandardEndpointFilters` that effectively said that endpoints that are high-confidence non-sinks shouldn't be scored at inference time, either.
`FilteringReason` is no longer being used and can be deleted.
Delete `CoreKnowledge` and `StandardEndpointFilters`: All remaining functionality in `CoreKnowledge` and `StandardEndpointFilters` is only being used in `EndpointCharacteristics`, so it can be moved there as a small set of helper predicates.
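
For anyone unfamiliar with the pattern, here is a rough sketch of the "single base class" idea from the first item. It is written in Python rather than QL and is not the code in this PR. `Endpoint`, `ArgumentToBuiltinFunction`, `ArgumentToLoggingCall`, and the helper functions are hypothetical names invented for illustration; only `OtherModeledArgumentCharacteristic` and `EndpointCharacteristic` borrow their names from the description above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Endpoint:
    """Hypothetical stand-in for a candidate sink (not the real QL type)."""
    callee: str
    is_builtin_call: bool = False


class EndpointCharacteristic:
    """A named property that an endpoint may or may not have."""
    name = "abstract"

    def applies_to(self, endpoint: Endpoint) -> bool:
        raise NotImplementedError


class OtherModeledArgumentCharacteristic(EndpointCharacteristic):
    """Marker base class: each direct subclass indicates a negative sample."""


class ArgumentToBuiltinFunction(OtherModeledArgumentCharacteristic):
    # Hypothetical characteristic, assumed here for illustration only.
    name = "argument to built-in function"

    def applies_to(self, endpoint: Endpoint) -> bool:
        return endpoint.is_builtin_call


class ArgumentToLoggingCall(OtherModeledArgumentCharacteristic):
    # Hypothetical characteristic, assumed here for illustration only.
    name = "argument to logging call"

    def applies_to(self, endpoint: Endpoint) -> bool:
        return endpoint.callee.startswith("console.")


def negative_characteristics():
    """Instantiate every direct subclass of the marker base class."""
    return [cls() for cls in OtherModeledArgumentCharacteristic.__subclasses__()]


def is_negative_training_sample(endpoint: Endpoint) -> bool:
    """One query over the base class replaces separate ad hoc predicates."""
    return any(c.applies_to(endpoint) for c in negative_characteristics())
```

The point of the sketch is that adding a new negative indicator only means adding a subclass; nothing that consumes the base class has to change.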

Timing checks

✅ KPI timing experiment: github/codeql-dca-main#8659
☑️ The local runtime of endpoint_large_scale/ExtractEndpointDataTraining is unchanged from the last PR that affected timing: about 5s.

Closes github/ml-ql-adaptive-threat-modeling#2110

kaeluka

✅ Approve

This is the kind of PR where I'm extra glad we have CI checks: big surface, but not much new logic ;)

I'll give this a formal approval once the first PR is merged and this has been rebased; but I've done the review now and it all looks good to me. I've left one question for my own understanding.

github-actions bot added the ATM label Nov 17, 2022

tiferet marked this pull request as ready for review Nov 18, 2022

tiferet requested a review from a team as a code owner Nov 18, 2022

tiferet requested review from kaeluka and removed request for a team Nov 18, 2022

tiferet mentioned this pull request Nov 18, 2022

ATM: Implement the current endpoint filters as EndpointCharacteristics #11281

Open

tiferet force-pushed the tiferet/complexity-reduction branch from e6c76a5 to 9a37061 Compare Nov 19, 2022

tiferet added 5 commits Nov 21, 2022

FilteringReason is no longer being used and can be deleted

43f47ff

Delete CoreKnowledge.

0816343

All remaining functionality in `CoreKnowledge` is only being used in `EndpointCharacteristics`, so it can be moved there as a small set of helper predicates.

Delete StandardEndpointFilters.

1b0e8ac

All remaining functionality in `StandardEndpointFilters` is only being used in `EndpointCharacteristics`, so it can be moved there as a small set of helper predicates.

Oops -- forgot to stage one file in the previous commit :)

fac6641

tiferet force-pushed the tiferet/complexity-reduction branch from 9a37061 to fac6641 Compare Nov 21, 2022

kaeluka reviewed Nov 22, 2022

owen-mc changed the title ~~Remove redundant code~~ ATM: Remove redundant code Nov 22, 2022
