ci: add job to verify binary size #475

Open

justus-camp-microsoft wants to merge 46 commits into main

Conversation

@justus-camp-microsoft (Contributor) commented Dec 12, 2024

This PR adds a job to diff the binary sizes introduced by a change. As implemented, the action runs git merge-base to find a common ancestor with main, fetches a completed build from CI (trying up to 5 commits back in case CI hasn't completed for the commit returned by git merge-base), and outputs a diff.
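Roughly, the lookup works like the following sketch (illustrative only - the helper name and the exact gh/git invocations here are not the code in this PR):

use xshell::{cmd, Shell};

// Walk back from the merge base until a commit with a completed CI run is found.
fn find_baseline_run(sh: &Shell, max_walk_back: usize) -> anyhow::Result<String> {
    // Common ancestor between the PR head and main.
    let mut commit = cmd!(sh, "git merge-base HEAD origin/main").read()?;

    for _ in 0..max_walk_back {
        // Ask GitHub for the most recent completed run on this commit.
        let run_id = cmd!(
            sh,
            "gh run list --commit {commit} -s completed -L 1 --json databaseId --jq '.[].databaseId'"
        )
        .read()?;
        if !run_id.is_empty() {
            return Ok(run_id);
        }
        // No completed run yet for this commit; step one commit further back.
        let parent = format!("{commit}~1");
        commit = cmd!(sh, "git rev-parse {parent}").read()?;
    }

    anyhow::bail!("no completed CI run found within {max_walk_back} commits of the merge base")
}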

GitHub Actions workflows with a pull_request trigger are unable to comment on PRs. As such, this implementation fails the check if the size difference is greater than a threshold. In the case where we're OK with the size increase, my understanding is that we can force the merge without the check passing.

@jstarks (Member) commented Dec 13, 2024

How is this going to be different from #458?

@justus-camp-microsoft (Contributor, Author)

I wasn't aware of that thread. I'll look into getting a baseline from a pipeline and using it for comparison.

@justus-camp-microsoft (Contributor, Author)

I took a look at FluidFramework, which I used to work on and which has a bundle size check as part of its PR workflow. From what I can tell, their approach is to traverse HEAD~n until they find a completed build and then do a size comparison against that. Their CI has a bot that leaves a comment with the comparison, but it doesn't look like it blocks merging of a PR. What do we think about that approach?

@smalis-msft (Contributor)

Oooooh, prior art, nice.

I think the commit we want to compare against is whatever the merge is based on. That would allow us to get as good a measurement of "this PR adds X bytes compared to not having it" as possible. If that commit is still running through CI maybe we just wait for it? If it fails though then walking backwards on main/release does seem like a reasonable fallback strategy.

I think for ours we'd prefer to have a gate rather than just a comment, so long as there's some way for us to then override the block and say "yes this is acceptable". But a gate would prevent anyone from merging before the bot comments, for example. We could then have a dedicated size_override reviewers group that the gate requires sign off from to override or something.

Also, I'd like to make sure we're actually storing the whole built file that we're using to compare against, not just a pre-computed summary of it. That frees us up to do more complex and involved analysis in the future.

@smalis-msft (Contributor)

Tagging #76

@justus-camp-microsoft changed the title from "ci: add job to verify binary size" to "WIP: ci: add job to verify binary size" on Jan 2, 2025
@justus-camp-microsoft marked this pull request as ready for review on January 2, 2025 22:20
@justus-camp-microsoft requested review from a team as code owners on January 2, 2025 22:20
@smalis-msft (Contributor)

Man, this is exciting to see; the prior solution has been an annoyance for so long now.

Comment on lines +91 to +93
if total_diff > 100 {
    anyhow::bail!("{} size verification failed: The total difference ({} KiB) is greater than the allowed difference ({} KiB).", self.new.display(), total_diff, 100);
}
@smalis-msft (Contributor) commented Jan 3, 2025

We'll need some way to override this check on a PR level, some way to say "Yes this size diff is acceptable". Not sure what github allows us to do here.

Contributor Author

GitHub doesn't really have a great way to do this. Ideally we could have this always be required and just hit an "override" button but afaik that's not possible. We're also unable to assign a review team through the action (as I painfully learned from trying to re-enable the unsafe reviewers assignment) because actions are scoped to the repo level and our review teams are scoped at the org level (no access).

My thought here is that we should have this action always succeed as long as it finishes all the way through and have it leave a comment on the PR with a summary of the size diff. The onus would be on the reviewer to look at the comment and make sure that the difference is acceptable.

Contributor

If that's the best we can do then it's the best we can do I guess. Maybe include some big warning text in the comment if the diff is over a threshold.

We really should figure out some way to get review groups working though. Then we could have the unsafe reviewers group back and create a new binary size reviewer group for large diffs or something.

Contributor Author

We'll need a PAT with org-level team read access and then the reviewer assignment would work. My understanding was that we don't want to deal with maintaining the PAT.

Contributor

Can we have review teams scoped to the repo instead of the org? I'm really not familiar with GitHub, so I'm just spitballing. But yeah, maintaining a PAT has bitten us in the past, and definitely isn't ideal.

Contributor Author

Hmm, good point. I can look into that and see if that's a possibility. I hadn't thought of bringing the teams down to the repo level.

Contributor Author

Looping back around to this - a GitHub action with a pull_request target is unable to comment on PRs (similar limitation to unsafe reviewers check) and as such I think our best bet here is to fail the action if it's over a threshold. In the case where it's over the threshold and we're ok with the size increase my understanding is that we can force the merge with the failing check.

Member

I think that's OK for v1. But I think the pattern here to follow for v2 would be to create an additional workflow that depends on this one/is triggered by this one but comes from the base branch. That would allow it to safely have access to add comments, etc. I think this means using pull_request_target for that workflow.

Member

Or maybe workflow_run.

Contributor Author

My understanding here is that for dependent workflows like that we would need to use workflow_run, but that the token passed when triggered has the same permissions as the one triggering it (as in, it would get the pull_request token that doesn't have comment permissions). I could definitely be wrong here as I didn't try it.

@justus-camp-microsoft changed the title from "WIP: ci: add job to verify binary size" to "ci: add job to verify binary size" on Jan 8, 2025
all_jobs.push(job.finish());

// emit openvmm verify-size job
let job = pipeline
Member

Why not use a single job?

Contributor

Indeed.

If you actually look inside _jobs::build_and_publish_openhcl_igvm_from_recipe, you'll see that it's really just a wrapper around the core build_openhcl_igvm_from_recipe node + some wiring to build multiple IGVM files simultaneously, and then publish various artifacts.

If you just peel back a layer, and have your new _jobs::check_openvmm_hcl_size node interface with that build_openhcl_igvm_from_recipe Node directly, you can sidestep all this multi-job coordination, and just use the IGVM file you built in that job.

Plus - you wouldn't be shackled to the existing openhcl-igvm + openhcl-igvm-extras artifact structure, and could instead have a new, verify-size-specific openhcl-igvm-verify-size-baseline artifact that you can then use across jobs (and which would only contain the precise artifacts the verify-size infrastructure cares about).

@@ -1793,6 +1793,14 @@ pub mod steps {

/// `github.token`
pub const GITHUB__TOKEN: GhContextVar = GhContextVar::new_secret("github.token");

/// `github.event.pull_request.head.ref`
pub const GITHUB__HEAD_REF: GhContextVar =
Contributor

I think we want to be mindful about how we choose to expose this particular class of context variables in flowey.

All other constants defined in this list are guaranteed to be valid in any pipeline run. The same cannot be said of these new github.event.pull_request constants, which should only be used in PR-triggered workflows. i.e: it seems unwise to make it "trivial" to access these variables in the context of a CI-triggered workflow via the existing get_gh_context_var API, given that the resulting loosely-typed String could sometimes be empty.

My gut feeling is that we want to have some API that would let us model these sorts of context-dependent variables in a type-safe manner, in order to give users a way to get a ReadVar<Option<PullRequestRelatedThing>>.

Consider the following modification to the existing get_gh_context_var API:

impl NodeCtx {
    fn get_gh_context_var(&mut self) -> GhContextVarReader;
}

impl GhContextVarReader {
    fn global(&mut self, var: GhContextVar) -> ReadVar<String>;
    fn event(&mut self) -> GhContextVarReaderEvent;
}

impl GhContextVarReaderEvent {
    fn pull_request(&mut self) -> GhContextVarReaderEventPullRequest;
}

// and so on...
//
// thereby enabling:

let global: ReadVar<String> = ctx.get_gh_context_var().global(GhContextVar::RUNNER__TEMP);
let pr_specific: ReadVar<Option<String>> = ctx.get_gh_context_var().event().pull_request().head().ref();

// theoretically, with this scheme, we could switch all existing `const GhContextVar` enums to 
// just hang off a `global()` object, e.g: `global().runner().temp()`

The resulting API is very fluent for end users, and to make our lives easier as implementors, we can leverage the type-state pattern to avoid an explosion of different GhContextVarReader types, and instead, simply transition between various versions of a single backing GhContextVarReader<T> type.

With this API, the leaf-nodes (e.g: pull_request().head().ref()) can then encode any necessary flowey logic to read the raw String data, and then convert it into a Option<String> if need be.
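For illustration, here's a minimal, self-contained sketch of that type-state shape (toy stand-in types only - in flowey the real NodeCtx / ReadVar / GhContextVar would take their place):

use std::marker::PhantomData;

mod state {
    pub struct Root;
    pub struct Event;
    pub struct PullRequest;
}

// One backing type; the marker parameter controls which methods are available.
struct Reader<S> {
    path: Vec<&'static str>,
    _state: PhantomData<S>,
}

impl Reader<state::Root> {
    fn new() -> Self {
        Reader { path: vec![], _state: PhantomData }
    }

    fn event(mut self) -> Reader<state::Event> {
        self.path.push("event");
        Reader { path: self.path, _state: PhantomData }
    }
}

impl Reader<state::Event> {
    fn pull_request(mut self) -> Reader<state::PullRequest> {
        self.path.push("pull_request");
        Reader { path: self.path, _state: PhantomData }
    }
}

impl Reader<state::PullRequest> {
    // Leaf node: in flowey this would return ReadVar<Option<String>>,
    // mapping an empty string (non-PR run) to None.
    fn head_ref(mut self) -> String {
        self.path.push("head.ref");
        format!("github.{}", self.path.join("."))
    }
}

fn main() {
    // Only the transitions defined above compile, so PR-only data can't be
    // reached without explicitly going through the pull_request() step.
    assert_eq!(
        Reader::new().event().pull_request().head_ref(),
        "github.event.pull_request.head.ref"
    );
}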


I know this is all somewhat orthogonal to the problem this PR is specifically trying to solve... but we must not forget that flowey is not a framework set in stone. It still has many rough edges and core abstractions that need to be reconsidered / reworked. This sort of flowey rework can and should be done as part of whatever feature work we are doing.

In this case, I don't actually think there's too much work to re-jig the API as I suggest here - there shouldn't be any "backend" flowey work, and it'd simply be some clever refactoring of the user-facing NodeContext APIs.

That said - I would suggest you split this work out into a separate PR, and then rebase this PR on-top of that one.

Contributor

theoretically, it might even make sense to have a single pull_request() -> ReadVar<Option<GhEventPullRequest>> method, which dumps the entire github.event.pull_request variable as JSON, and then flowey uses a serde defn of the corresponding object to avoid needing to manually write out methods for each field. See https://docs.github.com/en/webhooks/webhook-events-and-payloads#pull_request

It's a "big" object, but it's not a huge object, so I think that might be a viable approach - even if the pipeline only ends up using one or two fields of the object that gets parsed in.

And if it's hard to transcribe that particular object structure in its entirety into a serde type, we can always go piecemeal (given that serde will just ignore JSON fields during deserialization that it wasn't explicitly told about).
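e.g., the piecemeal version could look something like this (field names mirror the linked webhook payload docs; the structs are illustrative, not existing flowey code):

use serde::Deserialize;

// Only the fields the pipeline actually needs; serde ignores the rest of the
// (large) event payload by default.
#[derive(Debug, Deserialize)]
struct GhEventPullRequest {
    number: u64,
    head: GhEventPullRequestRef,
    base: GhEventPullRequestRef,
}

#[derive(Debug, Deserialize)]
struct GhEventPullRequestRef {
    // `ref` is a Rust keyword, hence the rename.
    #[serde(rename = "ref")]
    git_ref: String,
    sha: String,
}

fn parse_pr_event(json: &str) -> serde_json::Result<GhEventPullRequest> {
    serde_json::from_str(json)
}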

#[clap(about = "Verify the size of a binary hasn't changed more than allowed.")]
pub struct VerifySize {
    /// Old binary path
    #[clap(short, long, required(true))]
Contributor

I believe the required(true) is redundant, as these aren't using Option<PathBuf>, nor are they relying on any clap default directive.
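i.e., something like the following sketch should behave identically (standalone for brevity; field names follow the surrounding snippet):

use clap::Parser;
use std::path::PathBuf;

#[derive(Parser)]
#[clap(about = "Verify the size of a binary hasn't changed more than allowed.")]
pub struct VerifySize {
    /// Old binary path
    #[clap(short, long)]
    original: PathBuf,

    /// New binary path
    #[clap(short, long)]
    new: PathBuf,
}

A plain (non-Option, no-default) field is already treated as required by the clap derive, so dropping required(true) shouldn't change the CLI's behavior.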

Comment on lines +81 to +87
let original_elf = object::File::parse(&*original).or_else(|e| {
    anyhow::bail!(
        r#"Unable to parse target file "{}". Error: "{}""#,
        &self.original.display(),
        e
    )
})?;
Contributor

https://docs.rs/anyhow/latest/anyhow/trait.Context.html is your friend

Suggested change
- let original_elf = object::File::parse(&*original).or_else(|e| {
-     anyhow::bail!(
-         r#"Unable to parse target file "{}". Error: "{}""#,
-         &self.original.display(),
-         e
-     )
- })?;
+ let original_elf = object::File::parse(&*original).context(
+     format!(
+         r#"Unable to parse target file "{}""#,
+         &self.original.display()
+     )
+ )?;
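(Or the closure form of the same snippet, which only builds the message on the error path - a minor nicety, not something this suggestion hinges on:)

let original_elf = object::File::parse(&*original)
    .with_context(|| format!(r#"Unable to parse target file "{}""#, self.original.display()))?;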

@@ -0,0 +1,55 @@
// Copyright (c) Microsoft Corporation.
// Licensed under the MIT License.

Contributor

don't forget your module docs

Contributor

(here, and elsewhere)

let get_action_id = |commit: String| {
    xshell::cmd!(
        sh,
        "gh run list --commit {commit} -w '[flowey] OpenVMM CI' -s 'completed' -L 1 --json databaseId --jq '.[].databaseId'"
Contributor

we definitely don't want to be hard-coding OpenVMM-specific pipeline names within a flowey_lib_common node. This should be something that gets passed in as a runtime / comptime var.

sh,
"gh run list --commit {commit} -w '[flowey] OpenVMM CI' -s 'completed' -L 1 --json databaseId --jq '.[].databaseId'"
)
.env("GITHUB_TOKEN", gh_token.clone())
Contributor

is this actually necessary? I would expect GitHub Actions to already have this set ambiently?

gh_workflow_id,
} = request;

let gh_token = ctx.get_gh_context_var(GhContextVar::GITHUB__TOKEN);
Contributor

hmmm, maybe not strictly required for this PR, but do we need some API like get_gh_token_with_permissions(impl Iterator<Item = GhPermission>) -> ReadVar<String>? i.e.: in case you need the pipeline to have a particular GitHub permission, outside the context of emit_gh_step?
