explain: don't create missing postquery plans in some cases #139078

yuzefovich · 2025-01-14T22:13:00Z

bc50897 where we added the support
for showing plans of cascades in EXPLAIN output we needed to introduce
an ability to "create post-query plan if missing". This was needed since
in "vanilla" EXPLAIN we never actually run the main query, so the
cascades are never actually planned. We also enabled this logic in two
other places - in EXPLAIN ANALYZE as well as when populating "plan for
stats". (The latter is used in exec stats feature, e.g. as plan column
in system.statement_statistics.) Both of these cases result in
creating post-query plans after the query execution which could lead
to unexpected behavior. One such behavior was observed when we added the
EXPLAIN support for triggers when the txn captured by the trigger plan
function could have been committed. Another case was recently introduced
where we reset the gist factory after the execbuilding the main query,
yet the gist factory was captured by the cascade plan function.

This shows that the mechanism is fragile. Note that the "vanilla"
EXPLAIN code path is unaffected by these cases because

we created all separate exec.Factory objects (in
execFactory.ConstructExplain and execbuilder.Builder.buildExplain),
so there is no concern about factories being reset after the
"main" optimizer plan was created.
the txn in which the EXPLAIN statement runs is still open
since we're in the middle of the execution of the
explainPlanNode.

Additionally, we want to highlight the cases when cascades and triggers
didn't actually run because the main query didn't modify any rows. We
can address this desire as well as the fragility of the mechanism by
disallowing "creating post-query plans if missing" in EXPLAIN ANALYZE
which is what this commit does. Now, when we try to emit the plan for
a cascade or a trigger and we realize that we didn't cache the plan, we
won't attempt to create it, and we'll instead add short-circuited
attribute to the output.

Due to the same fragility we also disallow "creating post-query plans if
missing" in the "plan for stats" case. This means that the "plan for
stats" could change depending on the data that the query ran over.
(For example, in one case the cascade could be triggered and in another
the cascade could be skipped ("short-circuited") depending on whether the
main query modified some rows or not.) This is a bit unfortunate because
we would produce different "plan for stats" even though the query is the
same, but it seems like an edge case that we can accept to avoid the
fragility.

Fixes: #125509.
Fixes: #135157.
Fixes: #138974.

Release note: None

cockroach-teamcity · 2025-01-14T22:13:11Z

This change is

In bc50897 where we added the support for showing plans of cascades in EXPLAIN output we needed to introduce an ability to "create post-query plan if missing". This was needed since in "vanilla" EXPLAIN we never actually run the main query, so the cascades are never actually planned. We also enabled this logic in two other places - in EXPLAIN ANALYZE as well as when populating "plan for stats". (The latter is used in exec stats feature, e.g. as `plan` column in `system.statement_statistics`.) Both of these cases result in creating post-query plans _after_ the query execution which could lead to unexpected behavior. One such behavior was observed when we added the EXPLAIN support for triggers when the txn captured by the trigger plan function could have been committed. Another case was recently introduced where we reset the gist factory after the execbuilding the main query, yet the gist factory was captured by the cascade plan function. This shows that the mechanism is fragile. Note that the "vanilla" EXPLAIN code path is unaffected by these cases because 1. we created all separate exec.Factory objects (in `execFactory.ConstructExplain` and `execbuilder.Builder.buildExplain`), so there is no concern about factories being reset after the "main" optimizer plan was created. 2. the txn in which the EXPLAIN statement runs is still open since we're in the middle of the execution of the `explainPlanNode`. Additionally, we want to highlight the cases when cascades and triggers didn't actually run because the main query didn't modify any rows. We can address this desire as well as the fragility of the mechanism by disallowing "creating post-query plans if missing" in EXPLAIN ANALYZE which is what this commit does. Now, when we try to emit the plan for a cascade or a trigger and we realize that we didn't cache the plan, we won't attempt to create it, and we'll instead add `short-circuited` attribute to the output. Due to the same fragility we also disallow "creating post-query plans if missing" in the "plan for stats" case. This means that the "plan for stats" could change depending on the data that the query ran over. (For example, in one case the cascade could be triggered and in another the cascade could be skipped ("short-circuited") depending on whether the main query modified some rows or not.) This is a bit unfortunate because we would produce different "plan for stats" even though the query is the same, but it seems like an edge case that we can accept to avoid the fragility. Release note: None

yuzefovich force-pushed the explain branch from 5491368 to 282a545 Compare January 14, 2025 22:21

yuzefovich mentioned this pull request Jan 14, 2025

sql: fix an internal error in EXPLAIN ANALYZE in some cases #139071

Draft

yuzefovich force-pushed the explain branch from 282a545 to a263ffa Compare January 15, 2025 02:35

yuzefovich changed the title ~~explain: indicate short-circuited cascades and triggers~~ explain: don't create missing postquery plans in some cases Jan 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

explain: don't create missing postquery plans in some cases #139078

explain: don't create missing postquery plans in some cases #139078

yuzefovich commented Jan 14, 2025 •

edited

Loading

cockroach-teamcity commented Jan 14, 2025

explain: don't create missing postquery plans in some cases #139078

Are you sure you want to change the base?

explain: don't create missing postquery plans in some cases #139078

Conversation

yuzefovich commented Jan 14, 2025 • edited Loading

cockroach-teamcity commented Jan 14, 2025

yuzefovich commented Jan 14, 2025 •

edited

Loading