Fix performance regression on big `gherkin::Feature`s (#331) #352

tyranron · 2025-01-06T10:51:32Z

Resolves #331

Synopsis

We maintain a large collection of feature files that feed nightly regression tests, the runtime of which has grown significantly recently. These are generally maintained in logically separated feature files, and leverage Scenario Outline tables, sometimes with 100-200 scenarios per feature file.

In experimenting with optimizations on a single tag, I tried splitting a feature file into 4 and observed a massive performance gain. Here is my baseline:
[Summary]
1 feature
257 scenarios (226 passed, 31 failed)
1663 steps (1632 passed, 31 failed)
Tests completed in 266 sec
Here are the same exact tests divided over 4 feature files:
[Summary]
4 features
257 scenarios (226 passed, 31 failed)
1663 steps (1632 passed, 31 failed)
Tests completed in 99 sec

Cause

See #331 (comment):

I've tracked down the cause of this issue. The bigger .feature file is - the bigger gherkin::Feature structure is parsed out of it. Once we have it quite big, the usual Clone operations become quite expensive. We've already used Arc through the codebase for passing them around, but there were still some places doing so unwise. The one I tracked down instanly and fixed in 562e2ba.

However, this is only a part of the problem. Since gherkin types use naive #[derive(Hash, PartialEq)] implementations, they become too expensive for large structures. Frankly, the cargo-flamegraph shows on the repro that <gherkin::Table as Hash>::hash() ate 42% CPU time and <gherkin::Feature as PartialEq>::eq() ate 22% CPU time 😱

Solution

Replace Arc usages on gherkin types with Source, where Source is a transparent wrapper over Arc, implementing PartialEq and Hash based on the pointer value.

impl<T> PartialEq for Source<T> {
    fn eq(&self, other: &Self) -> bool {
        Arc::ptr_eq(&self.0, &other.0)
    }
}

impl<T> Hash for Source<T> {
    fn hash<H: Hasher>(&self, state: &mut H) {
        Arc::as_ptr(&self.0).hash(state);
    }
}

# Conflicts: # src/runner/basic.rs

tyranron added 9 commits January 3, 2025 13:23

Setup

ca787f2

Merge branch 'main' into 331-fix-performance

02ec40d

# Conflicts: # src/runner/basic.rs

Upd

d130970

Replace linked-hash-map crate with more maintained hashlink

40ebe7b

Merge branch 'main' into 331-fix-performance

06cf007

Test

0875990

Upd

f9bd47d

Refactor gherkin::Scenario usage

c13ae55

Remove debug prints [skip ci]

1f07e42

tyranron added enhancement Improvement of existing features or bugfix semver::breaking Represents breaking changes k::refactor Refactoring, technical debt elimination and other improvements of existing code base k::performance Related to performance of library labels Jan 6, 2025

tyranron added this to the 0.22.0 milestone Jan 6, 2025

tyranron self-assigned this Jan 6, 2025

tyranron added 9 commits January 6, 2025 12:55

Make Source transparent

30c1a41

Refactor gherkin::Rule usage

4457046

Refactor gherkin::Step usage

661cc96

Describe guarantees in Runner interface

b2579c4

Fix

fe2857a

Try fix that tests flakiness

2f9baf7

Fix book tests

af87a07

Mention in CHANGELOG

004045f

Polish

36b5c86

tyranron marked this pull request as ready for review January 7, 2025 11:23

tyranron merged commit 862df28 into main Jan 7, 2025
43 checks passed

tyranron deleted the 331-fix-performance branch January 7, 2025 11:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix performance regression on big `gherkin::Feature`s (#331) #352

Fix performance regression on big `gherkin::Feature`s (#331) #352

tyranron commented Jan 6, 2025

Fix performance regression on big gherkin::Features (#331) #352

Fix performance regression on big gherkin::Features (#331) #352

Conversation

tyranron commented Jan 6, 2025

Synopsis

Cause

Solution

Fix performance regression on big `gherkin::Feature`s (#331) #352

Fix performance regression on big `gherkin::Feature`s (#331) #352