Parallel matching #330

szymon-zygula · 2024-08-09T14:11:47Z

This PR adds a parallel matching algorithm using relaxed memory ordering atomic operations in the UnionFind, allowing for parallel path compression. Theoretically, this is overhead-free on x86 (because of total store order), other than some cache coherence magic. Some people may remember it from conversations at PLDI 2024. This makes parallel matching scale well with the number of threads.

The functionality is hidden behind parallel-matching feature. When the feature is activated, you can set parallel_matching to true in Runner to use it.

Atomics in the UnionFind are always used, regardless of the parallel-matching feature. There could be two implementations, the new one and the old one, depending on parallel-matching, although in my opinion it would add unnecessary maintenance overhead. Opinions on this topic are welcome.

mwillsey

Hi, thanks for the contribution!

I like the idea of merging this, but I'm not willing to accept any breaking changes for it (when you aren't using the feature). I also think that the unionfind change isn't necessary to parallel matching; you should just be able to us the existing one.

Thoughts?

mwillsey · 2024-08-20T21:42:18Z

src/unionfind.rs

@@ -1,50 +1,66 @@
-use crate::Id;
+use crate::{AtomicId, Id};


I don't think we need any changes to the unionfind to enable parallel matching.

It's not strictly necessary. Just to check I did some more benchmarking, and although the parallel path compression does help in some scenarios, the gains are not too big (under 10%), so using current UnionFind should not affect performance. I'm reversing this change then.

src/lib.rs

src/rewrite.rs

mwillsey · 2024-08-20T22:39:10Z

src/language.rs

@@ -28,11 +28,11 @@ use thiserror::Error;
 ///
 /// See [`SymbolLang`] for quick-and-dirty use cases.
 #[allow(clippy::len_without_is_empty)]
-pub trait Language: Debug + Clone + Eq + Ord + Hash {
+pub trait Language: Debug + Clone + Eq + Ord + Hash + Send + Sync {


I don't think it's acceptable to change the bounds of Language at this point in egg's lifecycle. Getting rid of this I think is the main challenge the merging this.

I added a new trait MaybePar, which is

automatically implemented for every type when parallel-matching is not set,

automatically implemented for Sync + Send types when parallel-matching is not set.

Language, Analysis, Analysis::Data, and some other traits now have bounds involving this trait, so nothing changes when parallel-matching isn't set. Let me know what you think about this solution. If trait aliases ever get stabilized, they could be used instead.
Another solution could be creating a separate hierarchy of traits (ParLanguage, ParAnalysis, etc.), but this creates a lot of boilerplate code, and after trying this out I really don't think it's a good idea.

mwillsey · 2024-08-30T22:25:05Z

I think this is a cool contribution, but I will close it for now in favor of this commit I just added: d014800.

It should be quite easy to add (rule-based) parallelism on the client side without any further changes to egg. I have documented an example of doing so.

szymon-zygula added 11 commits August 6, 2024 14:51

Add AtomicId

cc2d402

Use AtomicId in UnionFind

891d71d

Add rayon

e78ec54

Add parallel-matching feature

7dd2537

Add parallel matching over rules

5ab359b

Add parallel matching over e-classes

5cb0906

Add parallel matching tests

dded3c8

Remove debug println

deafd24

Fix documentation for par_search_rewrite

8213f18

Fix formatting

984fe1e

Fix nits

8bc098c

mwillsey requested changes Aug 20, 2024

View reviewed changes

szymon-zygula added 6 commits August 28, 2024 16:37

Replace map ... flatten with flat_map

302c80e

Add some trait bounds only with parallel-matching

e21e60d

Restore original UnioFind

09e0a74

Remove AtomicId

df3b70a

Add MaybePar bound to RewriteScheduler

a542d71

Add documentation for RunnerHook and MaybePar

2b14aea

szymon-zygula requested a review from mwillsey August 30, 2024 11:01

mwillsey closed this Aug 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parallel matching #330

Parallel matching #330

szymon-zygula commented Aug 9, 2024

mwillsey left a comment

mwillsey Aug 20, 2024

szymon-zygula Aug 30, 2024

mwillsey Aug 20, 2024

szymon-zygula Aug 30, 2024 •

edited

Loading

mwillsey commented Aug 30, 2024

Parallel matching #330

Parallel matching #330

Conversation

szymon-zygula commented Aug 9, 2024

mwillsey left a comment

Choose a reason for hiding this comment

mwillsey Aug 20, 2024

Choose a reason for hiding this comment

szymon-zygula Aug 30, 2024

Choose a reason for hiding this comment

mwillsey Aug 20, 2024

Choose a reason for hiding this comment

szymon-zygula Aug 30, 2024 • edited Loading

Choose a reason for hiding this comment

mwillsey commented Aug 30, 2024

szymon-zygula Aug 30, 2024 •

edited

Loading