Ifpack2/KokkosKernels? Performance regression with 4.3.1 in AdditiveSchwarz initialization with overlap #13013
Comments
Automatic mention of the @trilinos/ifpack2 team |
@kliegeois @brian-kelley We also saw substantial increases in disk read time for the sparc problem, though we couldn't attribute them to the Kokkos/KK promotion. Kim reworked the test to remove the disk read from the remainder. |
@csiefer2 Sounds like that should be a separate issue, if it's reproducible and caused by a code change. |
@csiefer2 to be precise, the disk read was not in the remainder before #12997. |
@csiefer2 @kliegeois I think I figured both of these things out (but they're not related)
|
BTW, the sparc slowdown was not caused by host-device copying either: replacing cusparse with native (also on GPU, obviously) decreases the time of that warmup apply from 2.36 s to 0.0038 s, so the matrix was already synced to device. |
But I'll close this one as "not a code issue" |
Between 45d800 and 53b714, the "remainder" component of preconditioner initialization (AdditiveSchwarz + ILUk, overlap level 1) slowed down significantly on eclipse and amber (for example, from 14.2 s to 17.9 s on amber, about a 26% increase, for one abnormal energy problem). The remainder is not supposed to include any of the expensive steps such as RILUK setup, filter construction, or overlapping row matrix construction.
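For readers unfamiliar with this kind of timer breakdown, here is a minimal sketch of how a "remainder" can be derived, assuming it is simply the total initialization time minus the separately timed components. The function name, the dictionary layout, and all numbers are hypothetical and chosen only for illustration; they are not measurements from this issue, and the actual test may compute its remainder differently.

```python
def remainder_time(total, components):
    """Return the part of `total` (seconds) not covered by any named component timer."""
    return total - sum(components.values())


# Hypothetical timings in seconds, for illustration only.
components = {
    "RILUK setup": 4.0,
    "filter construction": 1.5,
    "overlapping row matrix construction": 2.0,
}

before = remainder_time(10.0, components)  # total before the change (made up)
after = remainder_time(12.0, components)   # total after the change (made up)

# If the named components are unchanged, any growth in the total shows up
# entirely in the remainder.
print(f"remainder before: {before} s, after: {after} s, delta: {after - before} s")
```

Under this assumption, any work that moves into or out of a named timer shifts the remainder, which is why a disk read counted inside it can look like a regression in unrelated code.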