Dear author, transformers use layer norm instead of batch norm. Is it possible to apply InPlace-ABN to transformer-based models? Or is there any other way to lower those models' GPU memory usage? Thanks.
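For context, one general way to cut a transformer's activation memory, independent of InPlace-ABN (which is built around fused BatchNorm + activation), is gradient checkpointing via `torch.utils.checkpoint`: activations are recomputed during the backward pass instead of being stored. A minimal sketch, assuming a standard pre-norm encoder (all module names and sizes here are illustrative, not from this repository):

```python
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint


class Block(nn.Module):
    """A standard pre-norm transformer block (uses LayerNorm, not BatchNorm)."""
    def __init__(self, dim=512, heads=8):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim)
        self.mlp = nn.Sequential(
            nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim)
        )

    def forward(self, x):
        h = self.norm1(x)
        x = x + self.attn(h, h, h, need_weights=False)[0]
        x = x + self.mlp(self.norm2(x))
        return x


class Encoder(nn.Module):
    def __init__(self, depth=12, dim=512, use_checkpoint=True):
        super().__init__()
        self.blocks = nn.ModuleList(Block(dim) for _ in range(depth))
        self.use_checkpoint = use_checkpoint

    def forward(self, x):
        for blk in self.blocks:
            if self.use_checkpoint and self.training:
                # Do not store this block's intermediate activations;
                # recompute them in backward -- trades compute for memory.
                x = checkpoint(blk, x, use_reentrant=False)
            else:
                x = blk(x)
        return x


model = Encoder().cuda()
x = torch.randn(4, 197, 512, device="cuda", requires_grad=True)
model(x).sum().backward()  # per-block activations are recomputed here
```

With checkpointing on all blocks, stored activation memory scales with one block instead of the full depth, at the cost of roughly one extra forward pass per step.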