Skip to content

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models#6553

Open
gyou2021 wants to merge 4 commits intomicrosoft:masterfrom gyou2021:configurable_autoTP