The Implicit Bias of Gradient Descent on Separable Multiclass Data | Read Paper on Bytez