On this work, we analyze a pre-trained mT5 to find the attributes of cross-lingual connections discovered by this mannequin. By a statistical interpretation framework over 90 language pairs throughout three duties, we present that switch efficiency may be modeled by a couple of linguistic and data-derived options. These observations allow us to interpret cross-lingual understanding of the mT5 mannequin. By these observations, one can favorably select the perfect supply language for a process, and may anticipate its coaching information calls for. A key discovering of this work is that similarity of syntax, morphology and phonology are good predictors of cross- lingual switch, considerably extra than simply the lexical similarity of languages. For a given language, we’re in a position to predict zero-shot efficiency, that will increase on a logarithmic scale with the variety of few-shot goal language information factors.