Transformer pro and contra
qte77 · September 1, 2022
ml theory transformer pro vs con complexity TIME SPACE path length parallelization transfer learning pre-training

O(n^2)
O(1)
Parallelization
Transfer learning
Pre-training
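To make the O(n^2) and O(1) entries concrete, here is a minimal sketch of single-head self-attention in plain Python. It assumes, based on the post's tags, that O(n^2) refers to the time and space cost of self-attention in the sequence length n, and that O(1) refers to the maximum path length between any two positions. The function name `self_attention` and the use of identity projections in place of learned query/key/value matrices are simplifications made for illustration, not part of the original post.

```python
import math


def self_attention(x):
    """Single-head self-attention with identity Q/K/V projections (assumed for brevity).

    x: list of n token vectors, each a list of d floats.
    Returns (outputs, weights), where weights is an n x n matrix.
    """
    n, d = len(x), len(x[0])
    scale = math.sqrt(d)
    weights = []
    # Each row is computed independently of the others, which is what
    # makes the layer parallelizable across positions.
    for q in x:
        # One score per (query, key) pair: n scores per row, n*n in total.
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in x]
        # Numerically stable softmax over the row.
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]
        total = sum(exps)
        weights.append([e / total for e in exps])
    # Each output is a weighted sum over all n input vectors.
    outputs = [
        [sum(w[j] * x[j][c] for j in range(n)) for c in range(d)]
        for w in weights
    ]
    return outputs, weights


tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, -0.5]]
outputs, weights = self_attention(tokens)
num_scores = sum(len(row) for row in weights)
print(num_scores)  # 16 pairwise weights for n = 4
```

The weight matrix holds n * n entries, which is where the quadratic time and memory cost comes from. Every entry is strictly positive, so each position reads from every other position in a single layer, which is one way to read the O(1) path length.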