Question 6

Domain 2: Core Machine Learning, AI, and Transformer Foundations

What is the main purpose of layer normalization in transformers?