AI-102 Practice Q40

A. Switch to a different Azure region for better translation performance

B. Add more high-quality parallel sentence pairs (aim for 50,000+) and ensure domain-specific terminology is well-represented

BLEU is an evaluation metric for machine translation, and a score of 28 on only 5,000 aligned English–French sentence pairs indicates the model is undertrained for the target domain. In practice, the most effective fix is to increase the size and quality of the parallel corpus substantially—on the order of tens of thousands of sentence pairs, here 50,000+—because the model needs more aligned examples to learn technical phrasing and terminology consistently. For a custom translation system, domain coverage matters as much as volume, so ensuring manuals’ specialized terms are well represented directly addresses the weak BLEU performance.

C. Enable document translation mode instead of text translation

D. Reduce the training data to 1,000 highest-quality pairs

Question 40

Explanation

Why each option is right or wrong