MLS-C01 Practice Q15

A. Run another offline evaluation on the same validation dataset

Offline evaluation uses existing data rather than live production behavior.

B. Replace the model for all users and observe overall metrics

A/B testing compares live outcomes between alternatives instead of switching everyone at once.

C. Conduct an online evaluation using an A/B test in production

The team has already completed offline evaluation on a held-out dataset and now wants to measure whether the new model changes actual user behavior after deployment. Online evaluation addresses live production impact, and A/B testing is the named method for comparing the new model against a baseline in that setting.

D. Skip evaluation because the held-out dataset already showed good performance

Offline results do not establish real-world production impact on users.

Question 15

Explanation

Why each option is right or wrong