Question 13
Domain 2A machine learning team is reviewing how to assess a newly built text-generation system before wider release. Based on the syllabus topic provided, which capability is the primary focus of this evaluation activity?
Correct answer: B
Explanation
Evaluating generative AI solutions focuses on assessing systems that generate new content rather than systems that only classify, rank, or retrieve existing information. — pmle_syllabus.txt
Why each option is right or wrong
A. Measuring how accurately the system assigns predefined labels to inputs
Classification evaluates assignment of existing categories, not generation of new content.
B. Assessing how well the system produces new content in response to prompts
The source material identifies the topic as evaluating generative AI solutions, and generative AI refers to systems whose defining function is generating content. In this scenario, a text-generation system should therefore be evaluated on how well it produces new output from prompts.
C. Determining whether the system retrieves the most relevant stored document
Retrieval concerns finding existing information, not producing original generated output.
D. Calculating how consistently the system groups similar records into clusters
Clustering organizes existing data patterns and does not evaluate generated responses.