Question 26

Domain 3: Applications of Foundation Models

An education company is building a chatbot whose target audience is teenagers. The company is training a custom large language model (LLM). The company wants the chatbot to speak in the target audience's language style by using creative spelling and shortened words. Which metric will assess the LLM's performance?

A. F1 score
B. BERTScore
C. Recall-Oriented Understudy for Gisting Evaluation (ROUGE)
D. Bilingual Evaluation Understudy (BLEU) score

Explanation

Why each option is right or wrong
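
One way to reason about the options is to check how an exact word-overlap score behaves on creatively spelled text. The sketch below is a simplified, hypothetical stand-in for the unigram overlap that BLEU (precision-oriented) and ROUGE (recall-oriented) are built on; it is not either metric's full definition, and the sample sentences are invented for illustration.

```python
from collections import Counter


def unigram_overlap(candidate: str, reference: str) -> tuple[float, float]:
    """Return (precision, recall) of exact, clipped unigram matches.

    Simplified illustration only: real BLEU adds higher-order n-grams and a
    brevity penalty, and real ROUGE has several variants.
    """
    cand = candidate.lower().split()
    ref = reference.lower().split()
    # Counter intersection keeps the minimum count per token (clipped matches).
    overlap = sum((Counter(cand) & Counter(ref)).values())
    precision = overlap / len(cand) if cand else 0.0
    recall = overlap / len(ref) if ref else 0.0
    return precision, recall


# Hypothetical example sentences (not from the source question).
reference = "see you later tonight"
formal = "see you later tonight"
teen = "c u l8r 2nite"  # same meaning, creative spelling

formal_scores = unigram_overlap(formal, reference)
teen_scores = unigram_overlap(teen, reference)

print(formal_scores)  # (1.0, 1.0)
print(teen_scores)    # (0.0, 0.0)
```

The creatively spelled reply carries the same meaning but shares no exact tokens with the reference, so the overlap score falls to zero. That is the usual argument for preferring an embedding-based metric such as BERTScore in this scenario, since it compares contextual embeddings rather than exact word matches. F1 score, by contrast, is normally used for classification tasks rather than generated text.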