Generative AI Leader Practice Q37

A. Output length / max output tokens

Under the standard generation controls used by LLM APIs, the parameter that sets the hard stop on how many tokens the model may emit is the output-length limit, commonly exposed as max_output_tokens or max_tokens. By contrast, parameters like temperature or top_p only change sampling behavior and do not impose a numeric ceiling on the response length.

B. Top-p

Top-p limits token selection to a probability mass, affecting diversity rather than response length.

C. Temperature

Temperature changes randomness in token choice, not the maximum number of tokens generated.

D. Safety threshold

Safety threshold filters or blocks unsafe content; it is not a length-control setting.

Question 37

Explanation

Why each option is right or wrong