Question 11

Domain 2: Data Preparation

After changing the response generating LLM in a RAG pipeline from GPT-4 to a model with a shorter context length that the company self-hosts, the Generative AI Engineer is getting the following error: What TWO solutions should the Generative AI Engineer implement without changing the response generating model? (Choose two.)

A. Decrease the chunk size of embedded documents B. Reduce the number of records retrieved from the vector database C. All of the above

Previous Next

Question 11

Explanation

Why each option is right or wrong