GenAI Associate Exam Prep
GenAI Associate Exam Glossary - 73 Terms
Search the terminology pack for Databricks Certified Generative AI Engineer Associate. Use these definitions with the study guide and practice questions.
A
- Agent Bricks
- A Databricks feature set for solving problems with specialized agent patterns, including Knowledge Assistant, Multiagent Supervisor, and Information Extraction.
- Agent Framework
- A Databricks framework used to deploy and track LLMs or agents, including with AI Gateway.
- AI Gateway
- A Databricks capability that includes Inference Tables, Usage Tables, and rate limiting for tracking deployed LLMs or agents.
- ai_query()
- A Databricks function used to identify and handle batch inference workloads appropriately.
- aliases
- Named pointers used to promote prompts after they pass validation while preserving version history and enabling rollback. In the question, they are used with MLflow versions.
C
- chunking
- A data preparation strategy that splits documents into smaller pieces to fit document structure and model constraints for retrieval and generation.
- CI/CD
- Continuous integration and continuous delivery/deployment practices used here for updating indexes, promoting prompts, and testing components.
- context length
- The maximum token length an embedding model can handle; the exam includes choosing among context length options based on cost, latency, and quality.
- custom Scorers
- Databricks scoring tools used to evaluate agents and LLMs.
D
- Databricks App
- An application built on Databricks that can provide an interactive user-facing interface. In the question, it is used for customer support agents asking questions and receiving grounded answers.
- Databricks Certified Generative AI Engineer Associate
- A Databricks certification exam that assesses the ability to design and implement LLM-enabled solutions using Databricks, including RAG applications, LLM chains, model selection, governance, deployment, and monitoring.
- Databricks Secrets
- A Databricks feature for securely storing sensitive values such as API keys. In the question, it is used to store the external MCP server API key.
- Delta Lake
- A Databricks storage layer used here as the destination for writing chunked text tables in Unity Catalog.
E
- embedding dimension
- The size of the vector representation produced by an embedding model.
- embedding model
- A model that converts text into vector representations; the exam covers selecting one based on context length, source documents, queries, and optimization strategy.
- embeddings
- Vector representations of items used for similarity search. The question uses the number of embeddings as one of the factors for configuring vector search.
- evaluation
- The process of assessing model or application outputs, including qualitative assessment, scoring, and comparison of metrics.
- external MCP server
- An MCP server hosted outside Databricks that requires connection details and, in the question, an API key stored in Databricks Secrets.
F
- feature store
- A store for structured features used in the sample question as an alternative data source for transaction-based information.
- fine tune
- To train a custom model further on task-specific data. In the question, it refers to fine-tuning a custom embedding model for vector search.
- fine-tune
- To update a model using a tuning dataset so it better reflects desired or current information.
- Foundation Model APIs
- Databricks APIs used to serve LLM applications leveraging foundation models.
G
- ground truth
- Reference-correct answers required by some evaluation judges.
- guardrails
- Controls used to prevent negative outcomes from LLM applications, including protection against malicious inputs and unsafe outputs.
H
- Hugging Face Transformers
- A related online tool/service used for working with transformer-based models in generative AI applications.
- hybrid search
- A search setup that combines vector-based retrieval with other retrieval methods, typically keyword-based search, to improve results. In the question, it can be enabled together with reranking.
I
- Inference Tables
- Tables used to track inference activity for deployed models or agents.
- Information Extraction
- An Agent Bricks option used to extract structured information from content.
K
- Knowledge Assistant
- An Agent Bricks option used to solve problems by providing knowledge-based assistance.
L
- LangChain
- A tool used in generative AI applications for building chains and related workflows.
- latency
- The time delay before a search or model response is returned. In the question, it is the most critical metric for the search workload.
- legal/licensing requirements
- Legal and licensing constraints on data sources that must be followed to avoid legal risk in GenAI applications.
- LLM
- A large language model used in generative AI applications; the exam expects knowledge of current LLMs, their capabilities, and how to select them for tasks.
- LLM chains
- Multi-step application flows that combine an LLM with other components such as tools, retrievers, or prompts to produce an output.
- LLM-as-a-judge
- An evaluation approach where a language model scores or judges model outputs instead of, or in addition to, human raters. In the question, it is proposed as a way to rescore responses.
M
- managed MCP server
- An MCP server provided and operated by Databricks for a supported data source. In the question, it is configured by specifying the server type as managed and providing the server identifier.
- masking techniques
- Data protection techniques used as guardrails to meet performance objectives while limiting exposure of sensitive data.
- MCP
- Model Context Protocol. In the question, MCP servers are integrated to give an agent access to external and managed data sources.
- MCP server
- A server that exposes tools or data sources to an agent through the Model Context Protocol. In the question, both managed and external MCP servers are used.
- MCP servers
- Servers that can be integrated as managed, external, or custom components based on application requirements.
- MLflow
- A Databricks-supported tool for managing the solution lifecycle, and for developing, evaluating, tracing, and monitoring models and agents.
- mlflow.genai.evaluate()
- An MLflow GenAI evaluation function used for consistent agent evaluation with aligned judgments. In the question, it is the place where calibrated SME judgments are applied.
- model card
- Documentation describing a model’s attributes and metadata used to help select the best model for a task.
- model hub
- A repository or marketplace from which models can be selected for a task based on metadata or model cards.
- Model Serving
- A Databricks capability for deploying models and solutions through serving endpoints.
- model signature
- The defined input and output schema for a model, used when assembling a RAG application.
- Mosaic AI Agent endpoint
- An endpoint that an app can call to interact with a Mosaic AI agent. In the question, it must be called from the backend without exposing long-lived browser tokens.
- Mosaic AI Vector Search
- A Databricks vector search offering whose key concepts and components are covered by the exam.
- Multiagent Supervisor
- An Agent Bricks option used to coordinate multiple agents in solving a problem.
P
- PAT
- Personal access token. In the question, it is a Databricks token that should not be stored in browser JavaScript because it would expose long-lived credentials.
- prompt engineering
- The practice of designing prompts to elicit desired model behavior, including specific formatting and output changes.
- prompt generation
- The creation of prompts for generative AI applications.
- prompt templates
- Reusable prompt structures used to standardize how prompts are authored and managed across environments. In the question, they must be promoted from development to staging to production with version history and rollback support.
- prompt version control
- The practice of tracking and managing prompt changes across versions and environments.
- pyfunc
- A model flavor used to code a chain with pre-processing and post-processing.
- Python
- A programming language the exam expects candidates to use for RAG applications, agent development, and related APIs and libraries.
R
- RAG
- Retrieval-augmented generation, an application pattern that combines retrieved source context with generation to build performant applications.
- rate limiting
- A control that limits the rate of requests to a deployed LLM or agent.
- re-ranking
- A retrieval step that reorders retrieved results to improve the information retrieval process.
- reranking
- A post-retrieval step that reorders candidate search results to improve relevance. In the question, it is an optional setting for vector search.
- rubrics
- Defined scoring criteria used to evaluate outputs consistently. In the question, clear rubrics are used to calibrate SMEs and reduce rater inconsistency.
S
- Sentencizer
- A text-processing component that splits text into sentences. In the question context, it is presented as a possible NLP task category, but the text does not further define it.
- SME
- Subject matter expert. In the question, SMEs are the domain experts who review sampled answers and need calibration for consistent judgments.
- SME feedback
- Feedback from subject matter experts used to improve agent performance.
- standard vector search
- A vector search configuration that is not storage optimized. In the question, it is contrasted with storage optimized vector search.
- storage optimized vector search
- A vector search configuration optimized for storage efficiency. In the question, it is one of the setup options for handling a very large inventory.
- subject matter expert
- An expert whose feedback is used to improve agent performance.
- summarization
- An NLP task category that reduces a longer text to a shorter gist while preserving intent.
T
- Text Classification
- A natural language processing task category where a model assigns one or more labels to text. In the question context, it is presented as a possible evaluation category for the memo-to-gist application.
- text2text generation
- A natural language processing task category where a model transforms input text into output text. In the question context, it is one of the candidate task categories for converting a paragraph-long memo into a single-sentence gist.
U
- Unity Catalog
- A Databricks governance layer used for data governance and for registering models and storing chunked text in tables.
- Usage Tables
- Tables used to track usage for deployed models or agents.
V
- Vector Search
- A Databricks tool for semantic similarity search and retrieval, used to create and query vector search indexes and support RAG applications.
About These Definitions
These definitions are loaded from the shared release pack. Use them with the study guide and practice questions to connect vocabulary to exam scenarios.