NCP-AAI Practice Q33

A. Deploy all agents in a single pod with all resources allocated statically.

B. Deploy each agent as a separate Kubernetes Deployment with appropriate resource requests/limits, use the Horizontal Pod Autoscaler (HPA) for CPU-bound agents, use node selectors for GPU allocation, and implement a rolling update strategy.

In Kubernetes, each agent can be managed as its own Deployment under the apps/v1 API, with its own pod template, replica count, and update strategy, independent of the other agents. For CPU-driven agents, the HorizontalPodAutoscaler in autoscaling/v2 scales a Deployment between a configured minimum and maximum replica count based on observed CPU utilization, measured relative to the containers' CPU requests. GPU-bound pods are steered onto GPU-capable nodes using node selectors or node affinity in the pod spec. The scheduler uses resource requests to place pods, while limits are enforced on the node by the kubelet and container runtime. RollingUpdate is the default Deployment update strategy and replaces pods incrementally to keep the service available during changes.

C. Deploy all agents on a single large node with all GPUs.

D. Manually scale agents by changing replica counts based on time of day.
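The pattern described in option B can be sketched as a small Python script that builds the manifests as plain dictionaries and prints them as JSON. This is a minimal illustration, not a reference deployment: the agent names, image URLs, the `accelerator: nvidia-gpu` node label, and the replica and utilization numbers are all hypothetical, and the `nvidia.com/gpu` resource name assumes the NVIDIA device plugin is installed on the cluster.

```python
import json


def agent_deployment(name, image, cpu_request, cpu_limit,
                     gpus=0, node_selector=None, replicas=2):
    """Build an apps/v1 Deployment manifest for a single agent."""
    resources = {
        "requests": {"cpu": cpu_request},  # used by the scheduler for placement
        "limits": {"cpu": cpu_limit},      # enforced on the node
    }
    if gpus:
        # Assumption: the NVIDIA device plugin exposes this extended resource.
        resources["limits"]["nvidia.com/gpu"] = gpus

    pod_spec = {
        "containers": [{"name": name, "image": image, "resources": resources}]
    }
    if node_selector:
        # Pin the pod to nodes carrying these labels (e.g. GPU nodes).
        pod_spec["nodeSelector"] = node_selector

    return {
        "apiVersion": "apps/v1",
        "kind": "Deployment",
        "metadata": {"name": name},
        "spec": {
            "replicas": replicas,
            "selector": {"matchLabels": {"app": name}},
            "strategy": {
                "type": "RollingUpdate",
                # Add one new pod before removing an old one.
                "rollingUpdate": {"maxUnavailable": 0, "maxSurge": 1},
            },
            "template": {
                "metadata": {"labels": {"app": name}},
                "spec": pod_spec,
            },
        },
    }


def cpu_hpa(target, min_replicas, max_replicas, cpu_percent):
    """Build an autoscaling/v2 HPA that scales a Deployment on CPU utilization."""
    return {
        "apiVersion": "autoscaling/v2",
        "kind": "HorizontalPodAutoscaler",
        "metadata": {"name": f"{target}-hpa"},
        "spec": {
            "scaleTargetRef": {
                "apiVersion": "apps/v1",
                "kind": "Deployment",
                "name": target,
            },
            "minReplicas": min_replicas,
            "maxReplicas": max_replicas,
            "metrics": [{
                "type": "Resource",
                "resource": {
                    "name": "cpu",
                    "target": {
                        "type": "Utilization",
                        "averageUtilization": cpu_percent,
                    },
                },
            }],
        },
    }


# Hypothetical agents: one CPU-bound (autoscaled), one GPU-bound (pinned).
manifests = [
    agent_deployment("planner-agent", "example.com/planner:1.0", "500m", "1"),
    cpu_hpa("planner-agent", min_replicas=2, max_replicas=10, cpu_percent=70),
    agent_deployment("llm-agent", "example.com/llm:1.0", "2", "4",
                     gpus=1, node_selector={"accelerator": "nvidia-gpu"}),
]

print(json.dumps({"apiVersion": "v1", "kind": "List", "items": manifests}, indent=2))
```

Only the CPU-bound agent gets an HPA here; the GPU-bound agent keeps a fixed replica count and relies on the node selector for placement. With `maxUnavailable: 0` and `maxSurge: 1`, a rollout of the GPU agent needs one spare GPU on a matching node, so those values may need adjusting on a fully utilized cluster.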

Question 33

Explanation

Why each option is right or wrong