AIP-210 Practice Q29

A. Estimating gradients with finite differences by perturbing each parameter slightly

B. Applying the chain rule to propagate gradients backward through the computation graph

Backpropagation is the standard reverse-mode differentiation procedure for a computational graph: it evaluates partial derivatives at each node and combines them using the chain rule so the gradient of the loss with respect to earlier parameters can be obtained from later ones. In a feedforward network, this means the error signal is passed from the output layer back through each preceding layer, with each layer’s contribution computed from the derivatives of the operations that produced it.

C. Sampling random directions and averaging their effects on the loss

D. Solving a system of linear equations for each layer's parameters

Question 29

Explanation

Why each option is right or wrong