MLS-C01 Practice Q26

A. Apply quantile binning to group the data into categorical bins to keep any relationships in the data by replacing the magnitude with distribution.

Quantile binning discretizes values into buckets; it changes representation, not proper feature scaling.

B. Apply the Cartesian product transformation to create new combinations of fields that are independent of the magnitude.

Cartesian products create interaction features between variables; they do not address unequal numeric scales.

C. Apply normalization to ensure each field will have a mean of 0 and a variance of 1 to remove any significant magnitude.

The appropriate preprocessing step is feature standardization under the general data-scaling requirement in machine learning, because the inputs are on very different numeric ranges and the model would otherwise weight high-magnitude variables disproportionately. In standard statistical terms, this is the z-score transform, defined as x' = (x - μ)/σ, which produces features with mean 0 and standard deviation 1; that directly addresses the stated concern about magnitude dominance before training.

D. Apply the orthogonal sparse bigram (OSB) transformation to apply a fixed-size sliding window to generate new features of a similar magnitude.

OSB is a text-style feature extraction idea, not a standard method for scaling numeric economic variables.

Question 26

Explanation

Why each option is right or wrong