MLA-C01 Practice Q8

A. Create a DataBrew dataset by using the S3 path. Clean and normalize the data by using a DataBrew profile job.

Profile jobs analyze data quality and statistics, not perform the main transformation workflow.

B. Create a DataBrew dataset by using the S3 path. Clean and normalize the data by using a DataBrew recipe job.

AWS Glue DataBrew operates on a defined dataset, and an Amazon S3 location is a supported source for creating that dataset; the service then applies transformations through a recipe and executes them in a recipe job. In practice, the S3 path is used to point DataBrew at the bucket contents, and the recipe job is the mechanism that performs the cleaning and normalization steps required for the ML workflow.

C. Create a DataBrew dataset by using a Java Database Connectivity (JDBC) driver to connect to the S3 bucket. Clean and normalize the data by using a DataBrew profile job.

Amazon S3 is object storage, so JDBC is not the normal way to define a DataBrew dataset.

D. Create a DataBrew dataset by using a Java Database Connectivity (JDBC) driver to connect to the S3 bucket. Clean and normalize the data by using a DataBrew recipe job.

Recipe jobs transform data, but the dataset connection method is wrong because S3 is not accessed by JDBC.

Question 8

Explanation

Why each option is right or wrong