Databricks Certified Machine Learning Professional - Databricks-Machine-Learning-Professional Exam Questions

QUESTION NO: 1
A machine learning engineer is monitoring label values for a production machine learning classification model. The engineer believes that the relative prevalence of the classes is changing in more recent data. Which tool can the machine learning engineer use to assess their theory?
Correct Answer: B
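The answer options are not reproduced on this page, so the following is only a sketch of one common way to check whether class prevalence has shifted: a Pearson chi-square statistic comparing recent label counts with the counts expected under the reference proportions. The function name and the sample data are hypothetical, and nothing here is claimed to be the keyed answer.

```python
from collections import Counter


def chi_square_statistic(reference_labels, recent_labels):
    """Pearson chi-square statistic for recent class counts versus the
    counts expected under the reference class proportions."""
    ref_counts = Counter(reference_labels)
    new_counts = Counter(recent_labels)

    # A class that never appeared in the reference data has an expected
    # count of zero, which the statistic cannot handle; fail loudly.
    unseen = set(new_counts) - set(ref_counts)
    if unseen:
        raise ValueError(f"labels not present in reference data: {sorted(unseen)}")

    n_ref = len(reference_labels)
    n_new = len(recent_labels)
    statistic = 0.0
    for label, ref_count in ref_counts.items():
        expected = n_new * ref_count / n_ref
        observed = new_counts.get(label, 0)
        statistic += (observed - expected) ** 2 / expected
    return statistic


# Hypothetical data: the share of class "b" rises from 20% to 40%.
reference = ["a"] * 80 + ["b"] * 20
recent = ["a"] * 60 + ["b"] * 40
statistic = chi_square_statistic(reference, recent)
```

With two classes there is one degree of freedom, so the 5% critical value is about 3.84. The statistic for the sample data is 25.0, which would suggest a shift in class prevalence.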
QUESTION NO: 2
A machine learning engineer has developed a model and registered it using the FeatureStoreClient fs. The model has model URI model_uri. The engineer now needs to perform batch inference on customer-level Spark DataFrame spark_df, but it is missing a few of the static features that were used when training the model. The customer_id column is the primary key of spark_df and the training set used when training and logging the model.
Which of the following code blocks can be used to compute predictions for spark_df when the missing feature values can be found in the Feature Store by searching for features by customer_id?
Correct Answer: D
QUESTION NO: 3
A machine learning engineering team has written predictions computed in a batch job to a Delta table for querying. However, the team has noticed that the querying is running slowly. The team has already tuned the size of the data files. Upon investigating, the team has concluded that the rows meeting the query condition are sparsely located throughout each of the data files. Based on the scenario, which optimization technique could speed up the query by colocating similar records while considering values in multiple columns?
Correct Answer: C
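The scenario describes colocating records by values in more than one column, which is the idea behind Z-ordering (in Delta Lake, the `ZORDER BY` clause of `OPTIMIZE`). The answer options are not reproduced here, and the sketch below does not show how Delta Lake implements it. It only illustrates the underlying idea with a Morton key: interleaving the bits of two column values so that rows close in both columns sort near each other. The function name and sample rows are hypothetical.

```python
def interleave_bits(x, y, bits=16):
    """Interleave the low `bits` bits of two non-negative integers into a
    single Morton (Z-order) key. Bits of x land on even positions and
    bits of y on odd positions."""
    key = 0
    for i in range(bits):
        key |= ((x >> i) & 1) << (2 * i)
        key |= ((y >> i) & 1) << (2 * i + 1)
    return key


# Hypothetical (col_a, col_b) rows. Sorting by the Morton key groups rows
# that are close in both columns, rather than in the first column only.
rows = [(7, 0), (0, 1), (1, 0), (0, 7), (1, 1), (0, 0)]
rows_in_z_order = sorted(rows, key=lambda row: interleave_bits(row[0], row[1]))
```

A plain sort on `col_a` would scatter rows with similar `col_b` values across the output. The interleaved key trades some locality in each individual column for locality in both at once.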
QUESTION NO: 4
A Machine Learning Engineer has previously built a feature table for model training and inference using a batch mode approach:

They have been informed that they now require these features to be available in "real-time", with latency on the order of a minute. Their manager has informed them there is now a Kafka stream from which they can stream live data, and they need to have this ingested and available for low-latency feature lookups.
Which change to their existing code will achieve this?
Correct Answer: C
QUESTION NO: 5
A data scientist set up a machine learning pipeline to automatically log feature importance data from a CSV with each run. They now want to view the feature importance data in Databricks.
Which location in Databricks will show this data?
Correct Answer: A
QUESTION NO: 6
A machine learning engineer is attempting to create a webhook that will trigger a Databricks Job job_id when a model version for model model transitions into any MLflow Model Registry stage.
They have the following incomplete code block:

Which lines of code can be used to fill in the blank so that the code block accomplishes the task?
Correct Answer: B
QUESTION NO: 7
A data scientist is utilizing MLflow to track their machine learning experiments. After completing a series of runs for the experiment with experiment ID exp_id, the data scientist wants to programmatically work with the experiment run data in a Spark DataFrame. They have an active MLflow Client client and an active Spark session spark. Which of the following lines of code can be used to obtain run-level results for exp_id in a Spark DataFrame?
Correct Answer: C
QUESTION NO: 8
A machine learning engineer wants to log feature importance data from a CSV file at path importance_path with an MLflow run for model model. Which code block will accomplish this task inside of an existing MLflow run block?
Correct Answer: C
QUESTION NO: 9
A data scientist wants to examine the data in the Feature Store table table from the database dev as a Spark DataFrame. They have access to Feature Store Client fs. Which line of code can be used to get the data from table as a Spark DataFrame?
Correct Answer: A
QUESTION NO: 10
Which of the following tools can assist in real-time deployments by packaging software with its own application, tools, and libraries?
Correct Answer: C
QUESTION NO: 11
Which of the following is a simple, low-cost method of monitoring numeric feature drift?
Correct Answer: D
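The answer options are not reproduced on this page. As an illustration of what a simple, low-cost numeric drift check can look like, the sketch below compares summary statistics of a feature between a reference window and a recent window. The function names, the chosen statistics, and the sample values are assumptions made for the example.

```python
import statistics


def summarize(values):
    """Cheap summary statistics for one numeric feature."""
    return {
        "mean": statistics.fmean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),
        "min": min(values),
        "max": max(values),
    }


def summary_shift(reference, recent):
    """Difference (recent minus reference) for each summary statistic."""
    ref_summary = summarize(reference)
    new_summary = summarize(recent)
    return {name: new_summary[name] - ref_summary[name] for name in ref_summary}


# Hypothetical feature values from two time windows.
reference_window = [1.0, 2.0, 3.0, 4.0, 5.0]
recent_window = [2.0, 3.0, 4.0, 5.0, 6.0]
shift = summary_shift(reference_window, recent_window)
```

Tracking these differences over time is cheap because each window needs only a single pass over the data. In the sample above, the mean and median move by 1.0 while the standard deviation is unchanged, which points to a shift in location rather than spread.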
QUESTION NO: 12
A machine learning engineer has developed a model and registered it using the FeatureStoreClient fs. The model has model URI model_uri. The engineer now needs to perform batch inference on the training set logged with the model, but a few of the feature values in the column spend have since been updated and are present in the customer-level Spark DataFrame spark_df. The customer_id column is the primary key of spark_df and the training set used when training and logging the model. Which code block can be used to compute predictions for the training set while overwriting its old spend values with the new spend values from spark_df?
Correct Answer: B