
Microsoft Design and Implement Big Data Analytics Solutions - 070-475 Exam Questions
QUESTION NO: 1
A company named Fabrikam, Inc. has a web app hosted in Microsoft Azure. Millions of users visit the app daily.
All of the user visits are logged in Azure Blob storage. Data analysts at Fabrikam built a dashboard that processes the user visit logs.
Fabrikam plans to use an Apache Hadoop cluster on Azure HDInsight to process queries. The queries will access the data only once.
You need to recommend a query execution strategy.
What is the best strategy to recommend to achieve the goal?
More than one answer choice may achieve the goal. Select the
Correct Answer: D
QUESTION NO: 2
Which service solution and which table storage solution should you recommend for DB2? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.

Correct Answer:

Explanation

Box 1: Azure SQL Data Warehouse
Scenario: Relecloud plans to implement a data warehouse named DB2.
Box 2: Clustered Columnstore index
The columnstore index was introduced in SQL Server 2012 as a column-based nonclustered index geared toward increasing query performance for workloads that involve large amounts of data, typically found in data warehouse fact tables.
A clustered columnstore index is the physical storage for the entire table.
Scenario:
Relecloud identifies the following requirements for DB2:
DB2 must be able to store more than 40 TB of data.
References: https://docs.microsoft.com/en-us/sql/relational-databases/indexes/columnstore-indexes-overview
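To illustrate why column-based storage suits the fact-table workloads described above, here is a minimal sketch in Python. It is a toy model only and does not reflect how SQL Server implements columnstore indexes; the sample rows, column names, and helper functions are all hypothetical. It shows that an aggregate over one column only needs to read that column when the data is stored column by column.

```python
# Illustrative only: a toy comparison of row-oriented and column-oriented
# layouts. This is NOT how SQL Server implements columnstore indexes.

# Hypothetical fact-table rows: (sale_id, region, amount)
rows = [
    (1, "East", 100.0),
    (2, "West", 250.0),
    (3, "East", 75.0),
    (4, "West", 125.0),
]


def to_columns(rows, names):
    """Pivot a list of row tuples into a dict of column lists."""
    return {name: [row[i] for row in rows] for i, name in enumerate(names)}


def total_from_rows(rows, index):
    """Row layout: every full row is visited to read a single field."""
    return sum(row[index] for row in rows)


def total_from_columns(columns, name):
    """Column layout: only the requested column is read."""
    return sum(columns[name])


columns = to_columns(rows, ["sale_id", "region", "amount"])
row_total = total_from_rows(rows, 2)
column_total = total_from_columns(columns, "amount")
```

Both functions return the same total; the difference is how much data each one has to touch. In the column layout, values of the same type and similar content also sit next to each other, which is one reason columnar formats tend to compress well.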
QUESTION NO: 3
You plan to create a Microsoft Azure Data Factory pipeline that will connect to an Azure HDInsight cluster that uses Apache Spark.
You need to recommend which file format must be used by the pipeline. The solution must meet the following requirements:
* Store data in the columnar format
* Support compression
Which file format should you recommend?
Correct Answer: C
QUESTION NO: 4
You deploy a Microsoft Azure SQL database.
You create a job to upload customer data to the database.
You discover that the job cannot connect to the database and fails.
You verify that the database runs successfully in Azure.
You need to run the job successfully.
What should you create?
Correct Answer: A
QUESTION NO: 5
You need to configure the alert to meet the requirements for ETL.
Which settings should you use for the alert? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.

Correct Answer:

Explanation

Scenario: Relecloud identifies the following requirement for extract, transform, and load (ETL): an email alert must be generated when a failure of any type occurs during ETL processing.
QUESTION NO: 6
You use Microsoft Azure Data Factory to orchestrate data movements and data transformations within Azure.
You plan to monitor the data factory to ensure that all of the activity slices run successfully. You need to identify a solution to rerun failed slices. What should you do?
Correct Answer: D




