Certbus > Databricks > Databricks Certifications > DATABRICKS-CERTIFIED-DATA-ANALYST-ASSOCIATE > DATABRICKS-CERTIFIED-DATA-ANALYST-ASSOCIATE Online Practice Questions and Answers

DATABRICKS-CERTIFIED-DATA-ANALYST-ASSOCIATE Online Practice Questions and Answers

Questions 4

Which of the following approaches can be used to connect Databricks to Fivetran for data ingestion?

A. Use Workflows to establish a SQL warehouse (formerly known as a SQL endpoint) for Fivetran to interact with

B. Use Delta Live Tables to establish a cluster for Fivetran to interact with

C. Use Partner Connect's automated workflow to establish a cluster for Fivetran to interact with

D. Use Partner Connect's automated workflow to establish a SQL warehouse (formerly known as a SQL endpoint) for Fivetran to interact with

E. Use Workflows to establish a cluster for Fivetran to interact with

Browse 45 Q&As
Questions 5

Data professionals with varying titles use the Databricks SQL service as the primary touchpoint with the Databricks Lakehouse Platform. However, some users will use other services like Databricks Machine Learning or Databricks Data Science and Engineering.

Which of the following roles uses Databricks SQL as a secondary service while primarily using one of the other services?

A. Business analyst

B. SQL analyst

C. Data engineer

D. Business intelligence analyst

E. Data analyst

Browse 45 Q&As
Questions 6

A data engineering team has created a Structured Streaming pipeline that processes data in micro-batches and populates gold-level tables. The microbatches are triggered every minute.

A data analyst has created a dashboard based on this gold-level data. The project stakeholders want to see the results in the dashboard updated within one minute or less of new data becoming available within the gold-level tables.

Which of the following cautions should the data analyst share prior to setting up the dashboard to complete this task?

A. The required compute resources could be costly

B. The gold-level tables are not appropriately clean for business reporting

C. The streaming data is not an appropriate data source for a dashboard

D. The streaming cluster is not fault tolerant

E. The dashboard cannot be refreshed that quickly

Browse 45 Q&As
Questions 7

Which of the following approaches can be used to ingest data directly from cloud-based object storage?

A. Create an external table while specifying the DBFS storage path to FROM

B. Create an external table while specifying the DBFS storage path to PATH

C. It is not possible to directly ingest data from cloud-based object storage

D. Create an external table while specifying the object storage path to FROM

E. Create an external table while specifying the object storage path to LOCATION

Browse 45 Q&As
Questions 8

A data analyst wants to create a dashboard with three main sections: Development, Testing, and Production. They want all three sections on the same dashboard, but they want to clearly designate the sections using text on the dashboard. Which of the following tools can the data analyst use to designate the Development, Testing, and Production sections using text?

A. Separate endpoints for each section

B. Separate queries for each section

C. Markdown-based text boxes

D. Direct text written into the dashboard in editing mode

E. Separate color palettes for each section

Browse 45 Q&As
Questions 9

After running DESCRIBE EXTENDED accounts.customers;, the following was returned:

Now, a data analyst runs the following command:

DROP accounts.customers;

Which of the following describes the result of running this command?

A. Running SELECT * FROM delta. `dbfs:/stakeholders/customers` results in an error.

B. Running SELECT * FROM accounts.customers will return all rows in the table.

C. All files with the .customers extension are deleted.

D. The accounts.customers table is removed from the metastore, and the underlying data files are deleted.

E. The accounts.customers table is removed from the metastore, but the underlying data files are untouched.

Browse 45 Q&As
Questions 10

A data analyst created and is the owner of the managed table my_ table. They now want to change ownership of the table to a single other user using Data Explorer.

Which of the following approaches can the analyst use to complete the task?

A. Edit the Owner field in the table page by removing their own account

B. Edit the Owner field in the table page by selecting All Users

C. Edit the Owner field in the table page by selecting the new owner's account

D. Edit the Owner field in the table page by selecting the Admins group

E. Edit the Owner field in the table page by removing all access

Browse 45 Q&As
Questions 11

A data analyst runs the following command:

INSERT INTO stakeholders.suppliers TABLE stakeholders.new_suppliers;

What is the result of running this command?

A. The suppliers table now contains both the data it had before the command was run and the data from the new_suppliers table, and any duplicate data is deleted.

B. The command fails because it is written incorrectly.

C. The suppliers table now contains both the data it had before the command was run and the data from the new_suppliers table, including any duplicate data.

D. The suppliers table now contains the data from the new_suppliers table, and the new_suppliers table now contains the data from the suppliers table.

E. The suppliers table now contains only the data from the new_suppliers table.

Browse 45 Q&As
Questions 12

They are using the following incomplete command:

Which of the following lines of code can they use to fill in the blank in the above code block so that it successfully completes the task?

A. array distinct(products)

B. explode(products)

C. reduce(products)

D. array(products)

E. flatten(products)

Browse 45 Q&As
Questions 13

A data analyst has created a user-defined function using the following line of code:

CREATE FUNCTION price(spend DOUBLE, units DOUBLE)

RETURNS DOUBLE

RETURN spend / units;

Which of the following code blocks can be used to apply this function to the customer_spend and customer_units columns of the table customer_summary to create column customer_price?

A. SELECT PRICE customer_spend, customer_units AS customer_priceFROM customer_summary

B. SELECT priceFROM customer_summary

C. SELECT function(price(customer_spend, customer_units)) AS customer_priceFROM customer_summary

D. SELECT double(price(customer_spend, customer_units)) AS customer_priceFROM customer_summary

E. SELECT price(customer_spend, customer_units) AS customer_priceFROM customer_summary

Browse 45 Q&As
Questions 14

How can a data analyst determine if query results were pulled from the cache?

A. Go to the Query History tab and click on the text of the query. The slideout shows if the results came from the cache.

B. Go to the Alerts tab and check the Cache Status alert.

C. Go to the Queries tab and click on Cache Status. The status will be green if the results from the last run came from the cache.

D. Go to the SQL Warehouse (formerly SQL Endpoints) tab and click on Cache. The Cache file will show the contents of the cache.

E. Go to the Data tab and click Last Query. The details of the query will show if the results came from the cache.

Browse 45 Q&As
Questions 15

Which of the following statements about a refresh schedule is incorrect?

A. A query can be refreshed anywhere from 1 minute to 2 weeks.

B. Refresh schedules can be configured in the Query Editor.

C. A query being refreshed on a schedule does not use a SQL Warehouse (formerly known as SQL Endpoint).

D. A refresh schedule is not the same as an alert.

E. You must have workspace administrator privileges to configure a refresh schedule.

Browse 45 Q&As
Questions 16

A data analyst has created a Query in Databricks SQL, and now they want to create two data visualizations from that Query and add both of those data visualizations to the same Databricks SQL Dashboard. Which of the following steps will they need to take when creating and adding both data visualizations to the Databricks SQL Dashboard?

A. They will need to alter the Query to return two separate sets of results.

B. They will need to add two separate visualizations to the dashboard based on the same Query.

C. They will need to create two separate dashboards.

D. They will need to decide on a single data visualization to add to the dashboard.

E. They will need to copy the Query and create one data visualization per query.

Browse 45 Q&As
Questions 17

An analyst writes a query that contains a query parameter. They then add an area chart visualization to the query. While adding the area chart visualization to a dashboard, the analyst chooses "Dashboard Parameter" for the query parameter associated with the area chart.

Which of the following statements is true?

A. The area chart will use whatever is selected in the Dashboard Parameter while all or the other visualizations will remain changed regardless of their parameter use.

B. The area chart will use whatever is selected in the Dashboard Parameter along with all of the other visualizations in the dashboard that use the same parameter.

C. The area chart will use whatever value is chosen on the dashboard at the time the area chart is added to the dashboard.

D. The area chart will use whatever value is input by the analyst when the visualization is added to the dashboard. The parameter cannot be changed by the user afterwards.

E. The area chart will convert to a Dashboard Parameter.

Browse 45 Q&As
Questions 18

A data team has been given a series of projects by a consultant that need to be implemented in the Databricks Lakehouse Platform.

Which of the following projects should be completed in Databricks SQL?

A. Testing the quality of data as it is imported from a source

B. Tracking usage of feature variables for machine learning projects

C. Combining two data sources into a single, comprehensive dataset

D. Segmenting customers into like groups using a clustering algorithm

E. Automating complex notebook-based workflows with multiple tasks

Browse 45 Q&As
Exam Name: Databricks Certified Data Analyst Associate
Last Update: Mar 19, 2025
Questions: 45 Q&As

PDF

$49.99

VCE

$55.99

PDF + VCE

$65.99