Data Engineering Practitioner Exam

What to know:

  • Only Playfair+ Core and Premium members are eligible
  • You will need the Playfair+ Exam Dataset
  • This test is designed to take 60 – 90 minutes
  • You must answer 20 of 25 questions correctly to pass
  • Following the exam, we’ll review your answers and respond within 3 – 5 business days.

Step 1 of 27

Which of the following does not apply to conditional formatting?
You are tasked with taking a dataset to perform some type of analysis, what is the preferred data structure of this dataset?
True or False: If you perform a join on two datasets, you can have more records than you started with.
Which of these data types would best be used to classify the value ‘HELLO’?
The process of using ETL to configure a dataset into the proper output for analysis is a key part of data engineering. What does the term ETL stand for?
Which of the following could be a use case for a Primary Key?
If you are provided a dataset with 5,000,000 rows, which of the following outputs would not work?
APIs are used commonly to extract data from various sources. What does API stand for?
What is the high-level goal of a data pipeline?
What is a data lake and when would it be used?
Which would create more rows in the final dataset, a union or join? (assuming there are no duplicates and both datasets have a one to one relationship)
You have Table A and Table B. You are doing an INNER JOIN on the tables. What will be the result?
What is the result of the following formula when Order Date = 11/8/2021 and Ship Date = 11/11/2021? Order Date >= Ship Date
Which of these rows of data is PII (Personal Identifiable Information)?
Using SQL, how would you select all of the columns in a dataset?
Which of the following cannot be used as a delimiter in a CSV File?
True or False: A NULL and a Blank are the same within a database.
If Postal Code = 42420 and Region = South, which of the following IF statements would produce ‘X’?
If Sales = $261, $731, $958, and $49, which of the following would produce $500?
What is the result of DATEADD(month, 2, ‘2021-11-20’)?
If Order Date = 1/3/2019, 1/4/2019, 1/5/2019, 1/6/2019, how was the data sorted?
Which is not an intended function of the DISTINCT function?
What transformation steps would you use to get each unique category of a dataset and the sum of sales for each category?

The following questions must be answered using the provided dataset. If you haven't already downloaded the dataset, use the link below.

Download here
The following questions must be answered using the provided dataset. If you haven't already downloaded the dataset, use the link below.
Download here
What was the average sales in the Furniture Category for the 2nd best performing region (by total overall sales)?
What is the Order ID for the second largest order (in terms of Profit) on 9/4 in the East Region?

Let us know who is taking the test: