Data Prep Advanced Exam

What to know:

  • Only Playfair+ Core and Premium members are eligible
  • You will need the Proxy dataset
  • This test is designed to take 60 – 90 minutes
  • You must answer 20 of 25 questions correctly to pass
  • Following the exam, we’ll review your answers and respond within 3 – 5 business days.

Step 1 of 26

1. Which is an example of building with scale in mind?
2. What is pseudocode?

3. Which icon commonly denotes a database? (Answer on next screen)

3. Which icon commonly denotes a database?

4. Which icon commonly denotes a filter? (Answer on next screen)

4. Which icon commonly denotes a filter?
5. When working with client data what is NOT something we need to consider?
6. Which law is most relevant when working with data related to student records?
7. Which querying language is most likely to be used to query big data?
8. Which data structure works best for big data?
9. What is the benefit of storing data in a Cube / Parquet?
10. Which dataset would be most likely to be stored in a Cube / Parquet format?
11. What is a benefit of storing data in JSON?
12. What is a drawback of JSON?
13. How can you reduce the size of your data?
14. What issues could be created with an append-only ETL pipeline?
15. What are the benefits of using an append or partial refresh?
16. Why is data governance important?
17. What are the benefits of using a view over a table?
18. Which of the following would be a reason to automate a process?
19. What is oauth?
20. Which regex statement would you use to find email domains?
21. Which regex statement would you use to identify phone numbers in the following format: 123-456-7890?
22. What does this REGEX statement do: (\D{8})?
23. Which REGEX snippet would work best to isolate 12-digit order numbers from open text feedback forms?
24. Which query could be used to find duplicate orders in the Sales Table?
25. Which of these queries would not change the number of rows of the sales table?

Let us know who is taking the test: