# Lecture 38: Certification Overview

In this video, we will go through an overview of the certification exam. By the end, you will be able to:

  - Understand the format and structure of the exam.
  - Describe the topics covered in the exam.
  - Recognize the different types of questions asked on the exam.


## Exam Details

- You have **90 minutes** to complete the certification exam.

- There are **45 multiple choice questions**.

- Your score must be **70%** or better.

  Since 70% of 45 is 31.5, this translates to correctly answering a minimum of **32 out of 45 questions**.

- Each attempt of the certification exam costs **$200**.

  You can retake the exam as many times as you like, but you pay the fee again for each attempt.
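The numbers above can be checked with a few lines of Python. This is only a small sketch of the arithmetic; the constant names and the choice of two attempts are illustrative assumptions, not part of the exam material.

```python
import math

# Figures taken from the exam details above.
TOTAL_QUESTIONS = 45
PASSING_PERCENT = 70
EXAM_MINUTES = 90
COST_PER_ATTEMPT_USD = 200

# 70% of 45 is 31.5, and you cannot answer half a question,
# so the minimum number of correct answers is rounded up.
min_correct = math.ceil(TOTAL_QUESTIONS * PASSING_PERCENT / 100)

# Average time budget per question.
minutes_per_question = EXAM_MINUTES / TOTAL_QUESTIONS

# Total fee for a hypothetical number of attempts (illustrative value).
attempts = 2
total_cost_usd = attempts * COST_PER_ATTEMPT_USD

print(min_correct)           # 32
print(minutes_per_question)  # 2.0
print(total_cost_usd)        # 400
```

In other words, you have about two minutes per question on average, and you can miss at most 13 questions.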



## Exam Topics

The questions expected during the exam will be distributed in the following way:

- **11 questions** out of 45 on the use and the benefits of using the **Databricks Lakehouse platform**.
- **13 questions** on **building ETL pipelines** with **Apache Spark SQL** and **Python**.
- **10 questions** on **processing data incrementally**.
- **7 questions** on building **production pipelines**, in addition to **Databricks SQL queries** and **Dashboards**.
- Lastly, **4 questions** on **data governance and security practices**.

We have fully covered all these topics during our course.
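To see how much weight each topic carries, the distribution above can be turned into approximate percentages. The dictionary keys below are shortened labels chosen for this sketch, not official section names.

```python
# Question counts per topic, as listed above (labels are shortened for this sketch).
questions_per_topic = {
    "Databricks Lakehouse platform": 11,
    "ETL with Spark SQL and Python": 13,
    "Incremental data processing": 10,
    "Production pipelines, Databricks SQL, dashboards": 7,
    "Data governance and security": 4,
}

total_questions = sum(questions_per_topic.values())

# Share of the exam per topic, rounded to the nearest whole percent.
share_percent = {
    topic: round(count / total_questions * 100)
    for topic, count in questions_per_topic.items()
}

for topic, share in share_percent.items():
    print(f"{topic}: {questions_per_topic[topic]} questions (~{share}%)")
```

The counts add up to the 45 questions of the exam, and the first three topics together account for roughly three quarters of it.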



### Topics Not Required for the Exam

The following topics are **not** expected on the Associate-level Data Engineer exam:

- **Spark internals**, the **Databricks CLI** and **API**, **Change Data Capture**, **modeling concepts**, and **data protection regulations**.
- **Monitoring and logging production jobs**, **dependency management**, and **testing**.



## Coding and Exam Platform

### Coding

During the exam, **data manipulation code** will always be provided in **SQL** when possible.

In all other cases, the code will be in **Python**.



### Exam Platform

**Databricks certifications** are taken through the **Webassessor platform**: https://www.webassessor.com/databricks

You just need to sign up in order to schedule and take your exam.



### Exam Proctoring

- During the exam, you will be monitored via webcam by a **Webassessor Proctor**.

- You will be asked to provide a **valid photo-based identification**.

- The proctor will monitor you during the exam and can provide technical support if needed.

  However, they will not provide assistance on the content of the exam.

- In addition, there will be **no test aids** during the exam.

### Exam Results and Certification

The certification exams are automatically graded, and you will receive your pass or fail grade immediately.

However, you will receive the **badge** and **certificate** within **24 hours** of passing the exam.

They are delivered via: https://credentials.databricks.com



## Types of Questions

Remember, all the exam questions are **multiple choice questions**.
This means there is only **one correct answer** for each question.

The exam has two types of questions: **conceptual** and **code-based** questions.

  - **Conceptual questions** usually ask for definitions.

    Example: *What is something, or what does it do?*

    **Question example:** Which part of the Databricks Lakehouse platform can data engineers use to orchestrate jobs?

    - **Options:**
      - Repos
      - Workflows
      - Data Explorer
      - Databricks SQL
      - Cluster

    **Answer:** Workflows.

    In the **Workflows** tab, you can create and orchestrate jobs in Databricks.

  - **Code-based questions** could ask you, for example, to identify an **error** in a code block or to **complete** a code block by filling in the blanks.

    **Question example:** Here, you have a code block of a **stream write operation** with a missing configuration.

    ```python
    (spark.table("sales")
        .writeStream
        .option("checkpointLocation", checkpointPath)
        ._____
        .table("new_sales")
    )
    ```

    The question asks:

    If you want the query to execute a single micro-batch to process all of the available data, which of the following lines of code should you use to fill in the blank?

    - `trigger(once=True)`
    - `trigger(continuous="once")`
    - `processingTime("once")`
    - `trigger(processingTime="once")`
    - `processingTime(1)`

    **Answer:** `trigger(once=True)`

    With `trigger(once=True)`, you execute a **single micro-batch** to process all of the available data, as we saw in the **Structured Streaming lecture**.



## Practice Test

Lastly, **Databricks** offers a **practice test** for the Associate-Level Data Engineer exam.

This practice test is available in [**PDF format**](../../assets/resources/PracticeExam_DatabricksDataEngineerAssociate.pdf).

Let us take a look at it.

Here is the **official practice test** from Databricks.
It contains **45 questions**.

These questions are representative of those on the actual exam, but they have been retired and no longer appear on it.

Once you have completed the practice exam, evaluate your score using the **correct answers** at the end of the document.



## Final Advice

My **last advice** before you take the exam is to **practice by yourself** with all the notebooks that we saw during the course.

I wish you all the best in your certification exam, and please let me know about your results via **LinkedIn**.