# Welcome to the Databricks UI

In this exercise, you will get your first taste of the Databricks UI and start to explore the features on offer!

Sierra Publishing, a global book publishing company, has decided to introduce Databricks to better manage and use its data. As a data analyst working for Sierra Publishing, you know you will work with Databricks for many future projects.

The IT department has provided you with your own access to Databricks. To get comfortable with the new tool, it's time to explore the UI.

**Instructions**

1. You will now see the home page for Databricks! On the left-hand side of the screen, you will notice the main menu displaying the different areas of the platform. These are organized by the kind of process you are carrying out on the platform. 
    - Click on the Catalog section to check out where you can explore all data connected to the Databricks environment.

2. In order to use Databricks for your Sierra Publishing projects, you will need to bring data into the platform from different sources or upload files directly to Databricks. 
    - Click on the Data Ingestion button and review the different sources that can be used to ingest data.

3. As the data analyst at Sierra Publishing, you know that the company stores data in various sources and file formats. You have to make sure that the sources with Sierra Publishing data are all available for use in Databricks. Which data sources can be used to ingest data into Databricks?

- Azure Blob Storage ❌
- MySQL ❌
- Google Ads ❌
- All of the above ✅

# Why pick a Data Intelligence Platform

The Chief Information Officer at Sierra Publishing was the main champion of the Databricks platform and pitched the idea to your company's board of directors. He has asked you to help him communicate the benefits of this decision to other colleagues.

**Instructions**

Which THREE of the following reasons help explain the benefits of adopting the Data Intelligence Platform?

- You want great performance for both BI and machine learning workloads. ✅
- You want a single architecture to support all of your data workloads. ✅
- You want a proprietary technology stack that limits your integrations and workload support. ❌
- You want the most cost-effective architectural solution for your organization. ✅

# Databricks for all users

In this exercise, you will explore the platform and learn about some of the capabilities it offers different data personas.

The CTO was impressed with your previous support in helping other department members understand the benefits of Databricks. As a result, they asked you to help describe to the different data teams in your organization how the platform can help them.

This will require you to run some SQL, and your CTO has provided the specific query they want you to run in Databricks:

`SELECT * FROM samples.nyctaxi.trips LIMIT 100` 

**Instructions**

1. The data engineers and data scientists in the company like to use notebooks to create and run their code. Databricks provides a notebook feature, which is an all-purpose coding option for any supported language in Databricks. 
    - Click on the Workspace menu on the left-hand side of the screen.
    - Then, click on Create and select the Notebook option to view the notebook feature.
2. Data scientists and machine learning engineers ideally want a space to test their large language models (LLMs) side-by-side. Databricks has the Playground section for this.
    - To view this feature, click on the Playground option on the left-hand menu, under the Machine Learning section. You might need to scroll down the menu to see it.
3. As a data analyst, if you want to run SQL code to query and work with data in Databricks, the SQL Editor is the feature to use.
    - Click on the SQL Editor menu on the left-hand side of the screen.
    - In the new query section, type out the following code and hit the Run button to see the results.
    `SELECT * FROM samples.nyctaxi.trips LIMIT 100` 
4. In the results section, what is the first value displayed for the column trip_distance?
    `1.4`
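
As a small variation on the query above, the following sketch returns only the `trip_distance` column, which can make the answer to step 4 easier to spot. It assumes the `samples` catalog is available in your workspace. Because the query has no `ORDER BY`, the row order (and therefore the first value shown) is not strictly guaranteed.

```sql
-- Same sample table as in the exercise, restricted to the column
-- that step 4 asks about.
SELECT trip_distance
FROM samples.nyctaxi.trips
LIMIT 100;
```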


# Control Plane and Compute Plane relationship

**Instructions**

Which of the following statements is the most accurate analogy of the Databricks Control Plane and Compute Plane?

- The Control Plane is the "muscles" of the platform, and houses all of the physical entities of the platform. The Compute Plane is the "brains" of the platform, controlling all processing and creating clusters. ❌
- The Control Plane and the Compute Plane perform similar functions but in different cloud environments. ❌
- The Control Plane is the "brains" of the platform, controlling all processing and creating clusters. The Compute Plane is the "muscles" of the platform, and houses all of the physical entities of the platform. ✅
- The Control Plane and the Compute Plane are not related at all, and operate totally independently. ❌

# Databricks and external systems

Your finance team is worried about adopting Databricks and the Data Intelligence Platform, as they rely on a BI application to do their work. One member of the finance team gave you this response:

"Databricks can only connect to files stored in the data lake. It cannot connect to other systems or applications I own in the cloud."

**Instructions**

Which of the following is the most accurate response to this statement?

- Correct, Databricks can only connect to our data in the data lake, so we will have to migrate all of our data there. ❌
- Actually, Databricks can connect to most, if not all, of our other data systems, not just the data in the data lake. ✅
- Databricks can connect to external systems, but it is hard and the lakehouse architecture is not recommended for us. ❌
- Databricks can either connect to data in a data lake, or to other data systems, but cannot do both at the same time. ❌

# Different types of administrators

You have created a new implementation of Databricks, and want to start working with your team to better run the environment. Since you will need their administrative help, you want to make sure you assign them the right privileges for their responsibilities.

**Instructions**

Which statement best reflects the difference between an Account Admin and a Workspace Admin?

- Account Admins and Workspace Admins have the same privileges, but Workspace Admins only focus on a particular workspace. ❌
- Account Admins have permissions to the entire account and back-end configurations, while Workspace Admins have permissions for the operations within a specific workspace. ✅
- There is no difference between the two kinds of Admins in Databricks. ❌
- Workspace Admins have permissions to the entire account and back-end configurations, while Account Admins have permissions for the operations within a specific workspace. ❌

# The marketplace

In this exercise, you will explore the marketplace feature in Databricks that allows you to discover and acquire third-party datasets.

When selecting which new books to publish and how, Sierra Publishing carries out different types of research activities, which include acquiring third-party datasets. The CFO has made it clear that this has not always been an easy process for the company. You decide to show the CFO how easy it will be using Databricks Marketplace.

The Databricks Marketplace is one area where Databricks can connect with external data providers. Other companies can provide datasets, models, notebooks, and many other data assets for consumption in your Databricks environment.

**Instructions**

1. Navigate to the Databricks Marketplace section of the platform.
2. The company is publishing some new books on financial markets. You know other departments spent a lot of time researching important market conditions.
    - In the Search for products bar, search for "S&P" assets.
3. Several different products appear related to different financial and market topics.
    - Scroll through the search results and select the "Crude Supply Risk" product to learn more about this asset.
    - Do not select "Request access".
4. When viewing a product in the Marketplace, you can see a description of the asset you have selected, what type of asset it is, and any access and pricing information. The CFO liked the Crude Supply Risk asset that you found. What is the update frequency of this asset?
    - Every day
    - Every week
    - Every month

# Connecting to partners

In this exercise, you'll explore Databricks Partner Connect, which simplifies integrating external systems.

Sierra Publishing depends on various technologies for research and operations. The CTO recognizes that Partner Connect makes integration seamless.

You can easily link Databricks to your preferred external systems with built-in connectors.

**Instructions**

1. The Partner Connect feature is in the Marketplace section of Databricks.
    - Navigate to the Marketplace portion of the platform to locate the Partner Connect feature.
2. Within the Marketplace, locate the Partner Connect integrations, and select the option to view all the different types of integrations.
3. Partner Connect simplifies integrating external systems with Databricks using built-in connectors for seamless functionality.
    - Navigate to the BI and visualization section within the Partner Connect page.
4. How many options are available for external systems within the BI and visualization section? `7`