# Week 1

Business intelligence, or BI,
involves automating processes and information channels in order to transform relevant data into actionable insights that are easily available to decision-makers. In other words, by showing decision-makers what is currently happening, organizations become more intelligent and successful. 

ETL stands for extract, transform, and load.
It's all about getting data where and how you need it
so you can analyze and monitor it. 


## Week 1: Data-driven results through business intelligence

Begin your journey into business intelligence! Find out what’s in store for Course 1 and the whole certificate program. You will explore the Coursera platform, procedures, and content types, and meet other learners in the program. Then, you will begin learning about the BI industry and the roles of BI analysts and engineers. You will be introduced to BI tools and techniques for making business decisions and improving processes. Finally, you will learn the similarities and differences between the two fields of BI and data analytics. 

### Expectations for First Course

What you will learn:

- Business intelligence’s place in the data ecosystem and key distinctions between business intelligence and data analysis
- Typical career paths and the industries that use business intelligence
- Common roles and responsibilities of industry professionals
- The stages of business intelligence, plus key terms and concepts
- Strategies for ensuring metrics are relevant and effective
- Successful business intelligence practices and processes

Skill sets you will build:

- Proven approaches to business intelligence projects
- Identifying the uses and benefits of data models, pipelines, data visualizations, and dashboards
- Collaboration with team members, stakeholders, and users, as well as an understanding of their roles and priorities
- Planning processes for a successful project
- Networking to tap into the business intelligence community
- Demonstration of key knowledge and skills during a job search

### Orgs that benefit from BI

BI analysts gather requirements from stakeholders,
partners, and team members.
They use their understanding of
large datasets to retrieve,
organize, and interpret data.
Then they create visualizations, dashboards,
and reports to use when
presenting and communicating insights to others.
The intelligence they share
might be used to make decisions,
develop a new process,
or create a business strategy,
or it could be applied to even deeper analysis. 

BI engineers are responsible for designing,
configuring, implementing,
and improving data tools and processes.
They evaluate and streamline various devices,
infrastructures, and
information channels called pipelines.
Engineers are excellent troubleshooters and
help find solutions to security issues,
application permissions and updates,
and other technical challenges. 

In other words, a BI engineer
manages tools and processes,
which enables a BI analyst
to put those tools and processes to work.
However, it's also important to note
that some companies don't distinguish
between those two BI jobs, using
the terms analyst and engineer interchangeably. 

**Business intelligence** involves automating processes and information channels in order to transform relevant data into actionable insights that are easily available to decision-makers. 

### Fictitious Scenario

Consider a fictitious scenario about a fast-food restaurant chain. Leaders at this company have huge amounts of data to manage, such as:

- customer transactions

- marketing data related to promotions

- customer satisfaction

- employee information

**The problem**

The restaurants need to have ingredients to cook and serve customers, but if they have too much, that extra stock often goes to waste. Company leadership consults with their BI team to consider how to approach two concerns:

- How to ensure the restaurants’ numerous locations have enough ingredients to meet customer demand

- How to reduce food waste


**The solution**

In order to address the stakeholders’ needs, the BI team spends time gathering information about current metrics and processes. They first use this information to determine what data they have and how it’s being used. They discover that there are already useful metrics being applied in other ways by various teams in the company, including:

- How many ingredients are delivered to each location

- How much of each menu item is made each day

- How much of each menu item is actually being ordered each day

By comparing these existing metrics, the company can better understand how much food is going to waste. Thus, the BI analysts are able to gather the necessary information on incoming food delivery, customer orders, and food consumption in the form of a dashboard for stakeholders to monitor food waste. The BI analysts then organize this data within the database systems and deliver it to new tables that report the results for stakeholders to consider as they strategize how to reduce food waste. 
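
The comparison of existing metrics can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the item names and numbers are made up, and waste is approximated as units made but never ordered, which is a simplification of what a real BI team would measure.

```python
# Hypothetical daily figures for one location (made-up numbers for illustration).
daily_metrics = [
    {"menu_item": "fries", "units_made": 500, "units_ordered": 420},
    {"menu_item": "burger", "units_made": 300, "units_ordered": 290},
    {"menu_item": "salad", "units_made": 120, "units_ordered": 110},
]


def waste_report(rows):
    """Return one row per item with wasted units and waste percentage, worst first."""
    report = []
    for row in rows:
        # Assumption: anything made but not ordered counts as waste.
        wasted = row["units_made"] - row["units_ordered"]
        pct = round(100 * wasted / row["units_made"], 1)
        report.append(
            {"menu_item": row["menu_item"], "units_wasted": wasted, "waste_pct": pct}
        )
    # Sort so the largest source of waste is listed first.
    return sorted(report, key=lambda r: r["waste_pct"], reverse=True)


report = waste_report(daily_metrics)
for line in report:
    print(line)
```

A dashboard built on the same idea would recompute these figures automatically as new delivery and order data arrives, rather than on request.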


**The results**

Knowing how much food is actually going to waste now enables stakeholders to better achieve their goals. The restaurant chain discovers that the largest source of food waste is the French fries. Across their locations, 10–20% of French fries are left over at the end of the month.

### People BI Works With

**API Pros**

First up, API professionals. API stands for application programming interface.
This is a set of functions and procedures that integrate computer programs,
forming a connection that enables them to communicate.
Many different roles involve APIs, including
API strategist, developer, engineer, and
product owner.
When you're working in business intelligence, you might collaborate with
API professionals in order to create the interface that you need for a particular
project, especially if some of your data is coming from a third-party platform.
APIs bring that data into the internal company database in order to build
reporting tools and dashboards.

**Data Warehousing Pros**

You may work with data warehousing specialists who develop processes and
procedures to effectively store and organize data.
These people also help ensure BI professionals can easily access the data
they need. 

**Data Governance Pros**

These team members are responsible for
the formal management of an organization's data assets.
This may involve managing the availability,
integrity, and security of data based on internal standards and policies.
This is very important for making sure data is trustworthy and
doesn't get misused or corrupted. 

**Data Analysts**

Of course, data analysts are key partners, as they collect, transform, and
organize data.
They are dataset specialists reviewing and validating the data all the time.
They also identify and implement exciting new analysis methodologies. 

**IT Pros**

Information technology professionals test, install, repair, upgrade, and maintain
the hardware and software solutions that organizations use every single day.
BI professionals work with the
IT department in order to maximize all of
the available data and data tools. 

**Project Managers**

In addition, project managers are key collaborators, as they handle the
day-to-day project steps, scope, schedule, budget, resources, and much more. 



## BI Structures and Stages

### Differences between BI and DA
First, let's establish that both BI and DA
professionals enable
data-driven decision-making in their organizations.
They're also both a key part
of their company's data maturity. 

Data maturity is the extent
to which an organization is able to
effectively use its data in
order to extract actionable insights.
Whether that data includes
information about your customers,
staff, products, suppliers, finances, or something else,
it's worthless if it can't be put to work. 

Data analysts are primarily
involved with using data to
answer questions about what happened.
On the other hand, BI professionals
aim to achieve higher levels of
data maturity by building data
reporting tools such as dashboards. 

Dashboards continuously analyze and monitor data.
The work isn't complete just
because the tool has been created.
The analysis and monitoring are
ongoing. This is because BI involves
establishing repeatable methods to understand how things
are operating. By knowing the current state,
company leaders can take
action to improve the future state. 

BI is all about near-real-time rapid monitoring.
BI insights are most effective
when they make an impact right now.
Therefore, the people who use
the reporting tools want to make
sure they are practical and efficient.
BI professionals are expert tool builders. 

These tools must be impactful and easy to interpret,
even for non-technical people.
If you like thinking about what you can
create that meets the needs of many different users,
BI will let you do just that. 



In [2]:
import pandas as pd

# dict of table in course
W1_DA_BI_dict = {'Data Tasks':[
                    "Involvement",
                    "Answering questions",
                    "Gathering data",
                    "Storage systems",
                    "Descriptive and predictive analytics",
                    "Presenting insights",
                    "Iteration"
                    ],
                'DA':[
                    "The first step of the data analysis cycle is to define the business problem and establish stakeholder expectations." ,
                    "Data analysts are often tasked with deciding what data they need to answer their stakeholders’ questions and gathering that data for use",
                    "Once data has been gathered, data analysts must ensure that it is clean and ready for use. They also perform transformations on the data to prepare it for analysis.",
                    "Data analysts must adhere to organization conventions and store historical data for analysis.",
                    "Data analysis focuses on descriptive analysis that describes historical trends.",
                    "After analysis, data analysts present their findings to inform the stakeholders’ ultimate decision.",
                    "After the initial analysis, data analysts may repeat their analysis based on their findings or new information."
                    ],
                "BI":[
                    "In addition to defining business problems and asking questions to establish expectations, BI professionals observe current processes to determine how they can be improved to align more with stakeholder needs.",
                    "BI professionals evaluate the data needs of their stakeholders, identify necessary sources, and design pipeline systems that automatically and continuously gather that data for stakeholders to access.",
                    "BI professionals build tools that clean and transform data automatically within a pipeline so that these processes occur to all data being ingested by the pipeline process.",
                    "BI professionals develop storage systems that allow intake from multiple source systems into a destination database, while governing the database schema and optimizing the system.",
                    "BI uses analysis of historical trends to perform predictive analytics that enable organizations to determine likely future trends and act accordingly.",
                    "BI analysts create tables, reports, and dashboards that empower stakeholders with access to the data they need to inform their whole decision-making process.",
                    "BI analysts continue to iterate on processes to improve and optimize the systems and tools they have built to ensure they continue to be useful for stakeholders."
                ]                
                 }

df = pd.DataFrame(W1_DA_BI_dict)
df.to_csv("W1_DA_vs_BI.csv", index=False)
df

| Unnamed: 0 | Data Tasks | DA | BI |
|---|---|---|---|
| 0 | Involvement | The first step of the data analysis cycle is t... | In addition to defining business problems and ... |
| 1 | Answering questions | Data analysts are often tasked with deciding w... | BI professionals evaluate the data needs of th... |
| 2 | Gathering data | Once data has been gathered, data analysts mus... | BI professionals build tools that clean and tr... |
| 3 | Storage systems | Data analysts must adhere to organization conv... | BI professionals develop storage systems that ... |
| 4 | Descriptive and predictive analytics | Data analysis focuses on descriptive analysis ... | BI uses analysis of historical trends to perfo... |
| 5 | Presenting insights | After analysis, data analysts present their fi... | BI analysts create tables, reports, and dashbo... |
| 6 | Iteration | After the initial analysis, data analysts may ... | BI analysts continue to iterate on processes t... |


## Phases of BI

The data life cycle is a sequence of
stages that data experiences,
which include plan, capture,
manage, analyze, archive, and destroy.
The data analysis process occurs in six phases:
*ask, prepare, process, analyze, share, and act.*


In BI, the three stages are
*capture, analyze, and monitor*.
As you advance through each one,
the process requires a deeper level of
exploration and investigation,
so each becomes more complex. The business intelligence phases of capture, analyze, and monitor help determine BI’s business value, as well as an organization’s data maturity level. Data maturity is the extent to which an organization is able to use its data in order to extract actionable insights.


### Capture
Let's start with capture,
which is the "what happened" point in the BI process.
This stage pertains
to static backward-looking data. For instance,
if you were to query a database to return a data set
about what your customers purchased last month,
that's an example of capturing information. Or you
might access a spreadsheet that lists
profits and losses from the previous quarter.
That's also an example of
capturing a record of certain information. 
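
As a minimal sketch of the capture stage, the query below pulls one past month of purchases from a database. The `purchases` table, its columns, and the sample rows are hypothetical and exist only for illustration; an in-memory SQLite database stands in for whatever system an organization actually uses.

```python
import sqlite3

# In-memory database with a hypothetical purchases table and made-up rows.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE purchases (customer TEXT, item TEXT, purchased_on TEXT)")
conn.executemany(
    "INSERT INTO purchases VALUES (?, ?, ?)",
    [
        ("ana", "fries", "2024-02-27"),
        ("ben", "burger", "2024-03-03"),
        ("ana", "salad", "2024-03-18"),
        ("cy", "fries", "2024-04-02"),
    ],
)

# Capture: a static, backward-looking snapshot of one past month (March 2024 here).
last_month = conn.execute(
    "SELECT customer, item, purchased_on FROM purchases "
    "WHERE purchased_on >= ? AND purchased_on < ? ORDER BY purchased_on",
    ("2024-03-01", "2024-04-01"),
).fetchall()
conn.close()

print(last_month)
```

The result answers "what happened" for a fixed period and does not change once that period has passed, which is what distinguishes capture from monitoring.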

### Analyze
Analyze is the "why did it happen" part of BI.
You already know a lot about this stage,
but as a quick reminder,
data analysis is when we draw conclusions,
make predictions, and drive informed decision-making.
Because the analyze stage explores why things happen,
it's much more likely to result in
effective plans and strategies.
In this way, it enables BI professionals to
better understand key relationships between data points.
They do this by examining data more deeply, more broadly,
or even side-by-side, in order to identify
connections that at first might not have been apparent. 

### Monitor

Monitor is the "what's happening now" stage,
and here's where BI professionals
really get to show off their unique skills and expertise.
In this stage, you use
automated processes and information channels,
such as data models,
ETL pipelines, dashboards, and more.
These exciting tools take
the data that organizations create
every single day and transform it into
true intelligence that is
easily available to stakeholders. 


## How BI Makes an Impact

### BI Strategy

Business intelligence strategy is the management of the people,
processes, and tools used in the business intelligence process.
As you're discovering, BI is complex and
it requires a 360-degree view of the organization. 

The first element of BI strategy is people. Questions to ask include: Is everyone coordinated? Are the right questions about the vision laid out from the start? What is the desired outcome?

The second element of BI strategy is process.
By now, you've established who will be responsible for the rules and
policies that govern BI processes.
So some questions to ask these people include,
what solutions are we using and how?
Which of them bring value?
What types of solutions do we plan to implement?
How will we deliver them?
How will we support them?
In addition,
having a user support framework is a critical part of the process.
So be sure to dedicate enough time to training and education,
a feedback system, and ensuring users are getting value from the tools. 

The third element is tools. One of the most important concepts to note here is to choose each tool with
the user in mind.
Consider which dashboards, reports and other solutions will be most effective.
Ask, do different users, teams and departments require different
technologies? Which technologies do we have access to?
Can we gain access to others if needed?
How will we measure success?
Here you want to establish key performance indicators or KPIs for
each particular business need.

### KPI

A KPI is a quantifiable value,
closely linked to business strategy, which is used to track progress towards a goal.
In other words, KPIs direct you towards achieving your desired results.
But in order for KPIs to do their job, it's important that the tools you select
align with the KPIs established for each particular project. 
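
To make "track progress towards a goal" concrete, here is a small sketch of a KPI as a data structure. The `KPI` class, its field names, and the sample values are assumptions made for illustration; they do not come from any particular BI tool.

```python
from dataclasses import dataclass


@dataclass
class KPI:
    """A hypothetical KPI record: one quantifiable value tracked against a goal."""

    name: str
    baseline: float  # value when tracking started
    target: float    # value the business wants to reach
    current: float   # latest measured value

    def progress(self) -> float:
        """Fraction of the way from baseline to target, clamped to [0, 1]."""
        span = self.target - self.baseline
        if span == 0:
            return 1.0
        return max(0.0, min(1.0, (self.current - self.baseline) / span))


# Made-up example: reduce fries waste from 20% to 10%; it currently sits at 16%.
waste_kpi = KPI(name="fries waste %", baseline=20.0, target=10.0, current=16.0)
print(f"{waste_kpi.name}: {waste_kpi.progress():.0%} of the way to target")
```

Measuring progress relative to a baseline lets the same calculation work for goals that increase a value and goals that decrease it.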

The last step in this process is documenting everything you've learned.
Many BI professionals use specific BI documents to note stakeholder
requirements, project requirements, and strategy.
These are essential tools that can really help you capture the big picture,
stay organized, and make an impact at your organization. 

## Necessary Documents
Each of these documents builds on the previous one. Instead of three separate documents, think about them as three phases of your project planning process.


### Stakeholder Requirements Doc
The Stakeholder Requirements Document enables you to capture stakeholder requests and requirements so you understand their needs before planning the rest of the project details or strategy. It should answer the following questions:

- Business problem: What is the primary question to be answered or problem to be solved?

- Stakeholders: Who are the major stakeholders of this project, and what are their job titles?

- Stakeholder usage details: How will the stakeholders use the BI tool?

- Primary requirements: What requirements must be met by this BI tool in order for this project to be successful?

Here are some questions BI professionals ask in order to successfully complete this document:

- What questions must be answered before starting this project?

- What does the BI team need to know before starting this project?

- What are the questions that must be answered/problems that must be solved by this project?

- What datasets are considered important to this project?

- Who should have access to the dashboard? Will the entire dashboard be visible to all stakeholders?

Typically, the Stakeholder Requirements Document is a one-pager with notes, but it can be longer and more detailed for complex projects.


### Project Requirements Doc

Once you have established the stakeholder requirements, you can start thinking about the project requirements that need to be met to achieve the stakeholder requirements. The Project Requirements Document contains the following details:

- Purpose: Briefly describe why this project is happening and explain why the company should invest its resources in it.

- Key dependencies: Detail the major elements of this project. Include the team, primary contacts, and expected deliverables. Are there any inter-team deliverables required?

- Stakeholder requirements: List the established stakeholder requirements, based on the Stakeholder Requirements Document. Prioritize the requirements as: R - required, D - desired, or N - nice to have.

- Success criteria: Clarify what success looks like for this project. Include explicit statements about how to measure success. Use SMART criteria.

- User journeys: Document the current user experience and the ideal future experience.

- Assumptions: Explicitly and clearly state any assumptions you are making.

- Compliance and privacy: Include compliance, privacy, or legal dimensions to consider.

- Accessibility: List key considerations for creating accessible reports for all users. Who needs to access this feature? How are they viewing and interacting with it?

- Roll-out plan: Briefly describe the expected scope, priorities, and timeline. Consider at what points during the rollout measurements will be made to determine whether the feature is performing as expected. Is there a rollback plan and timeline if this feature does not meet its intended goals?

In addition, some companies will ask you to include a list of references. If so, it’s a best practice to be liberal in citing references; you can never have too many. References might include:

- Documents or websites you read and researched while working on this project

- Laws and policies: Any regulations driving the project requirements

- Project tracking: A link to a tracking spreadsheet, bug hotlist, etc.

- Similar projects: A description of anything similar that has been attempted in the past or any parallel efforts.

Similar to the Stakeholder Requirements Document, the Project Requirements Document will vary depending on the complexity of the project. It might just be an email sent out to stakeholders to keep them updated on expectations and check-in points, or it could be a multi-page document with a spreadsheet that outlines the project plan and key tasks.

### Strategy Doc
Finally, you will create a Strategy Document for your project. This is the final phase of the planning process. The Strategy Document is a collaborative place to align with stakeholders about project deliverables. You will work together to establish information about dashboard functionality and associated metrics and charts.

This is a time to flesh out what metrics will be required, how metrics are calculated, and any limitations or assumptions that exist about the data. Stakeholders think through these details and help the BI professional make final project decisions. Then, the BI professional provides stakeholders with a dashboard mockup to get valuable feedback.

Generally, the BI professional will create the document and request review and sign-off from important stakeholders. Then they can begin working on the project with all of the details they need.

### BI Pro Toolbox

In BI, mapping a route requires a *data model*,
which is the first tool in your toolbox.
Data models organize data elements and how they relate to one another.
They help keep data consistent across systems and
explain to users how the data is organized.
This gives BI professionals clear directions when navigating a database. 
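
One way to picture a data model is as a set of tables plus the relationships between them. The sketch below is an assumption-laden illustration: the `locations`, `menu_items`, and `orders` tables and their columns are hypothetical, and SQLite is used only because it ships with Python.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # ask SQLite to enforce relationships

# Hypothetical model: each order points at one location and one menu item.
conn.executescript("""
CREATE TABLE locations (location_id INTEGER PRIMARY KEY, city TEXT NOT NULL);
CREATE TABLE menu_items (item_id INTEGER PRIMARY KEY, name TEXT NOT NULL);
CREATE TABLE orders (
    order_id INTEGER PRIMARY KEY,
    location_id INTEGER NOT NULL REFERENCES locations(location_id),
    item_id INTEGER NOT NULL REFERENCES menu_items(item_id),
    quantity INTEGER NOT NULL
);
INSERT INTO locations VALUES (1, 'Springfield');
INSERT INTO menu_items VALUES (1, 'fries');
INSERT INTO orders VALUES (1, 1, 1, 3);
""")

# The relationships in the model tell a user how to join the tables.
rows = conn.execute("""
    SELECT l.city, m.name, o.quantity
    FROM orders AS o
    JOIN locations AS l ON l.location_id = o.location_id
    JOIN menu_items AS m ON m.item_id = o.item_id
""").fetchall()

# An order that points at a location that does not exist is rejected,
# which is one way a model helps keep data consistent.
try:
    conn.execute("INSERT INTO orders VALUES (2, 99, 1, 1)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True
conn.close()

print(rows, rejected)
```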

The second tool in your toolbox is the *data pipeline*.
A data pipeline is a series of processes that transports data from different
sources to their final destination for storage and analysis.
Think of the data pipeline as train tracks, spanning, passing, and
crossing over vast distances.
Data is transported along these channels in a smooth automated flow from
original sources to target destination. 
*ETL* is a type of data pipeline that enables
data to be gathered from source systems, converted into a useful format,
and brought into a data warehouse or other unified destination system.
The process of ETL plays a key role in data integration because it
enables BI professionals to take data from multiple sources,
consolidate it, and get all that data working together. 
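
The extract, transform, and load steps described above can be sketched as three small functions. Everything specific here is hypothetical: the two sources (`POS_CSV` and `APP_ORDERS`), their field names, and the `orders` destination table are invented for illustration, and an in-memory SQLite database stands in for a data warehouse.

```python
import csv
import io
import sqlite3

# Two hypothetical source systems that describe orders in different formats.
POS_CSV = "item,qty\nfries,3\nburger,1\n"
APP_ORDERS = [{"product": "Fries", "quantity": "2"}]


def extract():
    """Gather raw rows from each source system."""
    pos_rows = list(csv.DictReader(io.StringIO(POS_CSV)))
    return pos_rows, APP_ORDERS


def transform(pos_rows, app_rows):
    """Convert both sources into one shared format: (item, qty, source)."""
    unified = [(r["item"].lower(), int(r["qty"]), "pos") for r in pos_rows]
    unified += [(r["product"].lower(), int(r["quantity"]), "app") for r in app_rows]
    return unified


def load(rows, conn):
    """Write the unified rows into the destination table."""
    conn.execute("CREATE TABLE IF NOT EXISTS orders (item TEXT, qty INTEGER, source TEXT)")
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(transform(*extract()), conn)

# Once consolidated, data from both sources can be analyzed together.
totals = conn.execute(
    "SELECT item, SUM(qty) FROM orders GROUP BY item ORDER BY item"
).fetchall()
conn.close()

print(totals)
```

Keeping each step in its own function means one step can be changed or tested without touching the others.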

Our third tool is *data visualizations*.
You likely know that data visualization is the graphical representation of data.
Some popular data viz applications are Tableau and Looker.
These apps make it possible to create visuals that are easy to understand and
tell a compelling story. 

Our fourth tool is a *dashboard*: an interactive visualization tool that
monitors live incoming data. Picture the dashboards
used by train drivers. They pay close attention to these tools
in order to constantly observe the status of the train engine and other important equipment.
Dashboards keep the drivers connected with the control center to ensure that
routes are clear and signals are functioning properly. 

Iteration involves repeating a procedure
over and over again, in order to keep getting closer to the desired result. 

### Technology and Best Practices

- Modular Design: 

Modular design principles can enable the development of individual pieces of a pipeline system so they can be treated as unique building blocks. Modular design also makes it possible to optimize and change individual components of a system without disrupting the rest of the pipeline. In addition, it helps users isolate and troubleshoot errors quickly. 
Other best practices related to modular design include using version control to track changes over time and undo them as needed. Also, BI professionals can create a separate development environment to test and review changes before implementing them.  

- Verify data accuracy and integrity:  

It’s necessary to ensure the accuracy and integrity of the data, no matter what tools you are using to construct the system. Some important things to consider about the data in your pipelines are:

- Completeness: Is the data complete?

- Consistency: Are data values consistent across datasets?

- Conformity: Do data values conform to the required format?

- Accuracy: Do data values accurately represent actual values?

- Redundancy: Are data values redundant within the same dataset?

- Integrity: Are data values missing important relationships?

- Timeliness: Is the data current?
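
A few of these checks can be automated. The sketch below covers only completeness, conformity, and redundancy, and it rests on assumptions: the record layout, the `quality_report` helper, and the expected `YYYY-MM-DD` date format are all hypothetical.

```python
import re

# Made-up records with deliberate problems for illustration.
records = [
    {"order_id": "A1", "item": "fries", "order_date": "2024-03-03"},
    {"order_id": "A2", "item": "", "order_date": "2024-03-04"},       # missing item
    {"order_id": "A3", "item": "salad", "order_date": "03/05/2024"},  # wrong date format
    {"order_id": "A1", "item": "fries", "order_date": "2024-03-03"},  # exact duplicate
]

# Assumed required format for dates: YYYY-MM-DD.
DATE_FORMAT = re.compile(r"\d{4}-\d{2}-\d{2}")


def quality_report(rows):
    """Return the order IDs that fail each of three basic checks."""
    # Completeness: no field may be empty.
    incomplete = [r["order_id"] for r in rows if any(v == "" for v in r.values())]
    # Conformity: dates must match the required format.
    nonconforming = [
        r["order_id"] for r in rows if not DATE_FORMAT.fullmatch(r["order_date"])
    ]
    # Redundancy: the same record must not appear twice.
    seen, redundant = set(), []
    for r in rows:
        key = tuple(sorted(r.items()))
        if key in seen:
            redundant.append(r["order_id"])
        seen.add(key)
    return {
        "incomplete": incomplete,
        "nonconforming": nonconforming,
        "redundant": redundant,
    }


report = quality_report(records)
print(report)
```

Checks such as accuracy and timeliness usually need outside information, for example a trusted reference value or the current date, so they are left out of this sketch.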


- Testing Env: 

Building the pipeline processes is only one aspect of creating data pipelines; it’s an iterative process that might require you to make updates and changes depending on how technology or business needs change. Because you will want to continue making improvements to the system, you need to create ways to test any changes before they’re implemented to avoid disrupting users’ access to the data. This could include creating a separate staging environment for data where you can run tests or including a stable dataset that you can make changes to and compare to current processes without interrupting the current flow. 
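
One simple form of this testing is to run the current step and a proposed change against the same stable dataset and compare the results. The sketch below assumes a hypothetical name-cleaning step; the function names and the sample data are invented for illustration.

```python
# A small, fixed dataset kept only for testing changes (made-up values).
STABLE_DATASET = [" Fries ", "BURGER", "salad"]


def current_transform(name):
    """The step that is running in the pipeline today."""
    return name.strip().lower()


def candidate_transform(name):
    """Proposed change: also collapse repeated internal whitespace."""
    return " ".join(name.split()).lower()


def compare(dataset, current, candidate):
    """Return the inputs on which the candidate disagrees with the current step."""
    return [x for x in dataset if current(x) != candidate(x)]


differences = compare(STABLE_DATASET, current_transform, candidate_transform)

# If nothing differs on the stable dataset, the change has not altered existing results.
safe_to_promote = not differences
print(differences, safe_to_promote)
```

Any input that appears in `differences` is a place where the change would alter what users see, so it can be reviewed before the live pipeline is touched.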

In [4]:
pd.read_csv("W1_DashBoards.csv", delimiter="|")

| Unnamed: 0 | Element of the dashboard | BI professional tenets | Stakeholder tenets |
|---|---|---|---|
| 0 | Centralization | Creating a single source of BIdata for all sta... | Working with a comprehensive view of data that... |
| 1 | Visualization | Showing data in near-real time | Spotting changing trends and patterns more qui... |
| 2 | Insightfulness | Determining relevant information to include | Understanding a more holistic story behind the... |
| 3 | Customization | Creating custom views dedicated to a specific ... | Drilling down to more specific areas of specia... |


### Effective Visualizations

- Pre-attentive attributes: marks and channels 

Creating effective visuals involves considering how the brain works, then using specific visual elements to communicate the information effectively. Pre-attentive attributes are the elements of a data visualization that people recognize automatically without conscious effort. The essential, basic building blocks that make visuals immediately understandable are called marks and channels. 

- Design principles

Once you understand the pre-attentive attributes of data visualization, you can go on to design principles for creating effective visuals. These design principles are vital to your work as a data analyst because they help you make sure that you are creating visualizations that convey your data effectively to your audience. 

- Avoiding misleading or deceptive charts 

Make your visualizations accessible and useful to everyone in your audience by keeping in mind the following:

- Labeling

- Text alternatives

- Text-based format

- Distinguishing

- Simplifying

## Glossary

Application programming interface (API): A set of functions and procedures that integrate computer programs, forming a connection that enables them to communicate 

Business intelligence (BI): Automating processes and information channels in order to transform relevant data into actionable insights that are easily available to decision-makers

Business intelligence governance: A process for defining and implementing business intelligence systems and frameworks within an organization

Business intelligence stages: The sequence of stages that determine both BI business value and organizational data maturity, which are capture, analyze, and monitor

Business intelligence strategy: The management of the people, processes, and tools used in the business intelligence process

Data analysts: People who collect, transform, and organize data

Data governance professionals: People who are responsible for the formal management of an organization’s data assets

Data maturity: The extent to which an organization is able to effectively use its data in order to extract actionable insights

Data model: A tool for organizing data elements and how they relate to one another

Data pipeline: A series of processes that transports data from different sources to their final destination for storage and analysis

Data warehousing specialists: People who develop processes and procedures to effectively store and organize data

ETL (extract, transform, and load): A type of data pipeline that enables data to be gathered from source systems, converted into a useful format, and brought into a data warehouse or other unified destination system

Information technology professionals: People who test, install, repair, upgrade, and maintain hardware and software solutions

Iteration: Repeating a procedure over and over again in order to keep getting closer to the desired result

Key performance indicator (KPI): A quantifiable value, closely linked to business strategy, which is used to track progress toward a goal

Portfolio: A collection of materials that can be shared with potential employers

Project manager: A person who handles a project’s day-to-day steps, scope, schedule, budget, and resources
