# Prioritizing and analyzing data

## Discerning important data

- *Signal*: An observable change that helps determine the overall health of the project and identify early signs that something is not quite right.
- Identify which tasks contribute the most to the overall goal.
- Prioritize the data or metrics that are most valuable to stakeholders.
- Stakeholders can look to your project plan for a high-level overview of answers to important questions, the defined success criteria, artifacts, and the overall health of your project.


# Data ethics considerations
In the previous video, you learned how to use knowledge to discern data and how signals help prioritize data. This reading will cover the importance of data ethics and two key principles: data privacy and data bias.

## Data ethics

As a project manager, data collection and analysis will be a key part of your projects. As you’ve learned, you’ll collect data from a variety of sources, including focus groups, interviews and questionnaires. The data you collect will usually hold PII (personally identifiable information)—information that could be used to directly identify, contact, or locate an individual. A lot of times, you will also need to report on the data you collect to stakeholders, customers, and your project team. Collecting, analyzing, and sharing this data in an ethical way is extremely important for maintaining the integrity of your organization, your projects, and your position.

Data ethics is the study and evaluation of moral challenges related to data collection and analysis. This includes generating, recording, curating, processing, sharing, and using data in order to come up with ethical solutions.

Businesses apply data ethics practices so they can:

- Comply with regulations
- Show that they are trustworthy
- Ensure fair and reasonable data usage
- Minimize biases
- Develop a positive public perception

Data ethics is rooted in several principles. In this reading, we will focus on two of these principles: data privacy and data bias.

## Data privacy
Data privacy is a key part of data ethics. Data privacy deals with the proper handling of data. This includes the purpose of data collection and processing, privacy preferences, the way organizations manage personal data, and the rights of individuals. It focuses on making sure the ways we collect, process, share, archive, and delete data are all in accordance with the law.

As a project manager, it is your responsibility to protect the data you collect. You can help ensure the privacy of data collected from users, stakeholders, and others for your projects by:

- Increasing data privacy awareness. Make sure every member of your project team, along with vendors, contractors, and other stakeholders from outside your company, is aware of your organization's data security and privacy protocols.

- Using security tools. Free security tools, like encrypted storage solutions and password managers, can decrease your project’s vulnerability to a data breach. In a lot of applications, like ones that are part of Google Workspace and Microsoft OneDrive, privacy settings can be adjusted to only give access to specific individuals.

- Anonymizing data. Data anonymization refers to one or more techniques, such as blanking, hashing, or masking personal and identifying information, to protect the identities of people included in the data. This helps protect individuals’ personal information by keeping them anonymous. Once the information has been properly anonymized, it can be used and shared with far less risk to the people it describes. Types of data that should be anonymized include names, telephone numbers, social security numbers, email addresses, photographs, and account numbers.
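
As a rough illustration of the blanking, hashing, and masking techniques named above, here is a minimal Python sketch. The record fields (`name`, `email`, `phone`, `feedback`), the helper names, and the salt value are all hypothetical and chosen only for this example. Hashing on its own may not be enough for values that are easy to guess, so treat this as a starting point rather than a complete privacy solution.

```python
import hashlib


def hash_value(value: str, salt: str) -> str:
    """Hashing: replace a value with a salted SHA-256 digest."""
    return hashlib.sha256((salt + value).encode("utf-8")).hexdigest()


def mask_phone(phone: str, visible: int = 2) -> str:
    """Masking: hide every digit except the last `visible` digits."""
    to_hide = max(sum(c.isdigit() for c in phone) - visible, 0)
    masked = []
    for c in phone:
        if c.isdigit() and to_hide > 0:
            masked.append("*")
            to_hide -= 1
        else:
            masked.append(c)
    return "".join(masked)


def anonymize_record(record: dict, salt: str) -> dict:
    """Return a copy of a survey record with identifying fields protected."""
    return {
        "name": "",  # blanking: drop the value entirely
        "email": hash_value(record["email"].strip().lower(), salt),
        "phone": mask_phone(record["phone"]),
        "feedback": record["feedback"],  # non-identifying content is kept
    }


# Hypothetical record and salt, for illustration only.
record = {
    "name": "Alex Example",
    "email": "alex@example.com",
    "phone": "555-0142",
    "feedback": "The equipment feels outdated.",
}
anonymized = anonymize_record(record, salt="keep-this-secret")
print(anonymized["phone"])  # ***-**42
```

Hashing the email (rather than blanking it) keeps a stable identifier, so repeat responses from the same person can still be matched without storing the address itself.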

## Data bias
Another important data ethics practice is making sure that the data you collect does not reflect any biases. Data bias is a type of error that tends to skew results in a certain direction. Maybe the questions on your survey had a particular slant to influence answers, or maybe your sample group was not fully representative of the population you want to study. Bias can also happen if a sample group lacks inclusivity, for example, if your sample does not include people with disabilities. The way you collect data can also bias a dataset. Say you give people only a short time to answer questions. This can lead to rushed responses. When people are rushed, they tend to make more mistakes, which can affect the quality of their data and create biased outcomes. As a project manager, you have to think about bias and fairness from the moment you start collecting data to the time you present your conclusions.

## Types of biases
There are different types of biases to keep in mind when setting up your data collection processes. Here are four of the most common types of biases that could impact your data:

- Sampling bias is when a sample is not representative of the population as a whole. For example, maybe your sample did not include people above the age of 65. Or maybe you excluded people from certain socioeconomic groups.

- Observer bias is the tendency for different people to observe things differently. For example, stakeholders from different parts of the world might view the same data differently and draw different conclusions from it. 

- Interpretation bias is the tendency to always interpret situations that don’t have obvious answers in a strictly positive or negative way when, in fact, there is more than one way to understand the data. Data that does not provide an obvious set of conclusions makes some people feel anxious, which can lead to interpretation bias. For example, a team member might interpret inconclusive survey results negatively, while other team members might be able to think more carefully and assess the data from different angles.

- Confirmation bias is the tendency to search for or interpret information in a way that confirms pre-existing beliefs. For example, you might ask only specific stakeholders for feedback on parts of your project because you know they are the most likely to have the same perspective as you.

Each of these types of bias affects the way you collect and make sense of the data, so it is important to be aware of them when setting up your data collection processes.
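
Sampling bias lends itself to a simple numeric check: compare each group's share of your sample with its share of the population you want to study. The sketch below is one possible way to do that in Python. The age groups, the population shares, and the 5-point tolerance are made-up assumptions for illustration, not figures from this reading.

```python
from collections import Counter


def underrepresented_groups(sample_groups, population_share, tolerance=0.05):
    """Flag groups whose share of the sample falls short of their
    population share by more than `tolerance`.

    Returns {group: (observed_share, expected_share)}.
    """
    counts = Counter(sample_groups)
    total = len(sample_groups)
    flagged = {}
    for group, expected in population_share.items():
        observed = counts[group] / total if total else 0.0
        if expected - observed > tolerance:
            flagged[group] = (observed, expected)
    return flagged


# Hypothetical survey sample: nobody aged 65 or older responded.
sample = ["18-34"] * 50 + ["35-64"] * 50
# Hypothetical population shares for the same age groups.
population = {"18-34": 0.3, "35-64": 0.5, "65+": 0.2}

print(underrepresented_groups(sample, population))
```

A flagged group does not prove the results are wrong, but it is a signal to collect more responses from that group or to be cautious about generalizing.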

## Key takeaway
According to the Project Management Institute’s Code of Ethics & Professional Conduct, “Ethics is about making the best possible decisions concerning people, resources, and the environment. Ethical choices diminish risk, advance positive results, increase trust, determine long term success, and build reputations. Leadership is absolutely dependent on ethical choices.”

A key way you can show your leadership skills is by exercising sound judgment when it comes to data ethics. In order to tell a project’s data-informed story to stakeholders, project team members, and others in an ethical way, you have to think about both privacy and bias-related concerns in how you collect, analyze, and share that data.



## Using data analysis to inform decisions

- *Data analysis*: The process of collecting and organizing information to help draw conclusions.
- *Quantitative data*: Statistical and numerical facts.
- *Qualitative data*: Subjective qualities that cannot be measured with numerical data.

## The six steps of data analysis
In an earlier video, you learned that data analysis is the process of collecting and organizing information to help draw conclusions, solve problems, make informed decisions, and support your goals. In this reading, we will go over the key parts of the data analysis process. 

There are six main steps involved in data analysis: *Ask, prepare, process, analyze, share, and act*. Let’s break these down one by one.

During the Ask phase, ask key questions to help frame your analysis, starting with: What is the problem? When defining the problem, look at the current state of the business and identify how it is different from the ideal state. Usually, there is an obstacle in the way or something wrong that needs to be fixed.  At this stage, you want to be as specific as possible. You also want to stay focused on the problem itself, not just the symptoms. For example, imagine you are doing data analysis for a gym that is losing memberships. You could ask: Why do we keep losing members? But a better and more specific question would be: What factors are negatively impacting the member experience? That way, when you set off to do your research, you know exactly what to look for. 

Another part of the Ask stage is identifying your stakeholders and understanding their expectations. There can be lots of stakeholders on a project, and each of them can make decisions, influence actions, and weigh in on strategies. Each stakeholder will also have specific goals they want to meet. It is pretty common for a stakeholder to come to you with a problem that needs solving. But before you begin your analysis, you need to be clear about what they are asking of you. For example, if your manager assigns you a project related to analyzing the gym’s business risk, it would be a good idea to confirm whether they want you to analyze all types of risks that could affect the gym or just risks related to weather or seasonal trends.

After you have a clear direction, it is time to move to the Prepare stage. This is where you collect and store the data you will use for the upcoming analysis process. 

Let’s turn back to our gym membership example. To collect data on the member experience, you decide to send surveys to the gym’s members asking for feedback about their experience. To make sure you get specific answers, you ask them to offer feedback in three distinct categories: upkeep of the facility, customer service, and membership cost. You also leave room for them to write in a response. When you get the member surveys back, it is important that you have an organized system for tracking and filing them.   

Next, it is time to Process your data. In this step, you will “clean” your data, which means you will enter your data into a spreadsheet, or another tool of your choice, and eliminate any inconsistencies and inaccuracies that can get in the way of results. While cleaning your data, be sure to get rid of any duplicate responses or biased data. This helps ensure that any decisions made from the analysis are based on facts and that they are fair and unbiased. For example, if you noticed duplicate responses from a single gym member when sorting through the surveys, you would need to get rid of the copies to be sure your data set is accurate.

During this stage, it is also important to check the data you prepared to make sure it is complete and correct and that there are no typos or other errors. 
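
If you process the survey in code rather than a spreadsheet, the cleaning step might look something like the Python sketch below. It assumes each response is a dictionary with a hypothetical `member_id` field plus one rating per feedback category. These field names and the sample values are invented for the example.

```python
REQUIRED_FIELDS = ("member_id", "upkeep", "customer_service", "membership_cost")


def clean_responses(responses: list) -> tuple:
    """Split survey responses into (cleaned, incomplete).

    - Strips stray whitespace from text values.
    - Sets aside responses that are missing a required field.
    - Keeps only the first response from each member.
    """
    seen_ids = set()
    cleaned, incomplete = [], []
    for response in responses:
        row = {k: v.strip() if isinstance(v, str) else v
               for k, v in response.items()}
        if any(row.get(field) in (None, "") for field in REQUIRED_FIELDS):
            incomplete.append(row)  # needs follow-up before analysis
            continue
        if row["member_id"] in seen_ids:
            continue  # duplicate response from the same member
        seen_ids.add(row["member_id"])
        cleaned.append(row)
    return cleaned, incomplete


# Hypothetical survey data: ratings from 1 (poor) to 5 (excellent).
survey = [
    {"member_id": "M001", "upkeep": 2, "customer_service": 4, "membership_cost": 1},
    {"member_id": "M001 ", "upkeep": 2, "customer_service": 4, "membership_cost": 1},
    {"member_id": "M002", "upkeep": 3, "customer_service": None, "membership_cost": 2},
    {"member_id": "M003", "upkeep": 5, "customer_service": 5, "membership_cost": 3},
]
cleaned, incomplete = clean_responses(survey)
print(len(cleaned), "clean responses,", len(incomplete), "incomplete")
```

Incomplete responses are set aside instead of deleted so that you can decide later whether to follow up with the member or exclude them.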

Now it is time to Analyze. In this stage, you take a close look at your data to draw conclusions, make predictions, and decide on next steps. Here, you will transform and organize the data in a way that highlights the full scope of the results so you can figure out what it all means. You can create visualizations using charts and graphs to determine if there are any trends or patterns within the data or any need for additional research. 

In our gym membership example, let’s say you notice 50% of the members wrote in an additional response on the survey citing that the equipment is outdated. The survey also showed that 75% of the responses cited “expensive membership fees.” When looking at the 50% of responses citing “outdated equipment” and 75% of responses citing “expensive membership fees” side by side on a graph, you may be able to deduce that these responses inform one another. Members feel like the experience just isn’t worth the price. You might conclude that the gym should invest in new equipment if it wants to keep members and add value to the membership fee.
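
The percentages in this example can be computed with a few lines of Python. The sketch below uses a tiny made-up set of four responses, chosen so the shares match the 50% and 75% figures in the example. The function names and the way issues are recorded (a set of labels per response) are assumptions for illustration.

```python
from collections import Counter


def issue_shares(responses: list) -> dict:
    """Percentage of responses that cite each issue."""
    total = len(responses)
    if total == 0:
        return {}
    counts = Counter()
    for issues in responses:
        counts.update(set(issues))  # count an issue once per response
    return {issue: 100 * n / total for issue, n in counts.items()}


def share_also_citing(responses: list, issue: str, other: str) -> float:
    """Of the responses citing `issue`, the percentage that also cite `other`."""
    citing = [r for r in responses if issue in r]
    if not citing:
        return 0.0
    return 100 * sum(other in r for r in citing) / len(citing)


# Hypothetical responses: each set holds the issues one member cited.
responses = [
    {"expensive membership fees", "outdated equipment"},
    {"expensive membership fees"},
    {"expensive membership fees", "outdated equipment"},
    {"poor customer service"},
]

shares = issue_shares(responses)
print(shares["expensive membership fees"])  # 75.0
print(shares["outdated equipment"])         # 50.0
print(share_also_citing(responses, "outdated equipment",
                        "expensive membership fees"))  # 100.0
```

The second function is one simple way to check whether two complaints tend to appear together, which is the kind of evidence that would support the idea that they "inform one another."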

Once you have asked questions to figure out the problem—then prepared, processed, and analyzed the data—it is time to Share your findings. In this stage, you use data visualization to organize your data in a format that is clear and digestible for your audience. When sharing, you can offer the insights you gained during your analysis to help stakeholders make effective, data-driven decisions for solving the problem. 

And finally, you are ready to Act! In the final stage of your data analysis, the business takes all of the insights you have provided and puts them into action to solve the original business problem. 

Conducting a data analysis is an essential process for understanding a business’ needs and challenges and determining effective solutions. These six foundational steps—ask, prepare, process, analyze, share, and act—will help set you up for success! 




