# Introduction to Product Analytics

This tutorial is part of the [Zero to Data Science Bootcamp by Jovian](https://zerotodatascience.com).

![](https://i.imgur.com/Gb6SsM1.png)

Product analytics is a subdomain of data science concerned with understanding how users interact with digital products. We use several digital products every day (websites, mobile applications, wearables, smart speakers etc.). Many of these products are used by millions of people, sometimes even billions. Understanding user behavior is key to improving digital products.

There are two ways to understand user behavior: qualitative (talking to individual users) and quantitative (collecting data from users' actions). Due to the sheer volume of users, qualitative studies are often too expensive or inefficient. For quantitative analysis, products are often instrumented to track every user action, and this data is analyzed in aggregate to understand user behavior.

The following topics are covered in this tutorial:

- User journeys for digital products and the Pirate funnel (AAARRR)
- Key metrics & measures for different stages of the Pirate funnel
- Tools for data collection, storage, analysis and visualization
- Applying machine learning to predict behavior and improve products





> **EXERCISE**: You're the owner of an e-commerce store that sells shoes online. List 5 questions that you'd like to answer by understanding user behavior.
> 
> 1. ???
> 2. ???
> 3. ???
> 4. ???
> 5. ???
>
> How will you collect the data required to answer these questions? What data analysis, visualization or machine learning techniques will you apply? Which tools, libraries or frameworks will you use to answer these questions?

## User Journeys for Digital Products

Before we can analyze user behavior, it's important to first understand how users interact with a digital product over time. As an example, let's consider Nike's online store: https://www.nike.com/ . E-commerce accounts for over 30% of Nike's global sales, so improving the customer experience on the online store is just as important as improving the customer experience in a physical store.

The journey of a typical user (let's call him Jim) may look something like this:

1. Jim sees an advertisement for Nike shoes on his Facebook feed.

<img src="https://i.imgur.com/LSqYMMm.png" width="400">

2. Jim clicks on the ad, lands on https://www.nike.com and starts exploring the products available on the site.

<img src="https://i.imgur.com/UEsyBKU.png" width="480">

3. Jim adds some products to the cart, and creates an account to save his cart.

<img src="https://i.imgur.com/iOmQ8Vk.png" width="480">

4. Jim uses his credit card to make an online payment to complete the order.

<img src="https://i.imgur.com/mBzl61x.png" width="360">

5. A few months later, Jim returns to the site (via a link from a promotional newsletter) and makes another purchase.

<img src="https://i.imgur.com/mA8Vu9p.png" width="400">

6. Jim is quite happy with the shoes he's purchased. He recommends that his friends Dana, Tim and Bernard check out http://www.nike.com for their next pair of shoes.

<img src="https://i.imgur.com/gnJdMye.png" width="480">


Note that Jim's user journey spans several months, and the journeys for Dana, Tim and Bernard may look somewhat different, although they are likely to follow the same general trend.




## The Pirate Funnel (AAARRR)

The Pirate funnel (so-called because of the acronym AAARRR) is a commonly-used framework for tracking the product usage lifecycle of users and identifying areas for improvement. The funnel has the following stages:

1. **Awareness**: A user learns about the product
2. **Acquisition**: The user tries out the product
3. **Activation**: The user takes the first important step
4. **Revenue**: The user makes a payment (becomes a customer)
5. **Retention**: The user makes repeated payments
6. **Referral**: The user brings other users to the product

Note that not all of these stages apply to every product and some stages may involve multiple steps. The framework is called a "funnel" because there's a possibility of users dropping off at each stage. 

Here's what a graph showing the number of users who have completed each stage might look like ([source](https://growwithward.com/aaarrr-pirate-funnel/)):

<img src="https://i.imgur.com/lD0oy9Y.png" width="480">

To improve the user experience, it's important to measure the conversion from each step to the next, and identify the biggest bottlenecks:

<img src="https://gustdebacker.com/wp-content/uploads/2021/02/Bottleneck-research.png" width="460">

In the above example, you can see that user retention is quite poor. The funnel helps companies identify the areas for improvement and allocate resources efficiently to maximize revenue.
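As a rough illustration of measuring step-to-step conversion, here is a minimal sketch that computes the conversion rate between adjacent funnel stages and picks the weakest step. The stage counts are made-up numbers, and `step_conversions` is a hypothetical helper, not part of any analytics tool.

```python
# Hypothetical user counts per funnel stage (illustrative numbers only).
funnel = [
    ("Awareness", 10000),
    ("Acquisition", 4000),
    ("Activation", 1200),
    ("Revenue", 300),
    ("Retention", 60),
    ("Referral", 30),
]


def step_conversions(stages):
    """Return (from_stage, to_stage, conversion_rate) for each adjacent pair."""
    results = []
    for (prev_name, prev_count), (next_name, next_count) in zip(stages, stages[1:]):
        rate = next_count / prev_count if prev_count else 0.0
        results.append((prev_name, next_name, rate))
    return results


conversions = step_conversions(funnel)
for prev_name, next_name, rate in conversions:
    print(f"{prev_name} -> {next_name}: {rate:.1%}")

# The bottleneck is the step with the lowest conversion rate.
bottleneck = min(conversions, key=lambda step: step[2])
print("Biggest bottleneck:", bottleneck[0], "->", bottleneck[1])
```

With these made-up counts, the weakest step is the one from Revenue to Retention, which is the kind of signal a team would use to decide where to focus.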

> **EXERCISE**: Explain how the various stages of the Pirate funnel apply to the user journey for nike.com. How can the conversion from each step to the next be improved?

> **EXERCISES**: Explain how the various stages of the Pirate funnel apply to the user journeys for the following products:
> 
> 1. Netflix 
> 2. Facebook (users)
> 3. Facebook (advertisers)
> 4. Amazon (buyers)
> 5. Amazon (sellers)
> 6. Jovian
> 7. Airbnb
> 8. Uber (riders)
> 9. Uber (drivers)
> 
> For each of these products, come up with ideas for increasing the conversion from one stage to the next.

> **EXERCISES**: List five or more digital products that you use regularly, and identify which stage of the Pirate funnel you belong to for each product.
> 
> 1. ???
> 2. ???
> 3. ???
> 4. ???
> 5. ???
>
> What would it take for you to move to the next stage of the funnel, for each of these products?

## Key metrics for different stages of the Pirate funnel

To increase conversion at every stage of the funnel, it's important to understand the steps involved in each stage, use the right tools to measure them with appropriate metrics, and determine product-specific strategies to improve the metrics over time.

> **Metric**: A metric is a quantifiable measure used to track and assess the status or effectiveness of a process.


### Awareness

Users typically learn about a digital product (e.g. nike.com) on another digital platform (e.g. Facebook, WhatsApp etc.) or via a physical channel (word of mouth, flyers, newspaper ads etc.). There are two types of channels for generating awareness about a product: organic and paid.

Here are some organic channels for generating awareness:

- Google search
- YouTube videos
- Posts or messages by existing users
- Company's social media handles 
- Emails & newsletters
- Word of mouth

Here are some paid channels for generating awareness:

- Online Ads (Google, Facebook, Twitter, Instagram, TikTok etc.)
- Partnerships with other companies
- Influencer/celebrity endorsements
- TV Ads, Billboards and newspaper ads
- Incentivized referrals
- Promotional SMS campaigns

Companies typically carry out digital marketing campaigns to generate awareness for their products. The effectiveness of marketing campaigns is typically measured using the following metrics:

- **Impressions**: The number of times a post/ad about your product was seen
- **Conversion / Click Through Rate**: The percentage of impressions that led to a desired action (typically a link click).
- **Cost per Conversion/Click (CPC)**: The amount charged per conversion event. 
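The metrics above reduce to simple ratios. Here is a minimal sketch, assuming the conversion event is a link click; the campaign names and numbers are hypothetical and only serve to show the arithmetic.

```python
def click_through_rate(clicks, impressions):
    """Percentage of impressions that led to a click."""
    return 100.0 * clicks / impressions if impressions else 0.0


def cost_per_click(total_spend, clicks):
    """Average amount paid per click."""
    return total_spend / clicks if clicks else 0.0


# Hypothetical campaign numbers.
campaigns = {
    "facebook_feed": {"impressions": 50000, "clicks": 1000, "spend": 500.0},
    "google_search": {"impressions": 20000, "clicks": 800, "spend": 600.0},
}

for name, c in campaigns.items():
    ctr = click_through_rate(c["clicks"], c["impressions"])
    cpc = cost_per_click(c["spend"], c["clicks"])
    print(f"{name}: CTR = {ctr:.2f}%, CPC = {cpc:.2f}")
```

Comparing CTR and CPC side by side across channels is one simple way to decide where the next marketing dollar is likely to do the most good.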


Most advertising platforms offer tools & dashboards for measuring paid (and sometimes organic) marketing campaigns.

#### Google Search Console

<img src="https://1.bp.blogspot.com/-m8Xl8ml5q-Y/WlM9em83yWI/AAAAAAAACqI/jGg8r_swEU0T9bbDTFWgnuaWWZ0XpibdQCLcBGAs/s1600/sc01.png" width="480">


#### Facebook Ads Dashboard

<img src="https://images.klipfolio.com/website/public/80d03872-3e74-47b7-8028-a66bd4c2e84a/fbad-performance-dash.jpg" width="480">

#### Twitter Ads Dashboard

<img src="https://business.twitter.com/content/dam/business-twitter/analytics/campaigns-dashboard/campaign_dashboard.jpg.twimg.1920.jpg" width="480">

> **EXERCISE**: List some digital marketing campaigns that you have been exposed to recently. Identify the channels used for carrying out the campaign and the tools used to measure the effectiveness of the campaigns.

> **EXERCISE**: Describe some strategies for improving the effectiveness of digital marketing campaigns. *Hint*: Look up the terms targeting criteria, retargeting, A/B testing.

### Acquisition

Acquisition can mean different things for different products. The terms "acquisition" and "activation" are sometimes also used interchangeably, or combined into a single stage. In this tutorial, we'll use the term "acquisition" to refer to users visiting and browsing through your site/app/product.

Here are some important metrics used to track acquisition:

- **Website visitors (monthly/weekly)**: The number of people who visit a site (or download an app) every month/week.
- **Average Pages per visit**: The number of pages/screens users browse through within the site/app in a single visit (also called a session).
- **Average Session length**: The amount of time spent on the site/app in a single session.
- **Bounce rate**: The percentage of users who clicked the link but left the site without interacting with it.
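To make these definitions concrete, here is a small sketch that derives the metrics from raw session records. The record layout and the rule that a single-page session counts as a bounce are assumptions made for illustration; tools like Google Analytics apply their own definitions.

```python
# Hypothetical session records: (visitor_id, pages_viewed, duration_seconds).
sessions = [
    ("u1", 1, 5),
    ("u2", 4, 180),
    ("u1", 3, 120),
    ("u3", 1, 8),
    ("u4", 6, 300),
]


def acquisition_metrics(records):
    """Summarize visitors, pages per visit, session length and bounce rate."""
    if not records:
        raise ValueError("at least one session is required")
    total = len(records)
    visitors = len({visitor for visitor, _, _ in records})
    avg_pages = sum(pages for _, pages, _ in records) / total
    avg_duration = sum(seconds for _, _, seconds in records) / total
    # Assumption: a session with a single page view counts as a bounce.
    bounces = sum(1 for _, pages, _ in records if pages == 1)
    return {
        "visitors": visitors,
        "avg_pages_per_visit": avg_pages,
        "avg_session_seconds": avg_duration,
        "bounce_rate_pct": 100.0 * bounces / total,
    }


metrics = acquisition_metrics(sessions)
for name, value in metrics.items():
    print(f"{name}: {value}")
```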

Apart from the above, it is also useful to look at information like age, sex, location, device type (desktop/mobile), network connection type etc. to better understand website visitors.

Most websites use a tool like Google Analytics to monitor website traffic:



<img src="https://www.clickmatix.com.au/soozutin/2022/05/Google-Analytics-4-Report.png" width="640">

Google Analytics is [easy to set up](https://support.google.com/analytics/answer/1008015?hl=en) - it just requires adding a small piece of tracking code to a site. A great way to explore the Google Analytics dashboard is to [check out the demo account](https://support.google.com/analytics/answer/6367342), which shows real data from the [Google Merchandise Store](https://shop.googlemerchandisestore.com/).


Here's a series of tutorials on Google Analytics: https://www.youtube.com/watch?v=kKPTZNjaKcY&list=PLI5YfMzCfRtZU5Qwzx57HjmZyNE2o5YDN

> **EXERCISE**: Explore the Google Analytics [demo account](https://support.google.com/analytics/answer/6367342), which shows real data from the [Google Merchandise Store](https://shop.googlemerchandisestore.com/)

### Activation

Activation refers to the user completing an important step in the product which is highly likely to lead to a purchase or repeated usage of the product. Typical activation events are:

- Providing an email ID to unlock some content
- Creating an account on the site/app
- Adding a product to the cart (on an online store)
- Other core actions e.g. adding a friend, writing a post, sending a message etc. 

The following metrics are used to measure activation:

- **Sign up / Activation Rate**: The percentage of site/app visitors who sign up or complete the core activation event
- **Monthly/weekly activations**: The number of new activations every month/week
- **Customer Acquisition/Activation Cost**: The amortized cost per activation (Total marketing spend / No. of activations)
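The activation metrics can be computed directly from a few totals. The sketch below uses hypothetical monthly numbers, and the function names are made up for illustration.

```python
def activation_rate(activations, visitors):
    """Percentage of visitors who completed the activation event."""
    return 100.0 * activations / visitors if visitors else 0.0


def cost_per_activation(marketing_spend, activations):
    """Total marketing spend divided by the number of activations."""
    return marketing_spend / activations if activations else float("inf")


# Hypothetical numbers for one month.
visitors = 4000
activations = 1200
marketing_spend = 6000.0

rate = activation_rate(activations, visitors)
cost = cost_per_activation(marketing_spend, activations)
print(f"Activation rate: {rate:.1f}%")
print(f"Cost per activation: {cost:.2f}")
```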

It is also common to track several secondary events, apart from the primary activation event. Tools like Mixpanel, Amplitude and Firebase are used for event-based analytics and funnel tracking.

Finding the right activation event for a product is one of the key challenges for early stage companies. Here's how Facebook found their activation event (or "aha" moment): https://mode.com/blog/facebook-aha-moment-simpler-than-you-think/

#### Amplitude

<img src="https://assets.contents.io/asset_Jv5d2IV6.png" width="480">

#### Mixpanel 

<img src="https://storage.googleapis.com/cdn-mxpnl-com/static/readme/Dashboard.svg" width="480">

#### Firebase Analytics 

<img src="https://miro.medium.com/max/1400/1*4oNGPpC71GDdwNFNUzdVBw.png" width="480">



> **EXERCISE**: Identify the activation events (or "aha" moments) for 5 digital products you've interacted with in the past month. What changes can you make to these products to increase the percentage of users who complete the activation event?
> 


### Revenue

Revenue, as the name suggests, refers to a payment made by a user for a product or a service. There are several free products, which don't generate revenue directly from their users, and there are several marketplace products that enable transactions between different parties and collect a percentage of the transaction amount as commissions.

The following metrics are used to track revenue:

- **Monthly (Recurring) Revenue**: The total payment received from all users in a month. Recurring revenue refers to revenue from subscriptions.
- **Paying users**: The number of users of the site/app that currently pay for the product. This can also be expressed as a percentage of total users.
- **Average Revenue per Customer**: The average payment received per paying customer (typically annualized)
- **Trial to Paid Conversion Rate**: The percentage of users who convert from a free trial to a paid plan (wherever applicable)
- **Life Time Value**: The total expected revenue per paying user, from the point they start paying to the point they stop using the site/app.
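Here is a minimal sketch of how these revenue metrics might be computed from a month of payment records. All numbers are hypothetical. The lifetime value estimate uses a common simplification (average monthly revenue per customer divided by monthly churn), which assumes a constant churn rate and is only one of several ways to estimate it.

```python
# Hypothetical payments received in one month: (user_id, amount).
payments = [("u1", 20.0), ("u2", 50.0), ("u3", 20.0), ("u2", 10.0), ("u4", 20.0)]
total_users = 40      # hypothetical: all users of the product, paying or not
monthly_churn = 0.05  # hypothetical: fraction of paying users lost each month


def revenue_metrics(payments, total_users, monthly_churn):
    """Compute monthly revenue, paying users, average revenue and a rough LTV."""
    monthly_revenue = sum(amount for _, amount in payments)
    paying_users = len({user for user, _ in payments})
    avg_revenue = monthly_revenue / paying_users if paying_users else 0.0
    # Assumption: constant churn, so the expected lifetime is 1 / churn months.
    lifetime_value = avg_revenue / monthly_churn if monthly_churn else float("inf")
    return {
        "monthly_revenue": monthly_revenue,
        "paying_users": paying_users,
        "paying_users_pct": 100.0 * paying_users / total_users if total_users else 0.0,
        "avg_revenue_per_customer": avg_revenue,
        "lifetime_value": lifetime_value,
    }


revenue = revenue_metrics(payments, total_users, monthly_churn)
for name, value in revenue.items():
    print(f"{name}: {value:.2f}")
```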


Most websites and apps accept payments via a payment gateway like Stripe. Payment gateways often offer revenue-tracking dashboards, and allow data to be synced into other analytics platforms.


#### Stripe

<img src="https://static.tildacdn.com/tild6239-3464-4831-a563-313366366563/Screen_Shot_2022-03-.png" width="480">

#### Baremetrics

<img src="https://baremetrics-marketing.imgix.net/front-page/homepage-control-center.png?auto=format,compress" width="480">

> **EXERCISE**: List 5 digital products that you're currently paying for. What convinced you to pay for each of these products? Estimate your lifetime value as a customer of the product.

### Retention 

Retention is a way of measuring how users interact with a product over time. The following metrics are used to measure retention:

- Churn rate
- Monthly active users
- 30/60/90-day retention rate
- Net Promoter Score
- Time between purchases

Most analytics tools offer some features to measure retention over time.

#### Cohort Analysis

Retention is typically measured by looking at monthly/weekly cohorts of users.

![](https://i.imgur.com/cjkO3Ai.png)
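A cohort table like the one above can be built from a plain activity log. The following is a simplified sketch, assuming each user belongs to the cohort of the first month in which they were active; the log and the `cohort_retention` helper are hypothetical.

```python
from collections import defaultdict

# Hypothetical activity log: (user_id, month) pairs, where month is an integer
# index (0 = first month of tracking).
activity = [
    ("u1", 0), ("u1", 1), ("u1", 2),
    ("u2", 0), ("u2", 1),
    ("u3", 0),
    ("u4", 0), ("u4", 2),
    ("u5", 1), ("u5", 2),
    ("u6", 1),
]


def cohort_retention(events):
    """Map cohort month -> {months since first activity -> retention fraction}."""
    # A user's cohort is the first month in which they were active.
    first_month = {}
    for user, month in events:
        first_month[user] = min(month, first_month.get(user, month))

    cohort_sizes = defaultdict(int)
    for month in first_month.values():
        cohort_sizes[month] += 1

    # Collect the distinct users active at each offset within each cohort.
    active = defaultdict(lambda: defaultdict(set))
    for user, month in events:
        cohort = first_month[user]
        active[cohort][month - cohort].add(user)

    return {
        cohort: {
            offset: len(users) / cohort_sizes[cohort]
            for offset, users in sorted(offsets.items())
        }
        for cohort, offsets in sorted(active.items())
    }


table = cohort_retention(activity)
for cohort, row in table.items():
    cells = ", ".join(f"month +{offset}: {rate:.0%}" for offset, rate in row.items())
    print(f"cohort {cohort}: {cells}")
```

Each row corresponds to one cohort and each cell to the share of that cohort still active a given number of months after joining.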

#### Retention Curve

The retention curve for a product shows the average retention rate over time. The retention curve usually flattens out over time for sticky products, and goes down to zero for products that users don't find useful over the long term.

![](https://images.ctfassets.net/h6ufgtwb6nv1/7pTFD0ZlTtvD5RfIiMywZ8/23b72d4f34457dacdc91f232c2cdc7a5/netflix-retention.png)

> **EXERCISE**: List 5 digital products that you've been using (and possibly paying) for over a year. Why are you continuing to pay for the product?

> **EXERCISE**: List 5 digital products that you've stopped using (or paying for) over the past year. Why did you decide to stop paying for the product?

### Referral

Referral refers to the act of a user recommending a product to others (friends, colleagues, family, etc.). Referrals are a strong sign that users find a product valuable, and they may even become the primary acquisition channel for a product. It is not always possible to measure referrals accurately. However, many products contain built-in ways to refer other users to a platform, and some even incentivize referrals by offering exclusive discounts to the referrer and the referee. Here's an example:

![](https://i.imgur.com/cNPGsit.png)

The following metrics are used to track referrals:

- No. of users who send invites
- No. of invites sent per user
- Percentage conversion of invites
- K factor: The average number of new users that each existing user brings to the platform (invites sent per user × percentage conversion of invites).
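As a quick worked example, the K factor is commonly computed as the average number of invites sent per user multiplied by the fraction of invites that convert into new users. The numbers below are hypothetical.

```python
def k_factor(existing_users, invites_sent, invites_converted):
    """K = (invites per existing user) * (fraction of invites that convert)."""
    if existing_users == 0 or invites_sent == 0:
        return 0.0
    invites_per_user = invites_sent / existing_users
    conversion_rate = invites_converted / invites_sent
    return invites_per_user * conversion_rate


# Hypothetical numbers: 1000 users sent 2000 invites, 300 of which converted.
k = k_factor(existing_users=1000, invites_sent=2000, invites_converted=300)
print(f"K factor: {k:.2f}")
```

A K factor above 1 would mean each user brings in more than one new user on average, so the user base grows through referrals alone.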

By building product features to promote and incentivize referrals, products can create a virtuous feedback loop of viral organic growth. Learn more about referral campaign tracking here: https://referralrock.com/blog/referral-tracking/


> **EXERCISE**: Come up with 5 examples of products (digital or physical) that you were referred to by a friend or a colleague. What did the referral experience look like? Did the product have any built-in or incentivized referral features?

> **EXERCISE**: Come up with 5 examples of products (digital or physical) that you referred to a friend or a colleague. What did the referral experience look like? Did the product have any built-in or incentivized referral features?

## Analytics Tools and Software

There are three main categories of tools that are used for product analytics:

1. **Event tracking**: Google Analytics, Segment, Firebase etc.
2. **Data Warehouses**: Relational Databases, BigQuery, Amazon Redshift etc.
3. **Querying & Visualization**: Google Analytics, Mixpanel, Amplitude, Tableau, Metabase etc.
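To give a feel for what flows through these tools, here is a toy sketch of event tracking followed by an aggregate query. The `track` function, the in-memory `event_log` and the event names are all hypothetical stand-ins; real tools such as Segment or Firebase have their own SDKs and schemas.

```python
import json
import time
from collections import Counter

event_log = []  # stand-in for a tracking service or a data warehouse table


def track(user_id, event_name, properties=None, timestamp=None):
    """Append one event record to the in-memory log and return it."""
    event = {
        "user_id": user_id,
        "event": event_name,
        "properties": properties or {},
        "timestamp": timestamp if timestamp is not None else time.time(),
    }
    event_log.append(event)
    return event


# Event tracking: record user actions as they happen.
track("jim", "page_view", {"path": "/"}, timestamp=1)
track("jim", "add_to_cart", {"sku": "shoe-123"}, timestamp=2)
track("dana", "page_view", {"path": "/"}, timestamp=3)

# Querying: how many distinct users triggered each event?
users_per_event = Counter()
for name in {e["event"] for e in event_log}:
    users_per_event[name] = len({e["user_id"] for e in event_log if e["event"] == name})

print(json.dumps(event_log[0]))
print(dict(users_per_event))
```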

> **EXERCISE**: Learn about each of the above tools by referring to tutorials online. Can you determine the use case for each tool?

## Predicting Behavior and Improving Products with Machine Learning

While product analytics tools tell us how users are using a product, machine learning can be used to predict how users will interact with a product in the future or to improve the experience of a user while they're using a product. In this section, we'll look at some applications of machine learning to digital products.

### Ad Targeting

Advertising platforms offer several targeting criteria to determine the audience for an ad, and they also use machine learning models to determine how likely a user is to engage with a given advertisement. These models are used to show the most relevant ads to users in their social feeds.

![](https://i.imgur.com/Da0jQKM.png)

Another common application of data science & machine learning is retargeting:

<img src="https://www.wordstream.com/wp-content/uploads/2021/09/facebook-audience-types.png" width="640">




> **EXERCISE**: Find 5 examples of targeted ads across different platforms that you've come across in the past month. 

### Personalization

Machine learning is used to personalize the user experience on several products like Google, Facebook, Twitter etc.

#### Google

<img src="https://cdn1.link-assistant.com/images/news/personalized-search/screen-11.png" width="480">


#### Facebook

<img src="https://www.gannett-cdn.com/-mm-/ffa8370e75f659a649843416166c5b2d294a1cdb/c=0-2-580-328/local/-/media/2017/05/04/USATODAY/usatsports/newsfeed_large.png?width=580&height=326&fit=crop&format=pjpg&auto=webp" width="640">

#### Twitter

<img src="https://buffer.com/library/content/images/library/wp-content/uploads/2017/11/ranked-tweets.png" width="480">

> **EXERCISE**: List 5 examples of products that use personalization to improve the user experience. Can you identify the machine learning techniques used to build such a system?

### Recommendations

A common form of personalization is recommendations: suggesting content that a user is most likely to interact with, using content-based or collaborative filtering techniques.
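To illustrate the idea behind collaborative filtering, here is a deliberately tiny sketch of the user-based variant: items a user hasn't seen are scored by the ratings of similar users, weighted by cosine similarity. The ratings are made up, and production recommenders are far more elaborate than this.

```python
from math import sqrt

# Hypothetical user -> {item: rating} data.
ratings = {
    "alice": {"shoes": 5, "socks": 4, "insoles": 5},
    "bob": {"shoes": 5, "socks": 5},
    "carol": {"yoga_mat": 5, "water_bottle": 4},
    "dave": {"shoes": 4, "water_bottle": 2},
}


def cosine_similarity(a, b):
    """Cosine similarity between two sparse rating vectors (dicts)."""
    shared = set(a) & set(b)
    dot = sum(a[k] * b[k] for k in shared)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def recommend(user, data, top_n=2):
    """Score items the user hasn't rated using similar users' ratings."""
    scores = {}
    for other, other_ratings in data.items():
        if other == user:
            continue
        sim = cosine_similarity(data[user], other_ratings)
        if sim <= 0:
            continue  # ignore users with nothing in common
        for item, rating in other_ratings.items():
            if item not in data[user]:
                scores[item] = scores.get(item, 0.0) + sim * rating
    ranked = sorted(scores.items(), key=lambda pair: pair[1], reverse=True)
    return [item for item, _ in ranked[:top_n]]


print(recommend("bob", ratings))
```

Here `bob` is most similar to `alice`, so the item she rated highly that he hasn't seen ranks first.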

#### YouTube

<img src="https://cdn.vox-cdn.com/thumbor/szR7Ec1Qjx3ku0RPrpriEXM-Ze0=/0x0:1200x800/1200x800/filters:focal(504x304:696x496)/cdn.vox-cdn.com/uploads/chorus_image/image/65784178/homepage.0.jpeg" width="480">


#### Netflix

<img src="https://arc-anglerfish-arc2-prod-spectator.s3.amazonaws.com/public/DX4TP2F7ZRHEFG5GT7UTRKZ5DQ" width="640">

#### Amazon

<img src="https://www.versionmuseum.com/images/websites/amazon-website/amazon-website%5E2019%5Ehomepage-august.jpg" width="480">

#### Twitter

<img src="https://images.techhive.com/images/idge/imported/article/nww/2013/04/042913blog-twitter-follow-100274623-orig.png" width="360">

> **EXERCISE**: List 5 more examples of products that use recommendations to enhance the user experience. Can you identify the machine learning techniques used to build such a system?

### Purchase Prediction

Many sites attempt to predict how likely a user is to make a purchase and offer personalized discounts to improve the likelihood of purchase and maximize revenue. This works especially well for software products with zero marginal cost.

<img src="https://www.pushengage.com/wp-content/uploads/2022/02/Miro-Welcome-Email.png" width="400">

<img src="https://i.imgur.com/sJk61DV.png" width="360">

<img src="https://support.rechargepayments.com/hc/article_attachments/360037169933/Captura_de_pantalla_2018-11-14_a_la_s__16.11.23.png" width="480">

Here's a Kaggle competition where you can build a purchase prediction model: https://www.kaggle.com/c/allstate-purchase-prediction-challenge

> **EXERCISE**: Come up with 5 examples (real or hypothetical) where a purchase prediction model can help increase revenue for a product.

### User Segmentation

Segmentation of users is typically done by applying clustering algorithms to inputs such as demographics, income, purchase history, duration of usage etc. Based on these segments, the product experience can be customized for each set of users:

<img src="https://miro.medium.com/max/1400/1*rFATWK6tWBrDJ1o1rzEZ8w.png" width="640">
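As a sketch of clustering-based segmentation, here is a bare-bones k-means written in plain Python. The two features (monthly spend and sessions per month) and the data are hypothetical. Using the first `k` points as initial centroids is a simplification to keep the example deterministic; in practice one would scale the features and use a library implementation with a better initialization.

```python
def kmeans(points, k, iterations=20):
    """Plain k-means on tuples of floats; initial centroids are the first k points."""
    centroids = [tuple(p) for p in points[:k]]
    assignments = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        assignments = [
            min(range(k), key=lambda c: sum((x - y) ** 2 for x, y in zip(p, centroids[c])))
            for p in points
        ]
        # Update step: move each centroid to the mean of its assigned points.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:
                centroids[c] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return assignments, centroids


# Hypothetical users described by (monthly_spend, sessions_per_month).
users = [
    (10.0, 2.0), (200.0, 25.0), (12.0, 3.0),
    (220.0, 30.0), (15.0, 1.0), (210.0, 28.0),
]

labels, centers = kmeans(users, k=2)
for user, label in zip(users, labels):
    print(user, "-> segment", label)
```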

Customer segmentation is also used to determine pricing plans:

<img src="https://d2x3xhvgiqkx42.cloudfront.net/12345678-1234-1234-1234-1234567890ab/8769cf44-f342-494c-b25f-cc98c9da3e82/2018/07/04/6d10af7f-6f68-4d43-ad5f-c3daa3834ee0.png" width="640">

> **EXERCISE**: Come up with 5 examples of products (real or hypothetical) that can use segmentation to enhance the user experience. For each product, try to identify the features required to perform clustering effectively.

### Churn Prediction

Machine learning models can be used to determine if a user is about to churn (stop using/paying for a product). Here's what a typical churn prediction model might look like ([source](https://www.braze.com/docs/user_guide/predictive_suite/predictive_churn/)):

<img src="https://i.imgur.com/lsJ6lv2.png" width="640">
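To make the idea concrete, here is a toy churn model: a logistic regression trained with stochastic gradient descent in plain Python. The two features, the training data and the scaling constants are all hypothetical, and this is not the method used by the tool in the image above. A real model would use many more features, a library implementation and a held-out test set.

```python
from math import exp

# Hypothetical training data:
# (days_since_last_login, sessions_last_month) -> churned (1) or not (0).
raw = [
    ((1, 20), 0), ((2, 15), 0), ((3, 18), 0), ((5, 10), 0),
    ((20, 2), 1), ((25, 1), 1), ((28, 0), 1), ((15, 3), 1),
]


def scale(features):
    """Bring both features to a roughly 0-1 range (assumed maxima: 30 and 20)."""
    days, sessions = features
    return (days / 30.0, sessions / 20.0)


def sigmoid(z):
    return 1.0 / (1.0 + exp(-z))


def train(data, epochs=2000, lr=0.5):
    """Fit logistic regression weights with stochastic gradient descent."""
    weights = [0.0, 0.0]
    bias = 0.0
    for _ in range(epochs):
        for features, label in data:
            x = scale(features)
            pred = sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))
            error = pred - label  # gradient of the log loss w.r.t. the logit
            weights = [w - lr * error * xi for w, xi in zip(weights, x)]
            bias -= lr * error
    return weights, bias


def churn_probability(features, weights, bias):
    x = scale(features)
    return sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))


weights, bias = train(raw)
for user in [(27, 1), (2, 18)]:
    print(user, f"churn probability: {churn_probability(user, weights, bias):.2f}")
```

Users with a high predicted probability could then be targeted with reminders or offers before they leave.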

Check out this churn prediction competition on Kaggle: https://www.kaggle.com/c/customer-churn-prediction-2020

### Support Automation

Customer support is a non-scalable component of many digital products. Machine learning can be used to automate resolution of support queries, at least partially, by analyzing queries and offering solutions from an existing knowledge base using customer support chatbots:

<img src="https://cdn-images-1.medium.com/max/920/0*7Rim1MwmScAy3tIq." width="640">

Check out these tutorials on building chatbots using machine learning:

- https://towardsdatascience.com/how-to-create-a-chatbot-with-python-deep-learning-in-less-than-an-hour-56a063bdfc44
- https://towardsdatascience.com/chatbots-made-easier-with-rasa-2-0-b999323cdde

> **EXERCISE**: Identify examples of support automation systems that you have interacted with in the recent past. How satisfied were you with the support you received? How would you improve the systems you didn't feel satisfied with?

### Spam, Fraud and Abuse Detection

<img src="https://miro.medium.com/max/1400/1*Fm58r_RQ53sEHfwFa28LpA.png" width="480">

Here's a tutorial on building a spam classification model: https://towardsdatascience.com/how-to-build-and-apply-naive-bayes-classification-for-spam-filtering-2b8d3308501

<img src="https://www.researchgate.net/profile/Balaji-Gn/publication/325658124/figure/fig1/AS:635524247207936@1528532196111/The-Credit-Card-Fraud-Detection-Process.png" width="640">

Here's a dataset for training a fraud detection model: https://www.kaggle.com/mlg-ulb/creditcardfraud

> **EXERCISE**: Try to come up with 5 more applications of machine learning to predict user behavior and improve user experience.

## Summary and References

The following topics were covered in this tutorial:

- User journeys for digital products and the Pirate funnel (AAARRR)
- Key metrics & measures for different stages of the Pirate funnel
- Tools for data collection, storage, analysis and visualization
- Applying machine learning to predict behavior and improve products

Check out the following resources to learn more:

- https://amplitude.com/product-analytics
- https://growwithward.com/aaarrr-pirate-funnel/
- https://www.simplilearn.com/tutorials/machine-learning-tutorial/machine-learning-applications

## Revision Questions

1. What is product analytics?
2. What are the two ways to understand user behavior? Explain each with an example.
3. What is a user journey?
4. What does AAARRR stand for?
5. What is the Pirate funnel? What happens in each stage?
6. What is bottleneck research?
7. What is a metric?
8. Name a few organic and paid channels for generating awareness.
9. What are the different metrics used to measure the effectiveness of awareness marketing campaigns?
10. What are some advertising platforms that offer tools and dashboards for measuring (paid and organic) marketing campaigns?
11. What are the important metrics used to track acquisition?
12. What are the tools used to monitor website traffic?
13. What are the different metrics used to measure activation?
14. What are the tools used for event-based analytics and funnel tracking?
15. What are the different ways to track revenue?
16. Name a few payment gateways used for revenue tracking.
17. What are the different ways to measure retention?
18. What is churn rate?
19. What is Net Promoter Score?
20. What is cohort analysis?
21. What is a retention curve?
22. What are the different metrics used to track referrals?
23. What are the different categories of tools used for product analytics? Explain each with an example.
24. What are some applications of machine learning in product analytics?
25. What is ad targeting?
26. What is retargeting?
27. What is personalization?
28. What are recommendations? How do they work?
29. What is purchase prediction? What is its purpose?
30. What is user segmentation?
31. What is churn prediction? How does it help?
32. What is support automation?