## Analysis of Harmful Ingredients in Skincare Products and Their Impact on Customer Sentiment and Ratings


### Introduction

The beauty industry is growing rapidly, and product formulations keep evolving toward ingredients that are safer for our health. Consumers are increasingly aware of potentially harmful ingredients in the products they use, so it is important to understand how these substances affect consumer sentiment and product ratings. This project analyzes the prevalence of harmful ingredients in skincare products and assesses their impact on consumer perceptions and ratings.

In this study, we will:
1. **Identify Harmful Ingredients**: Utilize a reliable database to classify ingredients based on their safety profiles.
2. **Determine Prevalence**: Calculate how common these harmful ingredients are in various skincare products.
3. **Analyze Consumer Sentiment**: Perform sentiment analysis on consumer reviews to gauge awareness and reactions to harmful ingredients.
4. **Evaluate Product Ratings**: Assess how the presence of harmful ingredients affects product ratings.
5. **Analyze Correlations**: Explore the relationship between harmful ingredients, consumer sentiment, and product ratings.
6. **Provide Insights and Recommendations**: Offer actionable recommendations for manufacturers and regulators based on our findings.
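
As a rough illustration of steps 2, 4, and 5, the sketch below works on a small invented table. Everything in it is an assumption made for illustration: the product rows, the `flagged` watch list, and the helper `count_flagged` are hypothetical, and the `rank` column simply stands in for a product rating. It is not the analysis performed later in this notebook.

```python
import pandas as pd

# Hypothetical toy data: the values are invented, not taken from the real datasets.
products = pd.DataFrame({
    "name": ["A", "B", "C", "D"],
    "rank": [4.5, 3.9, 4.2, 3.5],  # assumed to play the role of a rating
    "ingredients": [
        "Water, Glycerin",
        "Water, Ingredient X",
        "Glycerin, Squalane",
        "Ingredient X, Ingredient Y",
    ],
})

# Hypothetical watch list of ingredient names (lowercase for matching).
flagged = {"ingredient x", "ingredient y"}

def count_flagged(ingredient_text: str) -> int:
    """Count how many distinct names in a comma-separated string are on the watch list."""
    names = {part.strip().lower() for part in ingredient_text.split(",")}
    return len(names & flagged)

products["n_flagged"] = products["ingredients"].apply(count_flagged)
products["has_flagged"] = products["n_flagged"] > 0

# Step 2: share of products containing at least one flagged ingredient.
prevalence = products["has_flagged"].mean()

# Step 4: average rating with and without flagged ingredients.
mean_rating = products.groupby("has_flagged")["rank"].mean()

# Step 5: Pearson correlation between flagged-ingredient count and rating.
correlation = products["n_flagged"].corr(products["rank"])

print(prevalence)
print(mean_rating)
print(correlation)
```

With only four invented rows the numbers mean nothing in themselves; the point is the shape of the computation. A correlation computed this way describes association only and says nothing about whether flagged ingredients cause lower ratings.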

In [1]:
pip install pandas

Note: you may need to restart the kernel to use updated packages.


In [2]:
pip install numpy 

Note: you may need to restart the kernel to use updated packages.


In [3]:
pip install matplotlib

Note: you may need to restart the kernel to use updated packages.


In [7]:
pip install xlrd

Collecting xlrd
  Obtaining dependency information for xlrd from https://files.pythonhosted.org/packages/a6/0c/c2a72d51fe56e08a08acc85d13013558a2d793028ae7385448a6ccdfae64/xlrd-2.0.1-py2.py3-none-any.whl.metadata
  Downloading xlrd-2.0.1-py2.py3-none-any.whl.metadata (3.4 kB)
Downloading xlrd-2.0.1-py2.py3-none-any.whl (96 kB)
Installing collected packages: xlrd
Successfully installed xlrd-2.0.1
Note: you may need to restart the kernel to use updated packages.


In [9]:
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns 

In [13]:
# Load the product data and the ingredient reference list into DataFrames
df_skincare = pd.read_csv('Skincare.csv')
df_ingredients = pd.read_excel('Skincare_Ingredients.xls')  # .xls files are read via xlrd

# Inspect the column types of both DataFrames
print("Skincare datasets attributes:", df_skincare.dtypes)
print("Skincare ingredients attributes:", df_ingredients.dtypes)


Skincare datasets attributes: Label           object
brand           object
name            object
price            int64
rank           float64
ingredients     object
Combination      int64
Dry              int64
Normal           int64
Oily             int64
Sensitive        int64
dtype: object
Skincare ingredients attributes: CAS                              object
List Name                        object
TSCA Chemical Name               object
List Call                        object
Caveat - Chemical Use            object
Edit Description                 object
Date of Edit             datetime64[ns]
dtype: object
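
The `ingredients` column is a single text field per product, so it has to be split into individual names before it can be compared with `List Name` in the reference list. The sketch below shows one possible way to do that on invented rows. The helper `split_ingredients` and the toy data are hypothetical, and the approach assumes names are separated by commas, which breaks for names that themselves contain a comma (for example `1,2-Hexanediol`).

```python
import pandas as pd

def split_ingredients(text):
    """Turn a comma-separated ingredient string into a list of lowercase names."""
    if not isinstance(text, str):
        return []  # missing values become an empty list
    return [part.strip().lower() for part in text.split(",") if part.strip()]

# Hypothetical rows, not taken from Skincare.csv.
toy = pd.DataFrame({
    "name": ["Product A", "Product B"],
    "ingredients": ["Water, Glycerin , Mineral Oil", None],
})

toy["ingredient_list"] = toy["ingredients"].apply(split_ingredients)

# One row per (product, ingredient); products with no ingredients are dropped.
long_form = toy.explode("ingredient_list").dropna(subset=["ingredient_list"])
print(long_form[["name", "ingredient_list"]])
```

The long form makes it straightforward to join against a reference table on the ingredient name, provided both sides are normalized the same way.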


In [14]:
df_skincare.head()

| | Label | brand | name | price | rank | ingredients | Combination | Dry | Normal | Oily | Sensitive |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Moisturizer | LA MER | Crème de la Mer | 175 | 4.1 | Algae (Seaweed) Extract, Mineral Oil, Petrolat... | 1 | 1 | 1 | 1 | 1 |
| 1 | Moisturizer | SK-II | Facial Treatment Essence | 179 | 4.1 | Galactomyces Ferment Filtrate (Pitera), Butyle... | 1 | 1 | 1 | 1 | 1 |
| 2 | Moisturizer | DRUNK ELEPHANT | Protini™ Polypeptide Cream | 68 | 4.4 | Water, Dicaprylyl Carbonate, Glycerin, Ceteary... | 1 | 1 | 1 | 1 | 0 |
| 3 | Moisturizer | LA MER | The Moisturizing Soft Cream | 175 | 3.8 | Algae (Seaweed) Extract, Cyclopentasiloxane, P... | 1 | 1 | 1 | 1 | 1 |
| 4 | Moisturizer | IT COSMETICS | Your Skin But Better™ CC+™ Cream with SPF 50+ | 38 | 4.1 | Water, Snail Secretion Filtrate, Phenyl Trimet... | 1 | 1 | 1 | 1 | 1 |


In [15]:
df_ingredients.tail()

| | CAS | List Name | TSCA Chemical Name | List Call | Caveat - Chemical Use | Edit Description | Date of Edit |
|---|---|---|---|---|---|---|---|
| 752 | 68605-97-0 | Fatty acids, tallow, hydrogenated, compds. wit... | Fatty acids, tallow, hydrogenated, compds. wit... | Green [Circle] | | Chemical added to the list | 2012-12-21 |
| 753 | 26590-05-6 | 2-Propen-1-aminium, N,N-dimethyl-N-2-propenyl-... | 2-Propen-1-aminium, N,N-dimethyl-N-2-propen-1-... | | | Chemical removed from list | 2012-12-21 |
| 754 | 68989-22-0 | Zeolites, NaA | Zeolites, NaA | Green [Circle] | | Chemical added to the list | 2012-12-21 |
| 755 | 1318-02-1 | Zeolites | Zeolites | Green [Circle] | | Chemical added to the list | 2012-12-21 |
| 756 | 27593-14-2 | Octyldimethylbetaine | 1-Octanaminium, N-(carboxymethyl)-N,N-dimethyl... | Green [Circle] | | Chemical added to the list | 2012-12-21 |
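
The `tail()` output shows that `List Call` can be empty and that `Edit Description` records removals as well as additions, so both columns are worth tallying before the list is used to classify anything. The sketch below does this on a toy frame that copies only the five rows displayed above. The variable names are hypothetical, and treating any non-green `List Call` as a sign of a harmful ingredient would be an assumption that this notebook has not yet established.

```python
import pandas as pd

# Toy frame copying the values visible in the tail() output above.
toy_list = pd.DataFrame({
    "CAS": ["68605-97-0", "26590-05-6", "68989-22-0", "1318-02-1", "27593-14-2"],
    "List Call": ["Green [Circle]", None, "Green [Circle]",
                  "Green [Circle]", "Green [Circle]"],
    "Edit Description": [
        "Chemical added to the list",
        "Chemical removed from list",
        "Chemical added to the list",
        "Chemical added to the list",
        "Chemical added to the list",
    ],
})

# Tally every List Call value, counting missing entries as their own group.
call_counts = toy_list["List Call"].value_counts(dropna=False)

# Separate entries that were removed from the list from those still on it.
removed = toy_list["Edit Description"].str.contains("removed", case=False, na=False)
active = toy_list[~removed]

# Entries still on the list whose call is anything other than green.
not_green = active[active["List Call"].ne("Green [Circle]")]

print(call_counts)
print(len(active), len(not_green))
```

On the real `df_ingredients`, the same tally would reveal which other `List Call` values exist beyond the ones visible in these five rows.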
